CuGenDBv2

Cmc06g0167791 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0167791
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr06:21126055..21127708
RNA-Seq Expression: Cmc06g0167791
Synteny: Cmc06g0167791
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR000477 - Reverse transcriptase domain
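
The genome location field above packs a sequence name and a coordinate range into one string. A minimal sketch of how it might be split and the span length computed follows; the function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    covers end - start + 1 bases.
    """
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1

if __name__ == "__main__":
    # Location string copied from the record above.
    print(parse_location("CMiso1.1chr06:21126055..21127708"))
```

Under that assumption the gene spans 1654 bases on CMiso1.1chr06.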


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042119.1 pol protein [Cucumis melo var. makuwa]; e value: 7.9e-277; % identity: 91.4
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV SW
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP+LTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK SHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV+AVTMQLAQLTVQPTLRQ+II+AQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELL +A SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKRVYWWRN+KREVAEFVS+CLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYTA
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+ VSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

KAA0048687.1 pol protein [Cucumis melo var. makuwa]; e value: 7.9e-277; % identity: 91.4
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKTELLSEA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYTA
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

KAA0057672.1 pol protein [Cucumis melo var. makuwa]; e value: 2.3e-276; % identity: 91.21
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKTELL+EA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKR+YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFI GLPRTLRGFTVIWVV +RLTKSAHFV GKSTYT 
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

KAA0062112.1 pol protein [Cucumis melo var. makuwa]; e value: 1.0e-276; % identity: 91.4
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHE +LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKTELLSEA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYT+
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

TYK01613.1 pol protein [Cucumis melo var. makuwa]; e value: 7.9e-277; % identity: 91.59
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSAVKTELLSEA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYTA
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TLA3 Pol protein; e value: 3.8e-277; % identity: 91.4
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV SW
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP+LTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK SHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV+AVTMQLAQLTVQPTLRQ+II+AQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELL +A SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKRVYWWRN+KREVAEFVS+CLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYTA
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+ VSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

A0A5A7U330 Reverse transcriptase; e value: 3.8e-277; % identity: 91.4
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKTELLSEA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYTA
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

A0A5A7UP94 Pol protein; e value: 1.1e-276; % identity: 91.21
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKTELL+EA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKR+YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFI GLPRTLRGFTVIWVV +RLTKSAHFV GKSTYT 
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

A0A5A7V1N3 Reverse transcriptase; e value: 5.0e-277; % identity: 91.4
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHE +LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKTELLSEA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYT+
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

A0A5D3BPI1 Reverse transcriptase; e value: 3.8e-277; % identity: 91.59
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVD AKIEAV  W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPD                       GKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD-----------------------GKVVAYA

Query:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLEL AVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH
        QAPLHRDLERAEIAVSV AVTMQLAQLTVQPTLRQ+II+AQ NDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSAVKTELLSEA+SSPFSMH
Subjt:  QAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA
        PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL+IPEWKWENVSMDFITGLPRTLRGFTVIWVV +RLTKSAHFV GKSTYTA
Subjt:  PGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTA

Query:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK
        SKWAQLYMSEIVRLHG+PVSIVSDRDARFTSKF K
Subjt:  SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein; e value: 5.1e-69; % identity: 30.16
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        M +G++ A A F   +N +  E  ++ V+ ++DD+LI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V+ W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------
         +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL                 V  G V           
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------

Query:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL
             ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++
Subjt:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL

Query:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA
         + +     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VV +R +K A
Subjt:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA

Query:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE
          V    + TA + A+++   ++   G P  I++D D  FTS+  K+
Subjt:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE

P0CT35 Transposon Tf2-2 polyprotein; e value: 5.1e-69; % identity: 30.16
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        M +G++ A A F   +N +  E  ++ V+ ++DD+LI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V+ W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------
         +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL                 V  G V           
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------

Query:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL
             ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++
Subjt:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL

Query:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA
         + +     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VV +R +K A
Subjt:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA

Query:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE
          V    + TA + A+++   ++   G P  I++D D  FTS+  K+
Subjt:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE

P0CT36 Transposon Tf2-3 polyprotein; e value: 5.1e-69; % identity: 30.16
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        M +G++ A A F   +N +  E  ++ V+ ++DD+LI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V+ W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------
         +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL                 V  G V           
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------

Query:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL
             ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++
Subjt:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL

Query:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA
         + +     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VV +R +K A
Subjt:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA

Query:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE
          V    + TA + A+++   ++   G P  I++D D  FTS+  K+
Subjt:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE

P0CT37 Transposon Tf2-4 polyprotein; e value: 5.1e-69; % identity: 30.16
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        M +G++ A A F   +N +  E  ++ V+ ++DD+LI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V+ W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------
         +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL                 V  G V           
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------

Query:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL
             ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++
Subjt:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL

Query:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA
         + +     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VV +R +K A
Subjt:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA

Query:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE
          V    + TA + A+++   ++   G P  I++D D  FTS+  K+
Subjt:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE

P0CT41 Transposon Tf2-12 polyprotein; e value: 5.1e-69; % identity: 30.16
Query:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW
        M +G++ A A F   +N +  E  ++ V+ ++DD+LI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V+ W
Subjt:  MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------
         +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL                 V  G V           
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL----------------TVPDGKV-----------

Query:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  -VAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL
             ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++
Subjt:  SHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKIINAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELL

Query:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA
         + +     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VV +R +K A
Subjt:  SEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSA

Query:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE
          V    + TA + A+++   ++   G P  I++D D  FTS+  K+
Subjt:  HFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKE

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value: 2.7e-25; % identity: 43.75
Query:  YLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDLAKIEAVISWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        +L MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS D AK+EA++ W  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  YLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDLAKIEAVISWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTVPDGKV
        ++    +F+ LK  + T PVL +PD K+
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPDGKV
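
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how identities and positives could be tallied from one block follows. The function name `midline_stats` is hypothetical, and the sketch assumes the query and middle line have already been cut to the same columns. The % identity values listed for each hit are computed over the whole alignment, so a single block will not necessarily reproduce them.

```python
def midline_stats(query: str, midline: str):
    """Return (identities, positives, columns) for one alignment block.

    Assumption: `query` and `midline` are column-aligned strings of equal
    length, with the 'Query:' label and leading padding already removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    identities = sum(ch.isalpha() for ch in midline)  # letters = exact matches
    positives = identities + midline.count("+")       # '+' = similar residues
    return identities, positives, len(midline)

if __name__ == "__main__":
    # Final block of the first GenBank hit (KAA0042119.1) on this page.
    query   = "SKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLK"
    midline = "SKWAQLYMSEIVRLHG+ VSIVSDRDARFTSKF K"
    ident, pos, cols = midline_stats(query, midline)
    print(f"identities {ident}/{cols} ({100 * ident / cols:.2f}%), positives {pos}/{cols}")
```

Summing the three counts over every block of a hit gives the totals for that alignment.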


Sequences
CDS sequence
ATGTCTTTTGGTTTGACGAATGCTCTGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATTGTGTTTATTGATGATGTC
TTGATATACTCCAAGACGGAGGCCGAGCATGAGGAGTATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTT
TGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTGGATCTAGCTAAGATAGAGGCAGTCATCAGTTGGACCCGACCTTCCACA
GTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGAATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGA
GCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTAAGGTAGTTGCT
TATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAATTGGAAGCAGTGGTTTTTGCATTGAAAATATGGAGGCATTACTTGTAT
GGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGGCAGCGAAGATGGCTTGAGTTAGTGAAGGAT
TACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCA
TTGCATCGAGACCTTGAGCGGGCTGAGATTGCAGTGTCAGTGAGGGCAGTCACTATGCAGTTAGCCCAATTGACGGTACAACCGACGTTGAGGCAAAAGATCATT
AATGCTCAGCGTAACGATCCATATTTGGTTGAGAAGCGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCTATATCCTCTGATGGTGGACTTTTGTTTGAG
AGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTAACAGTTCCCCATTTTCCATGCACCCAGGGAGTACGAAGATGTATCAG
GACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCA
GCGGGTTTATTACAACCCTTGAACATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGATTGCCAAGAACTCTGAGGGGTTTTACAGTGATT
TGGGTTGTGGCGAACAGACTTACCAAATCAGCGCACTTCGTCCTGGGTAAATCCACCTATACCGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGA
TTACATGGAATGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTTGAAGGAAAGTTGA
mRNA sequence
ATGTCTTTTGGTTTGACGAATGCTCTGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATTGTGTTTATTGATGATGTC
TTGATATACTCCAAGACGGAGGCCGAGCATGAGGAGTATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTT
TGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTGGATCTAGCTAAGATAGAGGCAGTCATCAGTTGGACCCGACCTTCCACA
GTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGAATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGA
GCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTAAGGTAGTTGCT
TATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAATTGGAAGCAGTGGTTTTTGCATTGAAAATATGGAGGCATTACTTGTAT
GGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGGCAGCGAAGATGGCTTGAGTTAGTGAAGGAT
TACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCA
TTGCATCGAGACCTTGAGCGGGCTGAGATTGCAGTGTCAGTGAGGGCAGTCACTATGCAGTTAGCCCAATTGACGGTACAACCGACGTTGAGGCAAAAGATCATT
AATGCTCAGCGTAACGATCCATATTTGGTTGAGAAGCGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCTATATCCTCTGATGGTGGACTTTTGTTTGAG
AGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTAACAGTTCCCCATTTTCCATGCACCCAGGGAGTACGAAGATGTATCAG
GACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCA
GCGGGTTTATTACAACCCTTGAACATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGATTGCCAAGAACTCTGAGGGGTTTTACAGTGATT
TGGGTTGTGGCGAACAGACTTACCAAATCAGCGCACTTCGTCCTGGGTAAATCCACCTATACCGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGA
TTACATGGAATGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTTGAAGGAAAGTTGA
Protein sequence
MSFGLTNALAVFMDLMNRVFREFLDTFVIVFIDDVLIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDLAKIEAVISWTRPST
VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGKVVAYASRQLKSHEQNYPTHDLELEAVVFALKIWRHYLY
GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVRAVTMQLAQLTVQPTLRQKII
NAQRNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEANSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKP
AGLLQPLNIPEWKWENVSMDFITGLPRTLRGFTVIWVVANRLTKSAHFVLGKSTYTASKWAQLYMSEIVRLHGMPVSIVSDRDARFTSKFLKES
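
The CDS and protein sequences above can be checked against each other by translating the CDS. A minimal sketch follows, assuming the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 15 bases of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

if __name__ == "__main__":
    # First 30 and last 15 bases of the CDS listed above.
    print(translate("ATGTCTTTTGGTTTGACGAATGCTCTGGCA"))  # MSFGLTNALA
    print(translate("TTGAAGGAAAGTTGA"))                 # LKES
```

Both fragments agree with the start (MSFGLTNALA) and end (LKES) of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole protein.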