CuGenDBv2

Cmc07g0189721 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc07g0189721
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr07:8453075..8455684
RNA-Seq Expression: Cmc07g0189721
Synteny: Cmc07g0189721
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
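The genome location above has the form `sequence:start..end`. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m["seqid"], int(m["start"]), int(m["end"])

seqid, start, end = parse_location("CMiso1.1chr07:8453075..8455684")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2610 positions on CMiso1.1chr07.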


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025469.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 0.0e+00  % identity: 83.41
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKGEGSRSLPQVISA+R SKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TR+VDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELK+QLQE LDKGFIRPSVSPWGAPVLFVK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKN+YPLPRIDDLF QLQ ATVFSKIDLRSGYHQLRIK GDVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------
        DILIYSKTEA  EEHLR+V QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT+                                  
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------

Query:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE
                                      KV  YAS QLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHK LKYFFTQKELNMRQRRWLE
Subjt:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE

Query:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI
        LVKDYDCEILYH GKANVVADALSRKVS+S ALITRQAPLHRDL RAEIAVSVGAVT+QLAQLT                     K GLAEAGQAV FSI
Subjt:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI

Query:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF
        SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVS+CLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDF
Subjt:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF

Query:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ
        ITGLPRTLRGFTVIWVV+DR+ KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ
Subjt:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ

Query:  VLEDMLRACALEFPGSWDSHLHLMEF
        VLEDMLRACALEFPGSWDSHLHLMEF
Subjt:  VLEDMLRACALEFPGSWDSHLHLMEF
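The unlabelled middle line of each alignment block appears to follow the usual BLAST midline convention: a letter marks an identical residue, `+` marks a similar (positively scoring) substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the Python below counts identities and positives for a short excerpt copied from the third block of the alignment above. The helper name `midline_counts` is hypothetical, and the result for a fragment is not expected to reproduce the % identity reported for the full hit.

```python
def midline_counts(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment row.

    Assumes BLAST-style midline: letter = identity, '+' = similar residue,
    space = mismatch or gap. Hypothetical helper for illustration only.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # An identity is a column where the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if q != "-" and m == q)
    # Positives are identities plus columns marked '+'.
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# First 23 columns of the block beginning 'DYRELNKVTVKN...' in the hit above.
query = "DYRELNKVTVKNRYPLPRIDDLF"
midline = "DYRELNKVTVKN+YPLPRIDDLF"

identities, positives, length = midline_counts(query, midline)
print(f"identities {identities}/{length}, positives {positives}/{length}")
```

Applying the same count across every row of a hit, and dividing by the total aligned length, is one way such a percentage could be derived; the exact denominator used by the source page is not stated here.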

KAA0031931.1 pol protein [Cucumis melo var. makuwa]  e value: 0.0e+00  % identity: 80.7
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVISA+RASKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TREVDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELKVQLQELLDKGFIRPS+SPWGAPVLFVK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF QLQ ATVFSKIDLRSGYHQLRIK GDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------
        DILIYSKTEA  EEHLR+V QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT                                   
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------

Query:  ------------------------------------------------------------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLY
                                                                     KV  YASRQLKSHEQN PTHDLELAAVVFALKIWRHYLY
Subjt:  ------------------------------------------------------------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT-----
        GEKIQIFTDHK LKYFFTQKELNMRQRRWLELVKDYDCEILYH  KANVVADALSRKVSHS ALITRQAPLHRDL+RAEIAVSVGAVT QLAQLT     
Subjt:  GEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT-----

Query:  ----------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVC
                        K GLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVSKCLVC
Subjt:  ----------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVC

Query:  QQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKF
        QQVKAPRQKPAGLLQPLSIPEWKWEN+SMDFITGLPRTLRGF VIWVV+DR+ KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIV DRD+RFTSKF
Subjt:  QQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKF

Query:  WKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN
        WKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEF YN
Subjt:  WKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN

KAA0037817.1 pol protein [Cucumis melo var. makuwa]  e value: 0.0e+00  % identity: 82.51
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKGEGS+SLPQVISA+RASKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        T+EVDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIEL+PGTVPISRA YRMAPAEL+ELKVQLQELLDKGFIRPS+SPWGAPVLFVK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF QLQ ATVFSKIDLRSGYHQLRIK GDVPKTAFRSRYGHYEFIVMSF LTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------
        DILIYSKTEA  EEHLR+V QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT                                   
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------

Query:  -----------------------------------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMR
                                            KV  YASRQLKSHEQN PTHDLELAA+VFALKIWRHYLYGEKIQIFT HK LKYFFTQKELNMR
Subjt:  -----------------------------------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMR

Query:  QRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQ
        QRRWLELVKDYDCEILYH GKANVVADALSRKVSHS ALITRQA LHRDL+RAEIAVSVGAVTMQLAQLT                     K GLAEAGQ
Subjt:  QRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQ

Query:  AVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWE
        AVEFS+SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIP+WKWE
Subjt:  AVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWE

Query:  NVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ
        NVSMDFITGLPRTLRGFTVIWVV+DR+ KSAHFVPGKSTYTASKWAQLYMSEIV+LHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDFS AFHPQTDGQ
Subjt:  NVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ

Query:  TERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN
        TERLNQVLEDMLRACALEFPGSWDSHLHLMEF YN
Subjt:  TERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN

KAA0047001.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 0.0e+00  % identity: 85.31
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQ+EIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV +NPPSM SFKFKG GS+SLPQVISA+RASKLLN      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TREVDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELKVQLQELLDKGFIRPSVSPWGAPVL VK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF QLQ ATVFSKIDLRSGYHQLRIK GDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLM RVFREFLDTF+IVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------
        DILIYSKTEA  EEHLRMV QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV VDPAKIEAVT                                   
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------

Query:  ----------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVV
                   KV  YASRQLKSHEQN PTHDLELA VVFALKIWRHYLYGEKIQIFTD+K LKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVV
Subjt:  ----------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVV

Query:  ADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA
        ADALSRKVSHS ALITRQAPLHRDL+RAEIAVSVGAVTMQLAQLT                     K GLAEAGQAVEFS+SSDGGLLFERRLCVPSDSA
Subjt:  ADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA

Query:  VKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVID
        VKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVV+D
Subjt:  VKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVID

Query:  RIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
        R+ KSAHFVPGKS YTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
Subjt:  RIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS

Query:  HLHLMEFPYN
        HLHLMEF YN
Subjt:  HLHLMEFPYN

KAA0051744.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 0.0e+00  % identity: 82.51
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVI VTL+VLDML FDVILGMDWLAANHAS+DCS KEV FNPPSM SFKFKG GS+SLPQVISA+RASKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TREVDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPIS+A YRMAPAEL+ELKVQLQELLDKGFIRPSVSPWGAPVLFVK KDG MRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF QLQ ATVFSKIDLR G+HQLRIK  DVPKTAFRSRYGHYEFI+MSFGLTNAP VFMDLMNRVF EFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------
        DILIYSKTEA  EEHLRMV QTLRDNKLYAKFSKCEFWLKQVSFL HVVSKAGVSVDPAKIEAVT+                                  
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------

Query:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE
                                      KV  YASRQLKSHEQN PTHDLELAAVVFALKIWRHYLYGEKIQIFTD+K LKYFFTQKELNMRQRRWLE
Subjt:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE

Query:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI
        LVKDYDCEILYH GKANVVADALSRKVSHS ALITRQAPLHRDL+RAEIAVSVGAVTMQLAQLT                     K GLAEAGQA EFS+
Subjt:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI

Query:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF
        SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF
Subjt:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF

Query:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ
        ITGL RTLRGFTVIWVV+DR+ KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDF TAFHPQTDGQTERLNQ
Subjt:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ

Query:  VLEDMLRACALEFPGSWDSHLHLMEFPYN
        VLEDMLRACALEFPGSWDSHLHLMEF YN
Subjt:  VLEDMLRACALEFPGSWDSHLHLMEFPYN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SJH3 Reverse transcriptase  e value: 0.0e+00  % identity: 83.41
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKGEGSRSLPQVISA+R SKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TR+VDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELK+QLQE LDKGFIRPSVSPWGAPVLFVK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKN+YPLPRIDDLF QLQ ATVFSKIDLRSGYHQLRIK GDVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------
        DILIYSKTEA  EEHLR+V QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT+                                  
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------

Query:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE
                                      KV  YAS QLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHK LKYFFTQKELNMRQRRWLE
Subjt:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE

Query:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI
        LVKDYDCEILYH GKANVVADALSRKVS+S ALITRQAPLHRDL RAEIAVSVGAVT+QLAQLT                     K GLAEAGQAV FSI
Subjt:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI

Query:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF
        SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVS+CLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDF
Subjt:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF

Query:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ
        ITGLPRTLRGFTVIWVV+DR+ KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ
Subjt:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ

Query:  VLEDMLRACALEFPGSWDSHLHLMEF
        VLEDMLRACALEFPGSWDSHLHLMEF
Subjt:  VLEDMLRACALEFPGSWDSHLHLMEF

A0A5A7U077 Reverse transcriptase  e value: 0.0e+00  % identity: 85.31
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQ+EIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV +NPPSM SFKFKG GS+SLPQVISA+RASKLLN      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TREVDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELKVQLQELLDKGFIRPSVSPWGAPVL VK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF QLQ ATVFSKIDLRSGYHQLRIK GDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLM RVFREFLDTF+IVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------
        DILIYSKTEA  EEHLRMV QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV VDPAKIEAVT                                   
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------

Query:  ----------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVV
                   KV  YASRQLKSHEQN PTHDLELA VVFALKIWRHYLYGEKIQIFTD+K LKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVV
Subjt:  ----------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVV

Query:  ADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA
        ADALSRKVSHS ALITRQAPLHRDL+RAEIAVSVGAVTMQLAQLT                     K GLAEAGQAVEFS+SSDGGLLFERRLCVPSDSA
Subjt:  ADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA

Query:  VKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVID
        VKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVV+D
Subjt:  VKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVID

Query:  RIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
        R+ KSAHFVPGKS YTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
Subjt:  RIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS

Query:  HLHLMEFPYN
        HLHLMEF YN
Subjt:  HLHLMEFPYN

A0A5A7U6Z4 Ty3-gypsy retrotransposon protein  e value: 0.0e+00  % identity: 79.88
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDML FDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKGEGSRSLPQVISA+RASKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TREVDVSLSSEPVVR+Y DVFPEEL GLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELKVQLQELLDK FI+PSVSPWGAPVLFVK KDGSMRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF +LQ ATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------
        DILIYSKTE   EEHLR+V QTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT                                   
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT-----------------------------------

Query:  ----------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVV
                   KV  Y SRQLKSHEQN PT DLELAAVVFALKIWRHYLY                                            GKANVV
Subjt:  ----------SKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVV

Query:  ADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA
        ADALSRKVSHS ALITRQAPLHRDL+RAEIAVSVGAVT+QLAQLT                     K G AEA QAVEFSISSDGGLLF RRLCVPSDSA
Subjt:  ADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA

Query:  VKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVID
        VKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVV++
Subjt:  VKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVID

Query:  RIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
        R+ KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFW+GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
Subjt:  RIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS

Query:  HLHLMEFPYN
        HLHLMEF YN
Subjt:  HLHLMEFPYN

A0A5A7U8T5 Reverse transcriptase  e value: 0.0e+00  % identity: 82.51
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD
        MLSKEKVKACQIEIAGHVI VTL+VLDML FDVILGMDWLAANHAS+DCS KEV FNPPSM SFKFKG GS+SLPQVISA+RASKLL+      LASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVD

Query:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI
        TREVDVSLSSEPVVR+Y DVFPEELPGLP HRE+EFAIELEPGTVPIS+A YRMAPAEL+ELKVQLQELLDKGFIRPSVSPWGAPVLFVK KDG MRLCI
Subjt:  TREVDVSLSSEPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLF QLQ ATVFSKIDLR G+HQLRIK  DVPKTAFRSRYGHYEFI+MSFGLTNAP VFMDLMNRVF EFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------
        DILIYSKTEA  EEHLRMV QTLRDNKLYAKFSKCEFWLKQVSFL HVVSKAGVSVDPAKIEAVT+                                  
Subjt:  DILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS----------------------------------

Query:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE
                                      KV  YASRQLKSHEQN PTHDLELAAVVFALKIWRHYLYGEKIQIFTD+K LKYFFTQKELNMRQRRWLE
Subjt:  ------------------------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLE

Query:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI
        LVKDYDCEILYH GKANVVADALSRKVSHS ALITRQAPLHRDL+RAEIAVSVGAVTMQLAQLT                     K GLAEAGQA EFS+
Subjt:  LVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSI

Query:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF
        SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF
Subjt:  SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDF

Query:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ
        ITGL RTLRGFTVIWVV+DR+ KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDF TAFHPQTDGQTERLNQ
Subjt:  ITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQ

Query:  VLEDMLRACALEFPGSWDSHLHLMEFPYN
        VLEDMLRACALEFPGSWDSHLHLMEF YN
Subjt:  VLEDMLRACALEFPGSWDSHLHLMEFPYN

A0A5A7UL17 Reverse transcriptase  e value: 0.0e+00  % identity: 79.46
Query:  QIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVDTREVDVSLSS
        +IEIAGHVI+VTLLVLDML FDVILG+DWLAANHASIDCS KEVAFNP SMVSFKFKGEGSRSLPQVISAMRASKLL+      LASVVDTREVDVSLSS
Subjt:  QIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLN------LASVVDTREVDVSLSS

Query:  EPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTV
        EPVVR+Y DVFPEEL GLP HRE+EFAIELEPGTVPISRA YRMAPAEL+ELKVQLQELLDKGFIRPSVSPWGAPVLFVK KDGSMRLCIDYRELNK   
Subjt:  EPVVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEA
                                         LRIK GDVPKT FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFI DILIYSKTEA
Subjt:  KNRYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEA

Query:  GQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS--------------------------------------------
          EEHLRMV +TL DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS                                            
Subjt:  GQEEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTS--------------------------------------------

Query:  --------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKA
                      KV  YASRQLKSHEQN PTHDLE AAVVFALKIWRHYLYGEKIQIFTDHK LKYFFTQKELNMRQRRWLELVKDYDCEILYH GKA
Subjt:  --------------KVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKA

Query:  NVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPS
        NVVADALSRKVSHS AL+TRQAPLHRDL+RAEIAVSVGAVTMQLAQLT                     K GLAEAGQA+EFSISSDGGLLFERRLCVPS
Subjt:  NVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLT---------------------KCGLAEAGQAVEFSISSDGGLLFERRLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
        DSA+KTELLS+AHSSPFSMHP STKMYQDLKRV WWRNMKREVAEFVSKCLVC+QVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Subjt:  DSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV

Query:  VIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGS
        V+DR+ KSAHFVPGKS YTASKWAQLYMSEIVRLHGVPVSIVSDRD+RFTSKFWKGLQTAMGTRLDF+TAFHPQTDGQTERLNQVLEDMLRACALEFPGS
Subjt:  VIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGS

Query:  WDSHLHLMEFPYN
        WDSHLHLMEF YN
Subjt:  WDSHLHLMEFPYN

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein  e value: 1.6e-87  % identity: 29.12
Query:  IEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQLQRATVFS
        +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV  K+G++R+ +DY+ LNK    N YPLP I+ L  ++Q +T+F+
Subjt:  IEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQLQRATVFS

Query:  KIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFS
        K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+   +H++ V Q L++  L    +
Subjt:  KIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGV-----SVD---------------------------------------------------PAKIEAV-------------
        KCEF   QV F+G+ +S+ G      ++D                                                   P + +A+             
Subjt:  KCEFWLKQVSFLGHVVSKAGV-----SVD---------------------------------------------------PAKIEAV-------------

Query:  --------------TSKVAT-----------------YASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKILKYFFTQKE--LNM
                       S VA                  Y S ++   + N    D E+ A++ +LK WRHYL    E  +I TDH+ L    T +    N 
Subjt:  --------------TSKVAT-----------------YASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKILKYFFTQKE--LNM

Query:  RQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEI----AVSV-----GAVTMQLAQLTKC--GLAEAGQAVEFSISSD
        R  RW   ++D++ EI Y  G AN +ADALSR       ++    P+ +D +   I     +S+       V  +    TK    L    + VE +I   
Subjt:  RQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEI----AVSV-----GAVTMQLAQLTKC--GLAEAGQAVEFSISSD

Query:  GGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI
         GLL   +  + +P+D+ +   ++ + H     +HP    +   + R   W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFI
Subjt:  GGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI

Query:  TGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQV
        T LP +  G+  ++VV+DR  K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ 
Subjt:  TGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQV

Query:  LEDMLRACALEFPGSWDSHLHLMEFPYN
        +E +LR      P +W  H+ L++  YN
Subjt:  LEDMLRACALEFPGSWDSHLHLMEFPYN

P0CT35 Transposon Tf2-2 polyprotein  e value: 1.6e-87  % identity: 29.12
Query:  IEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQLQRATVFS
        +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV  K+G++R+ +DY+ LNK    N YPLP I+ L  ++Q +T+F+
Subjt:  IEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQLQRATVFS

Query:  KIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFS
        K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+   +H++ V Q L++  L    +
Subjt:  KIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGV-----SVD---------------------------------------------------PAKIEAV-------------
        KCEF   QV F+G+ +S+ G      ++D                                                   P + +A+             
Subjt:  KCEFWLKQVSFLGHVVSKAGV-----SVD---------------------------------------------------PAKIEAV-------------

Query:  --------------TSKVAT-----------------YASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKILKYFFTQKE--LNM
                       S VA                  Y S ++   + N    D E+ A++ +LK WRHYL    E  +I TDH+ L    T +    N 
Subjt:  --------------TSKVAT-----------------YASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKILKYFFTQKE--LNM

Query:  RQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEI----AVSV-----GAVTMQLAQLTKC--GLAEAGQAVEFSISSD
        R  RW   ++D++ EI Y  G AN +ADALSR       ++    P+ +D +   I     +S+       V  +    TK    L    + VE +I   
Subjt:  RQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEI----AVSV-----GAVTMQLAQLTKC--GLAEAGQAVEFSISSD

Query:  GGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI
         GLL   +  + +P+D+ +   ++ + H     +HP    +   + R   W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFI
Subjt:  GGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI

Query:  TGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQV
        T LP +  G+  ++VV+DR  K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ 
Subjt:  TGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQV

Query:  LEDMLRACALEFPGSWDSHLHLMEFPYN
        +E +LR      P +W  H+ L++  YN
Subjt:  LEDMLRACALEFPGSWDSHLHLMEFPYN

P0CT41 Transposon Tf2-12 polyprotein    e value: 1.6e-87    %identity: 29.12
Query:  IEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQLQRATVFS
        +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV  K+G++R+ +DY+ LNK    N YPLP I+ L  ++Q +T+F+
Subjt:  IEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQLQRATVFS

Query:  KIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFS
        K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+   +H++ V Q L++  L    +
Subjt:  KIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGV-----SVD---------------------------------------------------PAKIEAV-------------
        KCEF   QV F+G+ +S+ G      ++D                                                   P + +A+             
Subjt:  KCEFWLKQVSFLGHVVSKAGV-----SVD---------------------------------------------------PAKIEAV-------------

Query:  --------------TSKVAT-----------------YASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKILKYFFTQKE--LNM
                       S VA                  Y S ++   + N    D E+ A++ +LK WRHYL    E  +I TDH+ L    T +    N 
Subjt:  --------------TSKVAT-----------------YASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKILKYFFTQKE--LNM

Query:  RQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEI----AVSV-----GAVTMQLAQLTKC--GLAEAGQAVEFSISSD
        R  RW   ++D++ EI Y  G AN +ADALSR       ++    P+ +D +   I     +S+       V  +    TK    L    + VE +I   
Subjt:  RQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEI----AVSV-----GAVTMQLAQLTKC--GLAEAGQAVEFSISSD

Query:  GGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI
         GLL   +  + +P+D+ +   ++ + H     +HP    +   + R   W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFI
Subjt:  GGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI

Query:  TGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQV
        T LP +  G+  ++VV+DR  K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ 
Subjt:  TGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQV

Query:  LEDMLRACALEFPGSWDSHLHLMEFPYN
        +E +LR      P +W  H+ L++  YN
Subjt:  LEDMLRACALEFPGSWDSHLHLMEFPYN

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein    e value: 3.2e-88    %identity: 30.16
Query:  VVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKN
        ++RN     P ++  +P+  +    IE++PG        Y +     +E+   +Q+LLD  FI PS SP  +PV+ V  KDG+ RLC+DYR LNK T+ +
Subjt:  VVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKN

Query:  RYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQ
         +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++    
Subjt:  RYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQ

Query:  EEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS--------------------------------------------------------------
         +HL  V + L++  L  K  KC+F  ++  FLG+ +                                                               
Subjt:  EEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS--------------------------------------------------------------

Query:  ---------KAGVSVDP-----------------------AKIEAVTSK-----VATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFT
                 KA +   P                       A +E V +K     V  Y S+ L+S ++N P  +LEL  ++ AL  +R+ L+G+   + T
Subjt:  ---------KAGVSVDP-----------------------AKIEAVTSK-----VATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFT

Query:  DHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLTKCGLA--------
        DH  L     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR +   T   +R           +      AV + + +LT+  +         
Subjt:  DHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLTKCGLA--------

Query:  ------EAGQAVEFSIS-SDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLL
              E  +    + S  D  + ++ RL VP         L   H+  F  H   T     +  + +W  ++  + +++  C+ CQ +K+ R +  GLL
Subjt:  ------EAGQAVEFSIS-SDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLL

Query:  QPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDF
        QPL I E +W ++SMDF+TGLP T     +I VV+DR  K AHF+  + T  A++   L    I   HG P +I SDRD R T+  ++ L   +G +   
Subjt:  QPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDF

Query:  STAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN
        S+A HPQTDGQ+ER  Q L  +LRA       +W  +L  +EF YN
Subjt:  STAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN

Q99315 Transposon Ty3-G Gag-Pol polyprotein    e value: 1.1e-88    %identity: 30.29
Query:  VVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKN
        ++RN     P ++  +P+  +    IE++PG        Y +     +E+   +Q+LLD  FI PS SP  +PV+ V  KDG+ RLC+DYR LNK T+ +
Subjt:  VVRNYSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKN

Query:  RYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQ
         +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++    
Subjt:  RYPLPRIDDLFYQLQRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQ

Query:  EEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVV--------------------------------------------------------------S
         +HL  V + L++  L  K  KC+F  ++  FLG+ +                                                               
Subjt:  EEHLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLGHVV--------------------------------------------------------------S

Query:  KAGVSVDPAK--------------------------------IEAVTSK-----VATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFT
        K   ++D  K                                +E V +K     V  Y S+ L+S ++N P  +LEL  ++ AL  +R+ L+G+   + T
Subjt:  KAGVSVDPAK--------------------------------IEAVTSK-----VATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFT

Query:  DHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLTKCGLA--------
        DH  L     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR V   T   +R           +      AV + + +LT+  +         
Subjt:  DHKILKYFFTQKELNMRQRRWLELVKDYDCEILYHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLTKCGLA--------

Query:  ------EAGQAVEFSIS-SDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLL
              E  +    + S  D  + ++ RL VP         L   H+  F  H   T     +  + +W  ++  + +++  C+ CQ +K+ R +  GLL
Subjt:  ------EAGQAVEFSIS-SDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTKMYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLL

Query:  QPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDF
        QPL I E +W ++SMDF+TGLP T     +I VV+DR  K AHF+  + T  A++   L    I   HG P +I SDRD R T+  ++ L   +G +   
Subjt:  QPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDF

Query:  STAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN
        S+A HPQTDGQ+ER  Q L  +LRA A     +W  +L  +EF YN
Subjt:  STAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYN

Arabidopsis top hits    e value    %identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    e value: 2.7e-05    %identity: 46.15
Query:  HLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAV
        HL MV Q    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+
Subjt:  HLRMVWQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAV


Sequences
CDS sequence
ATGTTGTCGAAGGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGCTGTTAGTCCTGGACATGCTCCACTTTGATGTAATTCTGGGTAT
GGATTGGTTGGCCGCTAACCATGCCAGCATAGATTGTTCCAGTAAGGAGGTAGCGTTTAACCCTCCATCGATGGTCAGTTTTAAATTTAAGGGAGAAGGGTCAAGGTCGT
TACCTCAGGTAATCTCAGCAATGAGGGCCAGCAAACTGCTCAATTTAGCGAGCGTGGTGGATACTAGAGAGGTTGATGTATCCCTGTCATCAGAACCAGTGGTGAGGAAC
TATTCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCTTCACAGAGAGATTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCTATACAG
AATGGCCCCAGCAGAGTTGGAAGAACTGAAAGTGCAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTA
AGAATAAGGATGGATCGATGCGTCTATGCATTGACTACAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGATGATCTGTTTTACCAGTTA
CAGAGAGCTACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGTTGAGGATTAAGGGTGGTGATGTACCGAAGACAGCATTTCGTTCCAGATACGGACACTA
TGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCTTAGACACTTTTGTGATCGTGTTTATTG
ATGATATCTTGATATATTCCAAGACGGAGGCCGGGCAAGAGGAGCATTTACGTATGGTTTGGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAG
TTTTGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCCAAGGCTGGAGTTTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCAGTAAGGTAGCAACTTATGCTTC
GCGTCAGTTGAAGAGTCATGAGCAGAATTGCCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAAATATGGAGGCATTACTTGTATGGTGAAAAGATAC
AGATTTTCACGGATCATAAGATCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGACTGTGAGATATTG
TATCATCAAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAACAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGACCGGGC
TGAAATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAATTGACGAAGTGTGGCCTAGCAGAGGCAGGGCAAGCAGTTGAGTTCTCCATATCCTCTGATGGTG
GACTTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCGGAGGCTCATAGTTCCCCATTTTCCATGCACCCAATTAGTACGAAG
ATGTATCAGGACCTGAAGCGGGTTTGTTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAA
ACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCAAGAACTCTGAGGGGTTTTACAGTGATTT
GGGTTGTGATTGATAGGATTATCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACCGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTAAGACTACAT
GGAGTGCCGGTGTCGATTGTTTCTGATAGAGATTCTCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCA
TCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAAGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGG
AATTTCCTTATAATGTGGTAGATCCCCTGTTTGCTGGGGTGAGGTGGGTAAGGAGAGATTGA
mRNA sequence
ATGTTGTCGAAGGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGCTGTTAGTCCTGGACATGCTCCACTTTGATGTAATTCTGGGTAT
GGATTGGTTGGCCGCTAACCATGCCAGCATAGATTGTTCCAGTAAGGAGGTAGCGTTTAACCCTCCATCGATGGTCAGTTTTAAATTTAAGGGAGAAGGGTCAAGGTCGT
TACCTCAGGTAATCTCAGCAATGAGGGCCAGCAAACTGCTCAATTTAGCGAGCGTGGTGGATACTAGAGAGGTTGATGTATCCCTGTCATCAGAACCAGTGGTGAGGAAC
TATTCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCTTCACAGAGAGATTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCTATACAG
AATGGCCCCAGCAGAGTTGGAAGAACTGAAAGTGCAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTA
AGAATAAGGATGGATCGATGCGTCTATGCATTGACTACAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGATGATCTGTTTTACCAGTTA
CAGAGAGCTACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGTTGAGGATTAAGGGTGGTGATGTACCGAAGACAGCATTTCGTTCCAGATACGGACACTA
TGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCTTAGACACTTTTGTGATCGTGTTTATTG
ATGATATCTTGATATATTCCAAGACGGAGGCCGGGCAAGAGGAGCATTTACGTATGGTTTGGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAG
TTTTGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCCAAGGCTGGAGTTTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCAGTAAGGTAGCAACTTATGCTTC
GCGTCAGTTGAAGAGTCATGAGCAGAATTGCCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAAATATGGAGGCATTACTTGTATGGTGAAAAGATAC
AGATTTTCACGGATCATAAGATCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGACTGTGAGATATTG
TATCATCAAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAACAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGACCGGGC
TGAAATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAATTGACGAAGTGTGGCCTAGCAGAGGCAGGGCAAGCAGTTGAGTTCTCCATATCCTCTGATGGTG
GACTTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCGGAGGCTCATAGTTCCCCATTTTCCATGCACCCAATTAGTACGAAG
ATGTATCAGGACCTGAAGCGGGTTTGTTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAA
ACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCAAGAACTCTGAGGGGTTTTACAGTGATTT
GGGTTGTGATTGATAGGATTATCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACCGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTAAGACTACAT
GGAGTGCCGGTGTCGATTGTTTCTGATAGAGATTCTCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCA
TCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAAGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGG
AATTTCCTTATAATGTGGTAGATCCCCTGTTTGCTGGGGTGAGGTGGGTAAGGAGAGATTGA
Protein sequence
MLSKEKVKACQIEIAGHVIEVTLLVLDMLHFDVILGMDWLAANHASIDCSSKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLNLASVVDTREVDVSLSSEPVVRN
YSDVFPEELPGLPLHREIEFAIELEPGTVPISRALYRMAPAELEELKVQLQELLDKGFIRPSVSPWGAPVLFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFYQL
QRATVFSKIDLRSGYHQLRIKGGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAGQEEHLRMVWQTLRDNKLYAKFSKCE
FWLKQVSFLGHVVSKAGVSVDPAKIEAVTSKVATYASRQLKSHEQNCPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKILKYFFTQKELNMRQRRWLELVKDYDCEIL
YHQGKANVVADALSRKVSHSTALITRQAPLHRDLDRAEIAVSVGAVTMQLAQLTKCGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPISTK
MYQDLKRVCWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVIDRIIKSAHFVPGKSTYTASKWAQLYMSEIVRLH
GVPVSIVSDRDSRFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFPYNVVDPLFAGVRWVRRD