; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0025541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0025541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:26359548..26361011
RNA-Seq ExpressionCmc01g0025541
SyntenyCmc01g0025541
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026081.1 pol protein [Cucumis melo var. makuwa]4.2e-24593.51Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPS+ASFKFRG GIVCIPKVISAMKA+KLLSQGTWGILASVVDTREPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPGFPFSREIDFAIELEPDTTPI RA YRMA AELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDG MRLCIDYRELNK+ +++ Y   
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
        RI D     DIPKTAFRS YGHYEFIVMSFGLT+APAVFMDLMN+VFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
Subjt:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK

Query:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG
        TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGL  YYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTV DGSG
Subjt:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE
        SFVIYSDASKKGLGCVLMQQDRVVAYA RQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG+
Subjt:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE

KAA0058812.1 pol protein [Cucumis melo var. makuwa]3.5e-23986.45Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHA+IDCFGKEVVFNPPS ASFKFRG GIVCIPKVISAMKASKLLSQGTWGILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPG P  RE+DFAIELEP T PISRA YRMA AELKE KVQLQELLDK FI+ SVSPWGAPVLFVKKKDG MRLCIDYRELNKVT+KNRYPLP
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY
        RIDDLFDQ                        DIPKTAF SRYGHYEF+VMSFGLTNAPAVFMDLMNRVFKDF+D+FVIVFIDDILIYSKTE EHEEHL+
Subjt:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY

Query:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE
        +VLE LRANKLYAKFSKCEFWL+ VTFL HVVSSEGVSVDP KIEA+T+WPRPST+SEIRSFLGL  YYRRFVEDFSRIASP TQLTRKGTPFVWSPACE
Subjt:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI
        SSFQELKQKLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELA VVFALKIWRHYLYGEKI
Subjt:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI

KAA0060745.1 pol protein [Cucumis melo var. makuwa]1.6e-23986.86Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHA+IDCFGKEVVFNPPS ASFKFRG G+VCIPKVISAMKASKLLSQGTWGILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPG P  RE+DFAIELEP T PISRA YRMA AELKE KVQLQELLDK FI+ SVSPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY
        +IDDLFDQ                        DIPKTAFRSRYGHYEF+VMSFGLTNAPAVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTE EHEEHL+
Subjt:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY

Query:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE
        +VLE LRANKLYAKFSKCEFWL+ VTFL HVVSSEGVSVDP KIEA+T+WPRPST+SEIRSFLGL  YYRRFVEDFSRIASP TQLTRKGTPFVWSPACE
Subjt:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI
        SSFQELKQKLV APVLTVPDGSG+FVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI

KAA0063793.1 pol protein [Cucumis melo var. makuwa]5.9e-23986.45Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHA+IDCFGKEVVFNPPS+ASFKFRG G+VCIPKVISAMKASKLLS GTWGILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPD+LPG P  RE+DFAIELEP T PISRA YRMA AELKE KVQLQELLDK FI+ SVSPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY
        RIDDLFDQ                        DIPKTAFRSRYGHYEF+VMSFGLTNAPAVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTE EHEEHL+
Subjt:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY

Query:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE
        +VLE LRANKLYAKFSKCEFWL+ VTFL HVVSSEGVSVDP KIEA+T+WPRPST+SEIRSFLGL  YYRRFVEDFSRIASP TQLTRKGTPFVWSPACE
Subjt:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI
         SFQELKQKLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI

TYJ96331.1 pol protein [Cucumis melo var. makuwa]4.2e-24593.51Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPS+ASFKFRG GIVCIPKVISAMKA+KLLSQGTWGILASVVDTREPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPGFPFSREIDFAIELEPDTTPI RA YRMA AELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDG MRLCIDYRELNK+ +++ Y   
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
        RI D     DIPKTAFRS YGHYEFIVMSFGLT+APAVFMDLMN+VFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
Subjt:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK

Query:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG
        TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGL  YYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTV DGSG
Subjt:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE
        SFVIYSDASKKGLGCVLMQQDRVVAYA RQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG+
Subjt:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE

TrEMBL top hitse value%identityAlignment
A0A5A7SPG1 Pol protein2.1e-24593.51Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPS+ASFKFRG GIVCIPKVISAMKA+KLLSQGTWGILASVVDTREPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPGFPFSREIDFAIELEPDTTPI RA YRMA AELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDG MRLCIDYRELNK+ +++ Y   
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
        RI D     DIPKTAFRS YGHYEFIVMSFGLT+APAVFMDLMN+VFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
Subjt:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK

Query:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG
        TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGL  YYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTV DGSG
Subjt:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE
        SFVIYSDASKKGLGCVLMQQDRVVAYA RQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG+
Subjt:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE

A0A5A7USG7 Reverse transcriptase1.7e-23986.45Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHA+IDCFGKEVVFNPPS ASFKFRG GIVCIPKVISAMKASKLLSQGTWGILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPG P  RE+DFAIELEP T PISRA YRMA AELKE KVQLQELLDK FI+ SVSPWGAPVLFVKKKDG MRLCIDYRELNKVT+KNRYPLP
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY
        RIDDLFDQ                        DIPKTAF SRYGHYEF+VMSFGLTNAPAVFMDLMNRVFKDF+D+FVIVFIDDILIYSKTE EHEEHL+
Subjt:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY

Query:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE
        +VLE LRANKLYAKFSKCEFWL+ VTFL HVVSSEGVSVDP KIEA+T+WPRPST+SEIRSFLGL  YYRRFVEDFSRIASP TQLTRKGTPFVWSPACE
Subjt:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI
        SSFQELKQKLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELA VVFALKIWRHYLYGEKI
Subjt:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI

A0A5A7V4E4 Reverse transcriptase7.5e-24086.86Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHA+IDCFGKEVVFNPPS ASFKFRG G+VCIPKVISAMKASKLLSQGTWGILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPG P  RE+DFAIELEP T PISRA YRMA AELKE KVQLQELLDK FI+ SVSPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY
        +IDDLFDQ                        DIPKTAFRSRYGHYEF+VMSFGLTNAPAVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTE EHEEHL+
Subjt:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY

Query:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE
        +VLE LRANKLYAKFSKCEFWL+ VTFL HVVSSEGVSVDP KIEA+T+WPRPST+SEIRSFLGL  YYRRFVEDFSRIASP TQLTRKGTPFVWSPACE
Subjt:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI
        SSFQELKQKLV APVLTVPDGSG+FVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI

A0A5A7V6R2 Reverse transcriptase2.9e-23986.45Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHA+IDCFGKEVVFNPPS+ASFKFRG G+VCIPKVISAMKASKLLS GTWGILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPD+LPG P  RE+DFAIELEP T PISRA YRMA AELKE KVQLQELLDK FI+ SVSPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY
        RIDDLFDQ                        DIPKTAFRSRYGHYEF+VMSFGLTNAPAVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTE EHEEHL+
Subjt:  RIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLY

Query:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE
        +VLE LRANKLYAKFSKCEFWL+ VTFL HVVSSEGVSVDP KIEA+T+WPRPST+SEIRSFLGL  YYRRFVEDFSRIASP TQLTRKGTPFVWSPACE
Subjt:  RVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI
         SFQELKQKLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  SSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKI

A0A5D3BCJ4 Pol protein2.1e-24593.51Show/hide
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPS+ASFKFRG GIVCIPKVISAMKA+KLLSQGTWGILASVVDTREPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP
        PDVFPDELPGFPFSREIDFAIELEPDTTPI RA YRMA AELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDG MRLCIDYRELNK+ +++ Y   
Subjt:  PDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLP

Query:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
        RI D     DIPKTAFRS YGHYEFIVMSFGLT+APAVFMDLMN+VFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK
Subjt:  RIDDLFDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLK

Query:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG
        TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGL  YYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTV DGSG
Subjt:  TVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE
        SFVIYSDASKKGLGCVLMQQDRVVAYA RQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG+
Subjt:  SFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.4e-6435.44Show/hide
Query:  VVREYPDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGL-----MRLCIDYRELNK
        ++++Y D+   E     F+ +    I  + +    S+  Y  A  +  ES  Q+Q++L++  I+ S SP+ +P+  V KK         R+ IDYR+LN+
Subjt:  VVREYPDVFPDELPGFPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGL-----MRLCIDYRELNK

Query:  VTVKNRYPLPRIDDL-----------------------FDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSK
        +TV +R+P+P +D++                        D   + KTAF +++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S 
Subjt:  VTVKNRYPLPRIDDL-----------------------FDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSK

Query:  TEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKG
        +  EH + L  V E L    L  +  KCEF  +  TFL HV++ +G+  +P KIEAI  +P P+   EI++FLGL  YYR+F+ +F+ IA P T+  +K 
Subjt:  TEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKG

Query:  TPF-VWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG
              +P  +S+F++LK  +   P+L VPD +  F + +DAS   LG VL Q    ++Y SR L  HE NY T + EL A+V+A K +RHYL G
Subjt:  TPF-VWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG

P20825 Retrovirus-related Pol polyprotein from transposon 2975.0e-6336.16Show/hide
Query:  TPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKD-----GLMRLCIDYRELNKVTVKNRYPLPRIDDL-----------------
        +PI    Y +A     E + Q+QE+L++  I+ S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++                 
Subjt:  TPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKD-----GLMRLCIDYRELNKVTVKNRYPLPRIDDL-----------------

Query:  ------FDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWL
               D+  I KTAF ++ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+I+S +  EH   +  V   L    L  +  KCEF  
Subjt:  ------FDQFDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWL

Query:  KTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDG
        K   FL H+V+ +G+  +P+K++AI S+P P+   EIR+FLGL  YYR+F+ +++ IA P T   +K T           +F++LK  ++  P+L +PD 
Subjt:  KTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDG

Query:  SGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEK
           FV+ +DAS   LG VL Q    +++ SR L  HE NY   + EL A+V+A K +RHYL G +
Subjt:  SGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.8e-5833.03Show/hide
Query:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGFPF---SREIDFAIELEPDTTPISRALYRMASAELKESKVQLQEL
        +AS L   G +  + S + + EP  +  S           + ++Y ++  ++LP  P    +  +   IE++P         Y +     +E    +Q+L
Subjt:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGFPF---SREIDFAIELEPDTTPISRALYRMASAELKESKVQLQEL

Query:  LDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPRIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMS
        LD  FI  S SP  +PV+ V KKDG  RLC+DYR LNK T+ + +PLPRID+L  +                        D  KTAF +  G YE+ VM 
Subjt:  LDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPRIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMS

Query:  FGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPR
        FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F  +   FL + +  + ++    K  AI  +P 
Subjt:  FGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPR

Query:  PSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDR------V
        P T+ + + FLG+++YYRRF+ + S+IA P        +   W+   + + ++LK  L ++PVL   +   ++ + +DASK G+G VL + D       V
Subjt:  PSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDR------V

Query:  VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE
        V Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+
Subjt:  VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.0e-5530.91Show/hide
Query:  DVTLLVL-DMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLS----QGTWGILASVVDTREPEVSLSSEPVV
        D+T  VL ++  FD I+G D L    A +D     ++  P         GI I  + +  +++  + LL+     GT  IL S++               
Subjt:  DVTLLVL-DMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLS----QGTWGILASVVDTREPEVSLSSEPVV

Query:  REYPDVFPDELPGFPFSREIDFAIELEPDT-TPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKK-----DGLMRLCIDYRELNKV
         E+P +F   L G   S E     E+  +T  PI    Y        E + Q+ ELL    I+ S SP+ +P+  V KK     +   R+ +D++ LN V
Subjt:  REYPDVFPDELPGFPFSREIDFAIELEPDT-TPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKK-----DGLMRLCIDYRELNKV

Query:  TVKNRYPLPRID---------------DLFDQF--------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKT
        T+ + YP+P I+               DL   F        DIPKTAF +  G YEF+ + FGL NAPA+F  +++ + ++ +     V+IDDI+++S+ 
Subjt:  TVKNRYPLPRID---------------DLFDQF--------DIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKT

Query:  EVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTR---
           H ++L  VL  L    L     K  F    V FL ++V+++G+  DP K+ AI+  P P+++ E++ FLG+  YYR+F++D++++A P T LTR   
Subjt:  EVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTR---

Query:  --------KGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ----QDRVVAYASRQLKSHEQNYPTHDLELAAVVFALK
                   P         SF +LK  L S+ +L  P  +  F + +DAS   +G VL Q    +DR +AY SR L   E+NY T + E+ A++++L 
Subjt:  --------KGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ----QDRVVAYASRQLKSHEQNYPTHDLELAAVVFALK

Query:  IWRHYLYG
          R YLYG
Subjt:  IWRHYLYG

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.7e-5833.03Show/hide
Query:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGFPF---SREIDFAIELEPDTTPISRALYRMASAELKESKVQLQEL
        +AS L   G +  + S + + EP  +  S           + ++Y ++  ++LP  P    +  +   IE++P         Y +     +E    +Q+L
Subjt:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGFPF---SREIDFAIELEPDTTPISRALYRMASAELKESKVQLQEL

Query:  LDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPRIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMS
        LD  FI  S SP  +PV+ V KKDG  RLC+DYR LNK T+ + +PLPRID+L  +                        D  KTAF +  G YE+ VM 
Subjt:  LDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPRIDDLFDQF-----------------------DIPKTAFRSRYGHYEFIVMS

Query:  FGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPR
        FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F  +   FL + +  + ++    K  AI  +P 
Subjt:  FGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPR

Query:  PSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDR------V
        P T+ + + FLG+++YYRRF+ + S+IA P        +   W+   + +  +LK  L ++PVL   +   ++ + +DASK G+G VL + D       V
Subjt:  PSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDR------V

Query:  VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE
        V Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+
Subjt:  VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-2340.46Show/hide
Query:  HLYRVLEILRANKLYAKFSKCEFWLKTVTFLD--HVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVW
        HL  VL+I   ++ YA   KC F    + +L   H++S EGVS DP K+EA+  WP P   +E+R FLGL  YYRRFV+++ +I  P T+L +K +   W
Subjt:  HLYRVLEILRANKLYAKFSKCEFWLKTVTFLD--HVVSSEGVSVDPVKIEAITSWPRPSTISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVW

Query:  SPACESSFQELKQKLVSAPVLTVPDGSGSFV
        +     +F+ LK  + + PVL +PD    FV
Subjt:  SPACESSFQELKQKLVSAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGATGTGACCTTACTAGTGTTAGACATGCAGGACTTTGATGTAATCCTAGGCATGGATTGGTTGTCTGCTAACCATGCAAGTATAGACTGTTTCGGTAAGGAAGT
CGTTTTTAACCCTCCCTCTAAGGCTAGTTTCAAATTTAGGGGGATAGGAATTGTATGTATACCCAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTCAGCCAGGGTA
CTTGGGGCATCTTGGCAAGCGTAGTAGATACCAGAGAACCAGAAGTTTCCCTGTCCTCCGAACCAGTGGTAAGGGAGTACCCCGATGTTTTCCCCGACGAGCTTCCTGGA
TTTCCGTTTTCTAGGGAGATAGACTTCGCCATCGAGTTAGAGCCGGACACTACTCCTATCTCAAGGGCCCTTTACAGAATGGCTTCAGCCGAGCTGAAAGAGTCGAAGGT
ACAGCTGCAGGAGTTGCTGGACAAGGATTTCATCCAACTCAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAAGGATGGGTTGATGCGCCTTTGCATTG
ACTACAGAGAGTTGAACAAGGTGACAGTCAAGAACCGTTACCCCTTGCCCAGGATTGATGACTTGTTCGATCAGTTTGATATTCCTAAGACGGCTTTCCGTTCTAGATAC
GGGCACTACGAGTTCATCGTGATGTCCTTTGGTTTGACTAATGCTCCTGCGGTATTCATGGACTTGATGAACAGGGTGTTTAAGGATTTCTTAGACACGTTTGTCATAGT
TTTCATTGACGACATTTTGATTTACTCCAAGACTGAGGTCGAGCACGAGGAGCATTTGTACCGGGTTTTGGAGATTCTTCGAGCCAATAAGTTGTACGCCAAGTTCTCAA
AGTGTGAGTTCTGGCTAAAAACGGTGACTTTCCTCGACCACGTGGTTTCCAGTGAGGGAGTTTCTGTAGACCCGGTAAAGATCGAAGCGATTACCAGTTGGCCTCGACCG
TCGACAATTAGCGAGATTCGTAGTTTCTTGGGTTTGGTAGATTACTACAGGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTGGACTCAGTTGACCAGGAAGGG
GACCCCTTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAGGAGCTTAAACAGAAGCTTGTGTCTGCACCAGTCCTGACAGTGCCAGATGGATCGGGGAGCTTTGTGA
TCTACAGTGATGCCTCCAAGAAAGGACTGGGTTGCGTGCTGATGCAGCAGGACAGGGTAGTTGCTTATGCCTCCCGTCAGCTGAAGAGTCATGAGCAGAACTACCCTACT
CATGACTTAGAGTTGGCAGCAGTGGTGTTTGCACTGAAGATATGGAGGCACTACTTGTACGGTGAGAAGATATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGATGTGACCTTACTAGTGTTAGACATGCAGGACTTTGATGTAATCCTAGGCATGGATTGGTTGTCTGCTAACCATGCAAGTATAGACTGTTTCGGTAAGGAAGT
CGTTTTTAACCCTCCCTCTAAGGCTAGTTTCAAATTTAGGGGGATAGGAATTGTATGTATACCCAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTCAGCCAGGGTA
CTTGGGGCATCTTGGCAAGCGTAGTAGATACCAGAGAACCAGAAGTTTCCCTGTCCTCCGAACCAGTGGTAAGGGAGTACCCCGATGTTTTCCCCGACGAGCTTCCTGGA
TTTCCGTTTTCTAGGGAGATAGACTTCGCCATCGAGTTAGAGCCGGACACTACTCCTATCTCAAGGGCCCTTTACAGAATGGCTTCAGCCGAGCTGAAAGAGTCGAAGGT
ACAGCTGCAGGAGTTGCTGGACAAGGATTTCATCCAACTCAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAAGGATGGGTTGATGCGCCTTTGCATTG
ACTACAGAGAGTTGAACAAGGTGACAGTCAAGAACCGTTACCCCTTGCCCAGGATTGATGACTTGTTCGATCAGTTTGATATTCCTAAGACGGCTTTCCGTTCTAGATAC
GGGCACTACGAGTTCATCGTGATGTCCTTTGGTTTGACTAATGCTCCTGCGGTATTCATGGACTTGATGAACAGGGTGTTTAAGGATTTCTTAGACACGTTTGTCATAGT
TTTCATTGACGACATTTTGATTTACTCCAAGACTGAGGTCGAGCACGAGGAGCATTTGTACCGGGTTTTGGAGATTCTTCGAGCCAATAAGTTGTACGCCAAGTTCTCAA
AGTGTGAGTTCTGGCTAAAAACGGTGACTTTCCTCGACCACGTGGTTTCCAGTGAGGGAGTTTCTGTAGACCCGGTAAAGATCGAAGCGATTACCAGTTGGCCTCGACCG
TCGACAATTAGCGAGATTCGTAGTTTCTTGGGTTTGGTAGATTACTACAGGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTGGACTCAGTTGACCAGGAAGGG
GACCCCTTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAGGAGCTTAAACAGAAGCTTGTGTCTGCACCAGTCCTGACAGTGCCAGATGGATCGGGGAGCTTTGTGA
TCTACAGTGATGCCTCCAAGAAAGGACTGGGTTGCGTGCTGATGCAGCAGGACAGGGTAGTTGCTTATGCCTCCCGTCAGCTGAAGAGTCATGAGCAGAACTACCCTACT
CATGACTTAGAGTTGGCAGCAGTGGTGTTTGCACTGAAGATATGGAGGCACTACTTGTACGGTGAGAAGATATAG
Protein sequenceShow/hide protein sequence
MLDVTLLVLDMQDFDVILGMDWLSANHASIDCFGKEVVFNPPSKASFKFRGIGIVCIPKVISAMKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELPG
FPFSREIDFAIELEPDTTPISRALYRMASAELKESKVQLQELLDKDFIQLSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPRIDDLFDQFDIPKTAFRSRY
GHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEVEHEEHLYRVLEILRANKLYAKFSKCEFWLKTVTFLDHVVSSEGVSVDPVKIEAITSWPRP
STISEIRSFLGLVDYYRRFVEDFSRIASPWTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQDRVVAYASRQLKSHEQNYPT
HDLELAAVVFALKIWRHYLYGEKI