; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G036267 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G036267
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionReverse transcriptase
Genome locationCla97Chr02:16653407..16655092
RNA-Seq ExpressionCla97C02G036267
SyntenyCla97C02G036267
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-19663.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

KAA0042295.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.0e-19865.67Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS
        T FRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                        ERP + TEVRS
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS

Query:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVISYASRQLKKHECNY
        FLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+ PIL LP    +  +YCDASR GLGCVLMQ G VI+YASRQLK+HECNY
Subjt:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVISYASRQLKKHECNY

Query:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRQVKASMNAINAELTTELRR
         THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADALSRKSR  K+++  I   L  ELR 
Subjt:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRQVKASMNAINAELTTELRR

Query:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSKKMYRTL
        S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LKN ILEEAHSSAYAMHPGS KMYRTL
Subjt:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSKKMYRTL

Query:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        +  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-19663.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-19663.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

TYK00844.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.3e-19663.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase8.3e-19763.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5A7TLH7 Reverse transcriptase3.4e-19865.67Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS
        T FRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                        ERP + TEVRS
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS

Query:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVISYASRQLKKHECNY
        FLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+ PIL LP    +  +YCDASR GLGCVLMQ G VI+YASRQLK+HECNY
Subjt:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVISYASRQLKKHECNY

Query:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRQVKASMNAINAELTTELRR
         THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADALSRKSR  K+++  I   L  ELR 
Subjt:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRQVKASMNAINAELTTELRR

Query:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSKKMYRTL
        S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LKN ILEEAHSSAYAMHPGS KMYRTL
Subjt:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSKKMYRTL

Query:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        +  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5A7U2V7 Reverse transcriptase8.3e-19763.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5D3BHI1 Reverse transcriptase8.3e-19763.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5D3BS67 Reverse transcriptase6.4e-19763.1Show/hide
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VI+YASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGS KMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.3e-6130.27Show/hide
Query:  KELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGT-----LRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKT
        +E++ Q+Q+++ +G +R S SP+ + +  V KK         R+ IDY                           + F+ IDL  G+HQ+++    + KT
Subjt:  KELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGT-----LRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKT

Query:  AFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHL--------------------------------------
        AF T++GHYE+L MPFGL NAPA F   MN I  P L++  +V++DDI+V+S + ++H + L                                      
Subjt:  AFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHL--------------------------------------

Query:  -----RVER------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
             ++E       PT   E+++FLGL GYYR+F+  F+ IA P++   KK  K + TN E + +F+KLK  +   PIL +P    +F +  DAS   L
Subjt:  -----RVER------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        G VL Q G  +SY SR L +HE NY T + EL  +V A K +RHYL G    I SDH+ L +++  K+ N +  RW   + ++D  I+Y  GK N VADA
Subjt:  GCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSR-------KSRQVKASMNAINAEL--TTELRRSNASLSVDALGG-----LFAHFHLRPT------LTEEIVNKQMEDSILRKILEEVKLKKRADFE--
        LSR        S Q + S    N++L   TE   +  +  V    G     +  +F    T      +T E   + + D    K    + ++  ADFE  
Subjt:  LSR-------KSRQVKASMNAINAEL--TTELRRSNASLSVDALGG-----LFAHFHLRPT------LTEEIVNKQMEDSILRKILEEVKLKKRADFE--

Query:  -------IRNDGTLLKQGRLCVPNDLT---LKNVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQ
               I    T + +  + + N  T    K +IL  AH     +HPG +K  +     Y++P  +  I   +  C IC   K E +
Subjt:  -------IRNDGTLLKQGRLCVPNDLT---LKNVILEEAHSSAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQ

P20825 Retrovirus-related Pol polyprotein from transposon 2973.6e-5634.58Show/hide
Query:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKKD-----GTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA
        E++ Q+QE++ +G +R S SP+ +    V KK         R+ IDY                             F+ IDL  G+HQ+++    I KTA
Subjt:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKKD-----GTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA

Query:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKH----------------------TEHLRVER-------------
        F T+ GHYE+L MPFGL NAPA F   MN I  P L++  +V++DDI+++S +  +H                       E L+ E              
Subjt:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKH----------------------TEHLRVER-------------

Query:  --------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLG
                      PT   E+R+FLGL GYYR+F+  ++ IA P++S  KK  K +    E  ++F+KLK  ++  PIL LP    +F +  DAS   LG
Subjt:  --------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLG

Query:  CVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADAL
         VL Q G  IS+ SR L  HE NY   + EL  +V A K +RHYL G +  I SDH+ L+++ + KE   +  RW   + +Y   I+Y  GK N VADAL
Subjt:  CVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADAL

Query:  SR
        SR
Subjt:  SR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.0e-5528.37Show/hide
Query:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF
        K  +E+   +Q+L+   ++ PS SP  + V+ V KKDGT RLC+DY                          A +F+ +DL  GYHQ+ ++  D  KTAF
Subjt:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF

Query:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------
         T  G YE+ VMPFGL NAP+ F   M   F     +F+ V++DDIL++S + E+H +HL   +ER                                  
Subjt:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------

Query:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV
                     P  V + + FLG+  YYRRF+   SKIA P+        K +WT + +++ +KLK  L ++P+L     +  + +  DAS+ G+G V
Subjt:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV

Query:  LMQ---KGK---VISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV
        L +   K K   V+ Y S+ L+  + NY   +LEL  ++ AL  +R+ L+G+   + +DH SL  + ++ E   R +RW++ +  YD ++EY  G  NVV
Subjt:  LMQ---KGK---VISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV

Query:  ADALSRKSRQV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP
        ADA+SR    +   +   I+ E      +S+   S   +       H++      +  + M  S  R   ++++L +  R ++ +  D  +  Q RL VP
Subjt:  ADALSRKSRQV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP

Query:  NDLTLKNVILEEAHS-SAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR
          +  +N ++   H  + +  H G       +   Y+WP ++  I + +  C+ CQ +K  R R
Subjt:  NDLTLKNVILEEAHS-SAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus5.4e-5230.72Show/hide
Query:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKK-----DGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA
        E++ Q+ EL+  G +RPS SP+ + +  V KK     +   R+ +D+                          A  F+ +DL  G+HQ+ +K +DIPKTA
Subjt:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKK-----DGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA

Query:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--------------------------------------
        F T  G YEFL +PFGL NAPA+F  +++ I   ++ +   V+IDDI+V+S + + H ++LR                                      
Subjt:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--------------------------------------

Query:  -----------VERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKV-AKFEWTNECE----------QSFQKLKEKLVSAPILTLPTPRMEFEV
                   +  PT+V E++ FLG+  YYR+F++ ++K+A PL++LT+ + A  + +   +          QSF  LK  L S+ IL  P     F +
Subjt:  -----------VERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKV-AKFEWTNECE----------QSFQKLKEKLVSAPILTLPTPRMEFEV

Query:  YCDASRQGLGCVLMQ----KGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGE-RCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCS
          DAS   +G VL Q    + + I+Y SR L K E NY T + E+  ++ +L   R YLYG    ++ +DH+ L +    +  N + +RW   I++Y+C 
Subjt:  YCDASRQGLGCVLMQ----KGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGE-RCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCS

Query:  IEYHPGKANVVADALSRKSRQVKASMNAINAEL
        + Y PGK+NVVADALSR    +   +N ++ +L
Subjt:  IEYHPGKANVVADALSRKSRQVKASMNAINAEL

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.1e-5528.37Show/hide
Query:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF
        K  +E+   +Q+L+   ++ PS SP  + V+ V KKDGT RLC+DY                          A +F+ +DL  GYHQ+ ++  D  KTAF
Subjt:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF

Query:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------
         T  G YE+ VMPFGL NAP+ F   M   F     +F+ V++DDIL++S + E+H +HL   +ER                                  
Subjt:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------

Query:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV
                     P  V + + FLG+  YYRRF+   SKIA P+        K +WT + +++  KLK+ L ++P+L     +  + +  DAS+ G+G V
Subjt:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV

Query:  LMQ---KGK---VISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV
        L +   K K   V+ Y S+ L+  + NY   +LEL  ++ AL  +R+ L+G+   + +DH SL  + ++ E   R +RW++ +  YD ++EY  G  NVV
Subjt:  LMQ---KGK---VISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV

Query:  ADALSRKSRQV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP
        ADA+SR    +   +   I+ E      +S+   S   +       H++      +  + M  S  R   ++++L +  R ++ +  D  +  Q RL VP
Subjt:  ADALSRKSRQV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP

Query:  NDLTLKNVILEEAHS-SAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR
          +  +N ++   H  + +  H G       +   Y+WP ++  I + +  C+ CQ +K  R R
Subjt:  NDLTLKNVILEEAHS-SAYAMHPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein9.6e-1245.21Show/hide
Query:  PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEF
        P N TE+R FLGL GYYRRFV+ + KI  PL+ L KK    +WT     +F+ LK  + + P+L LP  ++ F
Subjt:  PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCGAAGGAGTTAAAGGAACTGAAAGTACAATTGCAAGAATTGATTTACAAGGGGTATGTACGACCTAGTGTGTCGCCGTGGGGAGCACTGGTGTTGTTTGTGAA
GAAGAAGGATGGCACACTTAGATTATGTATTGATTACTGTGCTTCAGTATTTTCCAAGATTGATCTACGATTTGGATATCATCAGATGAAGATCAAAGGAACAGATATAC
CAAAGACTGCATTTAGAACGAGGTATGGACATTACGAATTTCTAGTAATGCCTTTTGGATTAACAAATGCTCCGGCTGTATTTATGGACCTCATGAATCGCATATTCCAT
CCATACCTTGATCAATTCATTGTAGTGTTCATCGATGATATACTAGTATATTCTGGGAATAAGGAAAAGCATACAGAACACCTTCGAGTGGAAAGACCTACTAATGTCAC
AGAAGTTCGTAGTTTTCTGGGACTAGCAGGTTATTATCGCCGATTTGTTGAAGGATTTTCAAAGATAGCACTACCCTTATCAAGCTTAACCAAGAAGGTAGCCAAGTTTG
AATGGACAAATGAATGTGAACAGAGTTTTCAAAAGTTGAAGGAAAAATTAGTGTCAGCACCAATATTGACATTGCCTACCCCTAGAATGGAGTTTGAAGTGTATTGCGAT
GCGTCACGACAGGGGTTAGGATGTGTACTCATGCAGAAGGGAAAAGTGATTTCCTATGCGTCGAGGCAGCTGAAGAAGCACGAATGTAACTATCTGACGCATGATTTGGA
GTTGGCAGAAGTTGTGTTAGCCTTGAAAATTTGGCGACATTACTTGTACGGAGAAAGGTGTCGTATTCTTTCCGATCATAAAAGTTTGAAGTATATCTTTGATCAAAAGG
AACTAAATTTACGACAAAGAAGATGGATGGAATTGATTAAAGACTATGATTGCTCAATAGAATACCATCCAGGAAAGGCCAATGTGGTAGCCGACGCATTGAGTAGGAAG
TCGAGACAAGTCAAGGCTTCAATGAACGCTATCAATGCGGAATTAACAACAGAGCTTAGGCGTTCAAACGCATCCTTATCGGTGGACGCATTAGGAGGATTGTTTGCACA
CTTCCATCTAAGACCTACTTTGACAGAGGAGATTGTTAATAAACAGATGGAAGACTCAATACTTAGAAAAATATTAGAAGAAGTGAAACTTAAGAAAAGAGCGGATTTTG
AAATTAGAAATGATGGAACTTTATTGAAACAAGGAAGATTATGCGTTCCTAACGATTTAACATTGAAAAACGTCATTCTAGAGGAAGCTCATAGTTCTGCTTATGCAATG
CACCCCGGTAGTAAAAAGATGTACAGAACTCTAAGAGGGTATTATTGGTGGCCAGGAATGAAACGAGAAATTGCTGAATGTGTAGCAAGATGCTTGATATGTCAGCAAGT
TAAACCAGAAAGACAAAGACCTGGGGACTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCGAAGGAGTTAAAGGAACTGAAAGTACAATTGCAAGAATTGATTTACAAGGGGTATGTACGACCTAGTGTGTCGCCGTGGGGAGCACTGGTGTTGTTTGTGAA
GAAGAAGGATGGCACACTTAGATTATGTATTGATTACTGTGCTTCAGTATTTTCCAAGATTGATCTACGATTTGGATATCATCAGATGAAGATCAAAGGAACAGATATAC
CAAAGACTGCATTTAGAACGAGGTATGGACATTACGAATTTCTAGTAATGCCTTTTGGATTAACAAATGCTCCGGCTGTATTTATGGACCTCATGAATCGCATATTCCAT
CCATACCTTGATCAATTCATTGTAGTGTTCATCGATGATATACTAGTATATTCTGGGAATAAGGAAAAGCATACAGAACACCTTCGAGTGGAAAGACCTACTAATGTCAC
AGAAGTTCGTAGTTTTCTGGGACTAGCAGGTTATTATCGCCGATTTGTTGAAGGATTTTCAAAGATAGCACTACCCTTATCAAGCTTAACCAAGAAGGTAGCCAAGTTTG
AATGGACAAATGAATGTGAACAGAGTTTTCAAAAGTTGAAGGAAAAATTAGTGTCAGCACCAATATTGACATTGCCTACCCCTAGAATGGAGTTTGAAGTGTATTGCGAT
GCGTCACGACAGGGGTTAGGATGTGTACTCATGCAGAAGGGAAAAGTGATTTCCTATGCGTCGAGGCAGCTGAAGAAGCACGAATGTAACTATCTGACGCATGATTTGGA
GTTGGCAGAAGTTGTGTTAGCCTTGAAAATTTGGCGACATTACTTGTACGGAGAAAGGTGTCGTATTCTTTCCGATCATAAAAGTTTGAAGTATATCTTTGATCAAAAGG
AACTAAATTTACGACAAAGAAGATGGATGGAATTGATTAAAGACTATGATTGCTCAATAGAATACCATCCAGGAAAGGCCAATGTGGTAGCCGACGCATTGAGTAGGAAG
TCGAGACAAGTCAAGGCTTCAATGAACGCTATCAATGCGGAATTAACAACAGAGCTTAGGCGTTCAAACGCATCCTTATCGGTGGACGCATTAGGAGGATTGTTTGCACA
CTTCCATCTAAGACCTACTTTGACAGAGGAGATTGTTAATAAACAGATGGAAGACTCAATACTTAGAAAAATATTAGAAGAAGTGAAACTTAAGAAAAGAGCGGATTTTG
AAATTAGAAATGATGGAACTTTATTGAAACAAGGAAGATTATGCGTTCCTAACGATTTAACATTGAAAAACGTCATTCTAGAGGAAGCTCATAGTTCTGCTTATGCAATG
CACCCCGGTAGTAAAAAGATGTACAGAACTCTAAGAGGGTATTATTGGTGGCCAGGAATGAAACGAGAAATTGCTGAATGTGTAGCAAGATGCTTGATATGTCAGCAAGT
TAAACCAGAAAGACAAAGACCTGGGGACTTTTAA
Protein sequenceShow/hide protein sequence
MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDYCASVFSKIDLRFGYHQMKIKGTDIPKTAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFH
PYLDQFIVVFIDDILVYSGNKEKHTEHLRVERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCD
ASRQGLGCVLMQKGKVISYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRK
SRQVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAM
HPGSKKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF