CuGenDBv2

Clc02G10670 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G10670
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase
Genome location: ClcChr02:16581987..16583672
RNA-Seq Expression: Clc02G10670
Synteny: Clc02G10670
Gene Ontology terms:
    GO:0006259 - DNA metabolic process (biological process)
    GO:0005488 - binding (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
    IPR041588 - Integrase zinc-binding domain
    IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.5e-197; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

KAA0042295.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 6.3e-199; % identity: 66.04)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS
        T FRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                        ERP + TEVRS
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS

Query:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVIAYASRQLKKHECNY
        FLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+ PIL LP    +  +YCDASR GLGCVLMQ G VIAYASRQLK+HECNY
Subjt:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVIAYASRQLKKHECNY

Query:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRKVKASMNAINAELTTELRR
         THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADALSRKSR  K+++  I   L  ELR 
Subjt:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRKVKASMNAINAELTTELRR

Query:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSTKMYRTL
        S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LKN ILEEAHSSAYAMHPGSTKMYRTL
Subjt:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSTKMYRTL

Query:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        +  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.5e-197; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.5e-197; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

TYK00844.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.2e-197; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T1Y5 Reverse transcriptase (e value: 7.5e-198; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5A7TLH7 Reverse transcriptase (e value: 3.0e-199; % identity: 66.04)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS
        T FRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                        ERP + TEVRS
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV------------------------ERPTNVTEVRS

Query:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVIAYASRQLKKHECNY
        FLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+ PIL LP    +  +YCDASR GLGCVLMQ G VIAYASRQLK+HECNY
Subjt:  FLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCVLMQKGKVIAYASRQLKKHECNY

Query:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRKVKASMNAINAELTTELRR
         THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADALSRKSR  K+++  I   L  ELR 
Subjt:  LTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRKSRKVKASMNAINAELTTELRR

Query:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSTKMYRTL
        S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LKN ILEEAHSSAYAMHPGSTKMYRTL
Subjt:  SNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAMHPGSTKMYRTL

Query:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        +  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  RGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5A7U2V7 Reverse transcriptase (e value: 7.5e-198; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5D3BHI1 Reverse transcriptase (e value: 7.5e-198; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

A0A5D3BS67 Reverse transcriptase (e value: 5.7e-198; % identity: 63.46)
Query:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK
        MA  ELKELK+QLQEL+ KGY+RPSVSPWGA VLFVKKKDGTLRLCIDY                          A++FSKIDLR GYHQ+K++ +DI K
Subjt:  MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPK

Query:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------
        TAFRTRYGHYEF VMPFGLTNAPAVFMDLMNRIFH YLDQF++VFIDDILVYS ++E H EHLR+                                   
Subjt:  TAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLRV-----------------------------------

Query:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
                      ERP + TEVRSFLGLAGYYRRF+E FS++ALPL++LT+K  KFEW+++CEQSFQ+LK++LV+APIL LP    ++ +YCDASR GL
Subjt:  --------------ERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        GCVLMQ G VIAYASRQLK+HECNY THDLELA VVLALKIWRHYL+GE+C I +DHKSLKYIFDQKELNLRQRRW+ELIKDYDC+IEYHPGKANVVADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK
        LSRKSR  K+++  I   L  ELR S A ++ +  G L A F +R +L  EIV +Q EDS L+K  E+ K     +FE+R DG ++KQGRLCVPN   LK
Subjt:  LSRKSRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLK

Query:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
        N ILEEAHSSAYAMHPGSTKMYRTL+  YWW GMK+EIAE V RCLICQQVKP RQRPG F
Subjt:  NVILEEAHSSAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 7.8e-59; % identity: 34)
Query:  KELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGT-----LRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKT
        +E++ Q+Q+++ +G +R S SP+ + +  V KK         R+ IDY                           + F+ IDL  G+HQ+++    + KT
Subjt:  KELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGT-----LRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKT

Query:  AFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHL--------------------------------------
        AF T++GHYE+L MPFGL NAPA F   MN I  P L++  +V++DDI+V+S + ++H + L                                      
Subjt:  AFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHL--------------------------------------

Query:  -----RVER------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL
             ++E       PT   E+++FLGL GYYR+F+  F+ IA P++   KK  K + TN E + +F+KLK  +   PIL +P    +F +  DAS   L
Subjt:  -----RVER------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGL

Query:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA
        G VL Q G  ++Y SR L +HE NY T + EL  +V A K +RHYL G    I SDH+ L +++  K+ N +  RW   + ++D  I+Y  GK N VADA
Subjt:  GCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 6.2e-56; % identity: 34.33)
Query:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKKD-----GTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA
        E++ Q+QE++ +G +R S SP+ +    V KK         R+ IDY                             F+ IDL  G+HQ+++    I KTA
Subjt:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKKD-----GTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA

Query:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKH----------------------TEHLRVER-------------
        F T+ GHYE+L MPFGL NAPA F   MN I  P L++  +V++DDI+++S +  +H                       E L+ E              
Subjt:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKH----------------------TEHLRVER-------------

Query:  --------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLG
                      PT   E+R+FLGL GYYR+F+  ++ IA P++S  KK  K +    E  ++F+KLK  ++  PIL LP    +F +  DAS   LG
Subjt:  --------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTN-ECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLG

Query:  CVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADAL
         VL Q G  I++ SR L  HE NY   + EL  +V A K +RHYL G +  I SDH+ L+++ + KE   +  RW   + +Y   I+Y  GK N VADAL
Subjt:  CVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADAL

Query:  SR
        SR
Subjt:  SR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 8.1e-56; % identity: 28.55)
Query:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF
        K  +E+   +Q+L+   ++ PS SP  + V+ V KKDGT RLC+DY                          A +F+ +DL  GYHQ+ ++  D  KTAF
Subjt:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF

Query:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------
         T  G YE+ VMPFGL NAP+ F   M   F     +F+ V++DDIL++S + E+H +HL   +ER                                  
Subjt:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------

Query:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV
                     P  V + + FLG+  YYRRF+   SKIA P+        K +WT + +++ +KLK  L ++P+L     +  + +  DAS+ G+G V
Subjt:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV

Query:  LMQ---KGK---VIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV
        L +   K K   V+ Y S+ L+  + NY   +LEL  ++ AL  +R+ L+G+   + +DH SL  + ++ E   R +RW++ +  YD ++EY  G  NVV
Subjt:  LMQ---KGK---VIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV

Query:  ADALSRKSRKV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP
        ADA+SR    +   +   I+ E      +S+   S   +       H++      +  + M  S  R   ++++L +  R ++ +  D  +  Q RL VP
Subjt:  ADALSRKSRKV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP

Query:  NDLTLKNVILEEAHS-SAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR
          +  +N ++   H  + +  H G T     +   Y+WP ++  I + +  C+ CQ +K  R R
Subjt:  NDLTLKNVILEEAHS-SAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 2.4e-52; % identity: 30.95)
Query:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKK-----DGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA
        E++ Q+ EL+  G +RPS SP+ + +  V KK     +   R+ +D+                          A  F+ +DL  G+HQ+ +K +DIPKTA
Subjt:  ELKVQLQELIYKGYVRPSVSPWGALVLFVKKK-----DGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTA

Query:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--------------------------------------
        F T  G YEFL +PFGL NAPA+F  +++ I   ++ +   V+IDDI+V+S + + H ++LR                                      
Subjt:  FRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--------------------------------------

Query:  -----------VERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKV-AKFEWTNECE----------QSFQKLKEKLVSAPILTLPTPRMEFEV
                   +  PT+V E++ FLG+  YYR+F++ ++K+A PL++LT+ + A  + +   +          QSF  LK  L S+ IL  P     F +
Subjt:  -----------VERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKV-AKFEWTNECE----------QSFQKLKEKLVSAPILTLPTPRMEFEV

Query:  YCDASRQGLGCVLMQ----KGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGE-RCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCS
          DAS   +G VL Q    + + IAY SR L K E NY T + E+  ++ +L   R YLYG    ++ +DH+ L +    +  N + +RW   I++Y+C 
Subjt:  YCDASRQGLGCVLMQ----KGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGE-RCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCS

Query:  IEYHPGKANVVADALSRKSRKVKASMNAINAEL
        + Y PGK+NVVADALSR    +   +N ++ +L
Subjt:  IEYHPGKANVVADALSRKSRKVKASMNAINAEL

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 6.2e-56; % identity: 28.55)
Query:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF
        K  +E+   +Q+L+   ++ PS SP  + V+ V KKDGT RLC+DY                          A +F+ +DL  GYHQ+ ++  D  KTAF
Subjt:  KELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDY-------------------------CASVFSKIDLRFGYHQMKIKGTDIPKTAF

Query:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------
         T  G YE+ VMPFGL NAP+ F   M   F     +F+ V++DDIL++S + E+H +HL   +ER                                  
Subjt:  RTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFHPYLDQFIVVFIDDILVYSGNKEKHTEHLR--VER----------------------------------

Query:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV
                     P  V + + FLG+  YYRRF+   SKIA P+        K +WT + +++  KLK+ L ++P+L     +  + +  DAS+ G+G V
Subjt:  -------------PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCDASRQGLGCV

Query:  LMQ---KGK---VIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV
        L +   K K   V+ Y S+ L+  + NY   +LEL  ++ AL  +R+ L+G+   + +DH SL  + ++ E   R +RW++ +  YD ++EY  G  NVV
Subjt:  LMQ---KGK---VIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVV

Query:  ADALSRKSRKV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP
        ADA+SR    +   +   I+ E      +S+   S   +       H++      +  + M  S  R   ++++L +  R ++ +  D  +  Q RL VP
Subjt:  ADALSRKSRKV-KASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKK--RADFEIRNDGTLLKQGRLCVP

Query:  NDLTLKNVILEEAHS-SAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR
          +  +N ++   H  + +  H G T     +   Y+WP ++  I + +  C+ CQ +K  R R
Subjt:  NDLTLKNVILEEAHS-SAYAMHPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQR

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 9.6e-12; % identity: 45.21)
Query:  PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEF
        P N TE+R FLGL GYYRRFV+ + KI  PL+ L KK    +WT     +F+ LK  + + P+L LP  ++ F
Subjt:  PTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEF


Sequences
CDS sequence
ATGGCAGCGAAGGAGTTAAAGGAACTGAAAGTACAATTGCAAGAATTGATTTACAAGGGGTATGTACGACCTAGTGTGTCGCCGTGGGGAGCACTGGTGTTGTTTGTGAA
GAAGAAGGATGGCACACTTAGATTATGTATTGATTACTGTGCTTCAGTATTTTCCAAGATTGATCTACGATTTGGATATCATCAGATGAAGATCAAAGGAACAGATATAC
CAAAGACTGCATTTAGAACGAGGTATGGACATTACGAATTTCTAGTAATGCCTTTTGGATTAACAAATGCTCCGGCTGTATTTATGGACCTCATGAATCGCATATTCCAT
CCATACCTTGATCAATTCATTGTAGTGTTCATCGATGATATACTAGTATATTCTGGGAATAAGGAAAAGCATACAGAACACCTTCGAGTGGAAAGACCTACTAATGTCAC
AGAAGTTCGTAGTTTTCTGGGACTAGCAGGTTATTATCGCCGATTTGTTGAAGGATTTTCAAAGATAGCACTACCCTTATCAAGCTTAACCAAGAAGGTAGCCAAGTTTG
AATGGACAAATGAATGTGAACAGAGTTTTCAAAAGTTGAAGGAAAAATTAGTGTCAGCACCAATATTGACATTGCCTACCCCTAGAATGGAGTTTGAAGTGTATTGCGAT
GCGTCACGACAGGGGTTAGGATGTGTACTCATGCAGAAGGGAAAAGTGATTGCCTATGCGTCGAGGCAGCTGAAGAAGCACGAATGTAACTATTTGACGCATGATTTGGA
GTTGGCAGAAGTTGTGTTAGCCTTGAAAATTTGGCGACATTACTTGTACGGAGAAAGGTGTCGTATTCTTTCCGATCATAAAAGTTTGAAGTATATCTTTGATCAAAAGG
AACTAAATTTACGACAAAGAAGATGGATGGAATTGATTAAAGACTATGATTGCTCAATAGAATACCATCCAGGAAAGGCCAATGTGGTAGCCGACGCATTGAGTAGGAAG
TCGAGAAAAGTCAAGGCTTCAATGAACGCTATCAATGCGGAATTAACAACAGAGCTTAGGCGTTCAAACGCATCCTTATCGGTGGACGCATTAGGAGGATTGTTTGCGCA
CTTCCATCTAAGACCTACTTTGACAGAGGAGATTGTTAATAAACAGATGGAAGACTCAATACTTAGAAAAATATTAGAAGAAGTGAAACTTAAGAAAAGAGCGGATTTTG
AAATTAGAAATGATGGAACTTTATTGAAACAAGGAAGATTATGCGTTCCTAACGATTTAACATTGAAAAACGTCATTCTAGAGGAAGCTCATAGTTCTGCTTATGCAATG
CACCCCGGTAGTACAAAGATGTACAGAACTCTAAGAGGGTATTATTGGTGGCCAGGAATGAAACGAGAAATTGCTGAATGTGTAGCAAGATGCTTGATATGTCAGCAAGT
TAAACCAGAAAGACAAAGACCTGGGGACTTTTAA
mRNA sequence
ATGGCAGCGAAGGAGTTAAAGGAACTGAAAGTACAATTGCAAGAATTGATTTACAAGGGGTATGTACGACCTAGTGTGTCGCCGTGGGGAGCACTGGTGTTGTTTGTGAA
GAAGAAGGATGGCACACTTAGATTATGTATTGATTACTGTGCTTCAGTATTTTCCAAGATTGATCTACGATTTGGATATCATCAGATGAAGATCAAAGGAACAGATATAC
CAAAGACTGCATTTAGAACGAGGTATGGACATTACGAATTTCTAGTAATGCCTTTTGGATTAACAAATGCTCCGGCTGTATTTATGGACCTCATGAATCGCATATTCCAT
CCATACCTTGATCAATTCATTGTAGTGTTCATCGATGATATACTAGTATATTCTGGGAATAAGGAAAAGCATACAGAACACCTTCGAGTGGAAAGACCTACTAATGTCAC
AGAAGTTCGTAGTTTTCTGGGACTAGCAGGTTATTATCGCCGATTTGTTGAAGGATTTTCAAAGATAGCACTACCCTTATCAAGCTTAACCAAGAAGGTAGCCAAGTTTG
AATGGACAAATGAATGTGAACAGAGTTTTCAAAAGTTGAAGGAAAAATTAGTGTCAGCACCAATATTGACATTGCCTACCCCTAGAATGGAGTTTGAAGTGTATTGCGAT
GCGTCACGACAGGGGTTAGGATGTGTACTCATGCAGAAGGGAAAAGTGATTGCCTATGCGTCGAGGCAGCTGAAGAAGCACGAATGTAACTATTTGACGCATGATTTGGA
GTTGGCAGAAGTTGTGTTAGCCTTGAAAATTTGGCGACATTACTTGTACGGAGAAAGGTGTCGTATTCTTTCCGATCATAAAAGTTTGAAGTATATCTTTGATCAAAAGG
AACTAAATTTACGACAAAGAAGATGGATGGAATTGATTAAAGACTATGATTGCTCAATAGAATACCATCCAGGAAAGGCCAATGTGGTAGCCGACGCATTGAGTAGGAAG
TCGAGAAAAGTCAAGGCTTCAATGAACGCTATCAATGCGGAATTAACAACAGAGCTTAGGCGTTCAAACGCATCCTTATCGGTGGACGCATTAGGAGGATTGTTTGCGCA
CTTCCATCTAAGACCTACTTTGACAGAGGAGATTGTTAATAAACAGATGGAAGACTCAATACTTAGAAAAATATTAGAAGAAGTGAAACTTAAGAAAAGAGCGGATTTTG
AAATTAGAAATGATGGAACTTTATTGAAACAAGGAAGATTATGCGTTCCTAACGATTTAACATTGAAAAACGTCATTCTAGAGGAAGCTCATAGTTCTGCTTATGCAATG
CACCCCGGTAGTACAAAGATGTACAGAACTCTAAGAGGGTATTATTGGTGGCCAGGAATGAAACGAGAAATTGCTGAATGTGTAGCAAGATGCTTGATATGTCAGCAAGT
TAAACCAGAAAGACAAAGACCTGGGGACTTTTAA
Protein sequence
MAAKELKELKVQLQELIYKGYVRPSVSPWGALVLFVKKKDGTLRLCIDYCASVFSKIDLRFGYHQMKIKGTDIPKTAFRTRYGHYEFLVMPFGLTNAPAVFMDLMNRIFH
PYLDQFIVVFIDDILVYSGNKEKHTEHLRVERPTNVTEVRSFLGLAGYYRRFVEGFSKIALPLSSLTKKVAKFEWTNECEQSFQKLKEKLVSAPILTLPTPRMEFEVYCD
ASRQGLGCVLMQKGKVIAYASRQLKKHECNYLTHDLELAEVVLALKIWRHYLYGERCRILSDHKSLKYIFDQKELNLRQRRWMELIKDYDCSIEYHPGKANVVADALSRK
SRKVKASMNAINAELTTELRRSNASLSVDALGGLFAHFHLRPTLTEEIVNKQMEDSILRKILEEVKLKKRADFEIRNDGTLLKQGRLCVPNDLTLKNVILEEAHSSAYAM
HPGSTKMYRTLRGYYWWPGMKREIAECVARCLICQQVKPERQRPGDF
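
The CDS and protein sequences listed above can be checked against each other by translating the CDS in frame. The following is a minimal sketch, not part of the database record. It assumes the standard genetic code (the record does not state which translation table was used), and the names `translate`, `CODON_TABLE`, `CDS_START`, and `CDS_END` are hypothetical helpers introduced for illustration only. The two test fragments are the first 60 nt and the last 15 nt of the CDS sequence.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so the wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 15 nt of the CDS sequence listed above.
CDS_START = "ATGGCAGCGAAGGAGTTAAAGGAACTGAAAGTACAATTGCAAGAATTGATTTACAAGGGG"
CDS_END = "CCTGGGGACTTTTAA"

print(translate(CDS_START))  # start of the protein sequence
print(translate(CDS_END))    # end of the protein sequence, stop codon dropped
```

Passing the full CDS sequence to `translate` should reproduce the 487-residue protein sequence, since the CDS is 1464 nt long (487 codons plus the TAA stop codon).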