; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G17250 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G17250
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr4:14744760..14747592
RNA-Seq ExpressionCSPI04G17250
SyntenyCSPI04G17250
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040699.1 pol protein [Cucumis melo var. makuwa]1.6e-25056.14Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKEK+KACQI I   V++VTL+VLD+ DFDVILGMDWLA NHASIDCS K 
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+EV++ L+ EPVVR+Y DVFPEELPGL PHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  -----------------------------------------------------------------MTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEV
                                                                         ++FLGHVVS  GV VDPAKIEA+T W+R STVSEV
Subjt:  -----------------------------------------------------------------MTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEV

Query:  CSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGS---FSHEQNYPTHDLELAA----------------
         SFLGLAG                     +G  FVWS ACE SFQN KQKLVTA    V   + S    SHEQNYPTHDLELAA                
Subjt:  CSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGS---FSHEQNYPTHDLELAA----------------

Query:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK
                                       DYD EILYH GKANVVADALSRKVSHS ALITRQ P+H+D +R EIAVSVG +T QL QLTVQPTLRQ+
Subjt:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK

Query:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV
        IIDAQ ND  LVEK  L E GQA  FS+SSDGGLLFER LCVP+D+ VKT+LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKR+VAEFVS CLVCQQV
Subjt:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV

Query:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL
        KA RQKP GLLQPLS+PKWK ENVSMDF+TGL  TL+G+TVIWVVVD+LTKSA FVPGKSTYTASKWAQLY+++IVRL                   ++ 
Subjt:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL

Query:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL
         QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALYGKCCRS VCW EV EQR+M PEL
Subjt:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL

KAA0043391.1 pol protein [Cucumis melo var. makuwa]3.2e-25154.77Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKEK+KACQI I   V++VTL+VLD+ DFDVILGMDWLA NHASIDCS KE
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+E ++ L+ EPVVR+Y DVFPEELPGLPPHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG
                                                                                                ++FLGHVVS  G
Subjt:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG

Query:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------
        V VDPAKIEA+T W+R STVSEV SFLGLAG                     +G  FVWS ACE SFQN KQKLVTAP+ TVPDG GSF           
Subjt:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------

Query:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA
                           SH+QNYPTHDLELAA                                               DYD E LYH GKANVVADA
Subjt:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA

Query:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT
        LSRKVSHS ALITRQ  +H+D +R +IAVSVG VT QL QLTVQPTLRQ+I+DAQ ND  LVEK  L E GQA  FS+SSDGGLLFER LCVP D+ VK 
Subjt:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT

Query:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT
        +LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS CLVCQQVKA RQKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G+TVIWVVVD+LT
Subjt:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT

Query:  KSAQFVPGKSTYTASKWAQLYLTKIVRLPEYL-YRL-FQTEMH--TDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
        KSA FVPGKSTYTASKWAQLY+++I  L   +  RL F T  H  TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALY
Subjt:  KSAQFVPGKSTYTASKWAQLYLTKIVRLPEYL-YRL-FQTEMH--TDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

Query:  GKCCRSSVCWDEVDEQRMMSPE
        GKCCRS VCW EV EQR+M PE
Subjt:  GKCCRSSVCWDEVDEQRMMSPE

KAA0051368.1 pol protein [Cucumis melo var. makuwa]1.1e-25656.47Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE +LS+EK+KACQI I   V++VTL+VLD+ DFDVILGMDWLA NHASIDCS KE
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+E ++ L+ EPVVR+Y DVFPEELPGLPPHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKELKKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGA---------------------------------------------------
        M P +LKEL KKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGA                                                   
Subjt:  MTPTDLKELKKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGA---------------------------------------------------

Query:  -------------------------------------TMTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEVCSFLGLAG-------------------
                                              ++FLGHVVS  GV VDPAKIEA+T W+R STVSEV SFLGLAG                   
Subjt:  -------------------------------------TMTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEVCSFLGLAG-------------------

Query:  --RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF------------------------------SHEQNYPTHDLELAA----------------
          +G  FVWS ACE SFQN KQKLVTAP+ TVPDG GSF                              SHEQNYPTHDLELAA                
Subjt:  --RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF------------------------------SHEQNYPTHDLELAA----------------

Query:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK
                                       DYD EILYH GKANVVADALSRKVSHS ALITRQ P+H+D +R EIAVSVG VT QL QLTVQPTLRQ+
Subjt:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK

Query:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV
        IIDAQ ND  LVEK  L E GQ   FS+SSDGGLLFER LCVP D+ VKT+LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS CLVCQQV
Subjt:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV

Query:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL
        KA RQKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G+TVIWVVVD+LTKSA FVPGKSTYTASKWAQLY+++IVRL                   ++ 
Subjt:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL

Query:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL
         QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALYG+CCRS VCW EV EQR+M PEL
Subjt:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL

KAA0053368.1 pol protein [Cucumis melo var. makuwa]1.9e-25153.91Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKE++KACQI I   V++VTLLVLD+ DFDVILGMDWLA NHASIDCS K 
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+EV++ L+ EPVVR+Y DVFPEELPGLPPHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG
                                                                                                ++FLGHVVS  G
Subjt:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG

Query:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------
        V VDPAKIEA+T W+R STVSEV SFLGLAG                     +G  FVWS ACE SFQN KQKLVTAPI TVPDG GSF           
Subjt:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------

Query:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA
                           SHEQNYPTHDLELAA                                               DYD EILYH GKANVVADA
Subjt:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA

Query:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT
        LSRKVSHS ALITRQ P+H+D +R EIAVSVG VT QL QLTVQ TLRQ+II AQ ND  LVEK  L E GQA+GFSISSDGGL FER LCVP D+V+KT
Subjt:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT

Query:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT
        +LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS CLVCQQVKA RQKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G+TVIWVVVD+LT
Subjt:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT

Query:  KSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLH
        KS  FVPGKST TASKWAQLY+++IVRL                   ++  QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLH
Subjt:  KSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLH

Query:  LMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL
        LMEFAYNN++QATIGM PFEALYGKCCRS VCW EV EQR+M PEL
Subjt:  LMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL

KAA0062245.1 pol protein [Cucumis melo var. makuwa]5.3e-25455.62Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKEK+KACQI I + V++VTL+VLD+ DFDVILGMDWL  NHASIDCS KE
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+E ++ L+ EPVVR+Y DVFPEELPGLP HRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG
                                                                                                ++FLGHVVS  G
Subjt:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG

Query:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------
        V VDPAKIEA+T W+R ST+SEV SFLGLAG                     +G  FVWS ACE SFQN KQKLVTAP+ TVPDG GSF           
Subjt:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------

Query:  -------------------SHEQNYPTHDLELAA---------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQL
                           SHEQNYPTHDLELAA         DYD EILYH GKANVVADALSRKVSHS ALITRQ P+H+D +R EIAVSVG VT QL
Subjt:  -------------------SHEQNYPTHDLELAA---------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQL

Query:  DQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVA
         QLTVQPTLRQ+IIDAQ ND  LVEK  L E GQA  FS+SSDGGLLFER LCVP D+ VKT+LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVA
Subjt:  DQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVA

Query:  EFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL----------
        EFVS CLVCQQVKA  QKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G++VIWVVVD+LTKSA FV GKSTYTASKWAQLY+++IVRL          
Subjt:  EFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL----------

Query:  -----PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDE
                 ++  QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALYGKCC+S VCW EV E
Subjt:  -----PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDE

Query:  QRMMSPEL
        QR+M PEL
Subjt:  QRMMSPEL

TrEMBL top hitse value%identityAlignment
A0A5A7THF3 Reverse transcriptase7.7e-25156.14Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKEK+KACQI I   V++VTL+VLD+ DFDVILGMDWLA NHASIDCS K 
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+EV++ L+ EPVVR+Y DVFPEELPGL PHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  -----------------------------------------------------------------MTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEV
                                                                         ++FLGHVVS  GV VDPAKIEA+T W+R STVSEV
Subjt:  -----------------------------------------------------------------MTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEV

Query:  CSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGS---FSHEQNYPTHDLELAA----------------
         SFLGLAG                     +G  FVWS ACE SFQN KQKLVTA    V   + S    SHEQNYPTHDLELAA                
Subjt:  CSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGS---FSHEQNYPTHDLELAA----------------

Query:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK
                                       DYD EILYH GKANVVADALSRKVSHS ALITRQ P+H+D +R EIAVSVG +T QL QLTVQPTLRQ+
Subjt:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK

Query:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV
        IIDAQ ND  LVEK  L E GQA  FS+SSDGGLLFER LCVP+D+ VKT+LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKR+VAEFVS CLVCQQV
Subjt:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV

Query:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL
        KA RQKP GLLQPLS+PKWK ENVSMDF+TGL  TL+G+TVIWVVVD+LTKSA FVPGKSTYTASKWAQLY+++IVRL                   ++ 
Subjt:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL

Query:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL
         QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALYGKCCRS VCW EV EQR+M PEL
Subjt:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL

A0A5A7TP96 Reverse transcriptase1.5e-25154.77Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKEK+KACQI I   V++VTL+VLD+ DFDVILGMDWLA NHASIDCS KE
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+E ++ L+ EPVVR+Y DVFPEELPGLPPHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG
                                                                                                ++FLGHVVS  G
Subjt:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG

Query:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------
        V VDPAKIEA+T W+R STVSEV SFLGLAG                     +G  FVWS ACE SFQN KQKLVTAP+ TVPDG GSF           
Subjt:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------

Query:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA
                           SH+QNYPTHDLELAA                                               DYD E LYH GKANVVADA
Subjt:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA

Query:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT
        LSRKVSHS ALITRQ  +H+D +R +IAVSVG VT QL QLTVQPTLRQ+I+DAQ ND  LVEK  L E GQA  FS+SSDGGLLFER LCVP D+ VK 
Subjt:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT

Query:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT
        +LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS CLVCQQVKA RQKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G+TVIWVVVD+LT
Subjt:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT

Query:  KSAQFVPGKSTYTASKWAQLYLTKIVRLPEYL-YRL-FQTEMH--TDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
        KSA FVPGKSTYTASKWAQLY+++I  L   +  RL F T  H  TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALY
Subjt:  KSAQFVPGKSTYTASKWAQLYLTKIVRLPEYL-YRL-FQTEMH--TDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

Query:  GKCCRSSVCWDEVDEQRMMSPE
        GKCCRS VCW EV EQR+M PE
Subjt:  GKCCRSSVCWDEVDEQRMMSPE

A0A5A7U7V9 Reverse transcriptase5.5e-25756.47Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE +LS+EK+KACQI I   V++VTL+VLD+ DFDVILGMDWLA NHASIDCS KE
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+E ++ L+ EPVVR+Y DVFPEELPGLPPHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKELKKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGA---------------------------------------------------
        M P +LKEL KKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGA                                                   
Subjt:  MTPTDLKELKKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGA---------------------------------------------------

Query:  -------------------------------------TMTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEVCSFLGLAG-------------------
                                              ++FLGHVVS  GV VDPAKIEA+T W+R STVSEV SFLGLAG                   
Subjt:  -------------------------------------TMTFLGHVVSSEGVYVDPAKIEAITSWSRSSTVSEVCSFLGLAG-------------------

Query:  --RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF------------------------------SHEQNYPTHDLELAA----------------
          +G  FVWS ACE SFQN KQKLVTAP+ TVPDG GSF                              SHEQNYPTHDLELAA                
Subjt:  --RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF------------------------------SHEQNYPTHDLELAA----------------

Query:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK
                                       DYD EILYH GKANVVADALSRKVSHS ALITRQ P+H+D +R EIAVSVG VT QL QLTVQPTLRQ+
Subjt:  -------------------------------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQK

Query:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV
        IIDAQ ND  LVEK  L E GQ   FS+SSDGGLLFER LCVP D+ VKT+LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS CLVCQQV
Subjt:  IIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQV

Query:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL
        KA RQKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G+TVIWVVVD+LTKSA FVPGKSTYTASKWAQLY+++IVRL                   ++ 
Subjt:  KALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRL

Query:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL
         QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALYG+CCRS VCW EV EQR+M PEL
Subjt:  FQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL

A0A5A7UE75 Reverse transcriptase9.1e-25253.91Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKE++KACQI I   V++VTLLVLD+ DFDVILGMDWLA NHASIDCS K 
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+EV++ L+ EPVVR+Y DVFPEELPGLPPHRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG
                                                                                                ++FLGHVVS  G
Subjt:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG

Query:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------
        V VDPAKIEA+T W+R STVSEV SFLGLAG                     +G  FVWS ACE SFQN KQKLVTAPI TVPDG GSF           
Subjt:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------

Query:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA
                           SHEQNYPTHDLELAA                                               DYD EILYH GKANVVADA
Subjt:  -------------------SHEQNYPTHDLELAA-----------------------------------------------DYDYEILYHLGKANVVADA

Query:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT
        LSRKVSHS ALITRQ P+H+D +R EIAVSVG VT QL QLTVQ TLRQ+II AQ ND  LVEK  L E GQA+GFSISSDGGL FER LCVP D+V+KT
Subjt:  LSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKT

Query:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT
        +LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS CLVCQQVKA RQKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G+TVIWVVVD+LT
Subjt:  DLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLT

Query:  KSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLH
        KS  FVPGKST TASKWAQLY+++IVRL                   ++  QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLH
Subjt:  KSAQFVPGKSTYTASKWAQLYLTKIVRL---------------PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLH

Query:  LMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL
        LMEFAYNN++QATIGM PFEALYGKCCRS VCW EV EQR+M PEL
Subjt:  LMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL

A0A5A7V8L8 Pol protein2.6e-25455.62Show/hide
Query:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE
        +LGHYAL LFDSGSSHSFISS FV HA LEVEPL +VLSVSTPSGE MLSKEK+KACQI I + V++VTL+VLD+ DFDVILGMDWL  NHASIDCS KE
Subjt:  MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKE

Query:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK
        V                                           VVDT+E ++ L+ EPVVR+Y DVFPEELPGLP HRE++FA+ELEPGT    R PY+
Subjt:  VV------------------------------------------VVDTKEVEIFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYK

Query:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------
        M P +LKEL                           KKKDGSMRLCIDYRELNKVT KNRYPLP+IDDLFDQLQGAT                       
Subjt:  MTPTDLKEL---------------------------KKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGAT-----------------------

Query:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG
                                                                                                ++FLGHVVS  G
Subjt:  ----------------------------------------------------------------------------------------MTFLGHVVSSEG

Query:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------
        V VDPAKIEA+T W+R ST+SEV SFLGLAG                     +G  FVWS ACE SFQN KQKLVTAP+ TVPDG GSF           
Subjt:  VYVDPAKIEAITSWSRSSTVSEVCSFLGLAG---------------------RGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSF-----------

Query:  -------------------SHEQNYPTHDLELAA---------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQL
                           SHEQNYPTHDLELAA         DYD EILYH GKANVVADALSRKVSHS ALITRQ P+H+D +R EIAVSVG VT QL
Subjt:  -------------------SHEQNYPTHDLELAA---------DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQL

Query:  DQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVA
         QLTVQPTLRQ+IIDAQ ND  LVEK  L E GQA  FS+SSDGGLLFER LCVP D+ VKT+LL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVA
Subjt:  DQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVA

Query:  EFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL----------
        EFVS CLVCQQVKA  QKP GLLQPLS+P+WK ENVSMDF+TGL  TL+G++VIWVVVD+LTKSA FV GKSTYTASKWAQLY+++IVRL          
Subjt:  EFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGL-LTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL----------

Query:  -----PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDE
                 ++  QT M             TD QTERLNQVLEDMLRACALEFPG+WDSHLHLMEFAYNN++QATIGM PFEALYGKCC+S VCW EV E
Subjt:  -----PEYLYRLFQTEM------------HTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDE

Query:  QRMMSPEL
        QR+M PEL
Subjt:  QRMMSPEL

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.4e-3126.32Show/hide
Query:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD
        D+++EI Y  G AN +ADALSR       ++    PI +D +   I        + ++Q+++    + +++    ND  L+   +  +    +   +  D
Subjt:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD

Query:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT
        G L+  +  + +P DT +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   +   E++SMDF+T
Subjt:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT

Query:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE
         L    GY  ++VVVD+ +K A  VP   + TA + A+++  +++     P+ +      +F ++                      TD QTER NQ +E
Subjt:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE

Query:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
         +LR      P TW  H+ L++ +YNN   +   M PFE ++
Subjt:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

P0CT34 Transposon Tf2-1 polyprotein3.2e-0446.34Show/hide
Query:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM
        KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+
Subjt:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM

P0CT35 Transposon Tf2-2 polyprotein2.4e-3126.32Show/hide
Query:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD
        D+++EI Y  G AN +ADALSR       ++    PI +D +   I        + ++Q+++    + +++    ND  L+   +  +    +   +  D
Subjt:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD

Query:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT
        G L+  +  + +P DT +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   +   E++SMDF+T
Subjt:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT

Query:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE
         L    GY  ++VVVD+ +K A  VP   + TA + A+++  +++     P+ +      +F ++                      TD QTER NQ +E
Subjt:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE

Query:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
         +LR      P TW  H+ L++ +YNN   +   M PFE ++
Subjt:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

P0CT35 Transposon Tf2-2 polyprotein3.2e-0446.34Show/hide
Query:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM
        KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+
Subjt:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM

P0CT36 Transposon Tf2-3 polyprotein2.4e-3126.32Show/hide
Query:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD
        D+++EI Y  G AN +ADALSR       ++    PI +D +   I        + ++Q+++    + +++    ND  L+   +  +    +   +  D
Subjt:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD

Query:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT
        G L+  +  + +P DT +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   +   E++SMDF+T
Subjt:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT

Query:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE
         L    GY  ++VVVD+ +K A  VP   + TA + A+++  +++     P+ +      +F ++                      TD QTER NQ +E
Subjt:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE

Query:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
         +LR      P TW  H+ L++ +YNN   +   M PFE ++
Subjt:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

P0CT36 Transposon Tf2-3 polyprotein3.2e-0446.34Show/hide
Query:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM
        KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+
Subjt:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM

P0CT41 Transposon Tf2-12 polyprotein2.4e-3126.32Show/hide
Query:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD
        D+++EI Y  G AN +ADALSR       ++    PI +D +   I        + ++Q+++    + +++    ND  L+   +  +    +   +  D
Subjt:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD

Query:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT
        G L+  +  + +P DT +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   +   E++SMDF+T
Subjt:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT

Query:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE
         L    GY  ++VVVD+ +K A  VP   + TA + A+++  +++     P+ +      +F ++                      TD QTER NQ +E
Subjt:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE

Query:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
         +LR      P TW  H+ L++ +YNN   +   M PFE ++
Subjt:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

P0CT41 Transposon Tf2-12 polyprotein3.2e-0446.34Show/hide
Query:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM
        KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+
Subjt:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM

Q9UR07 Transposon Tf2-11 polyprotein2.4e-3126.32Show/hide
Query:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD
        D+++EI Y  G AN +ADALSR       ++    PI +D +   I        + ++Q+++    + +++    ND  L+   +  +    +   +  D
Subjt:  DYDYEILYHLGKANVVADALSRKVSHSTALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSD

Query:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT
        G L+  +  + +P DT +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   +   E++SMDF+T
Subjt:  GGLLFER-SLCVPVDTVVKTDLLNEAHSSPFSMHPGSTKMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLT

Query:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE
         L    GY  ++VVVD+ +K A  VP   + TA + A+++  +++     P+ +      +F ++                      TD QTER NQ +E
Subjt:  GLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRL---PEYLY----RLFQTEM--------------------HTDDQTERLNQVLE

Query:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY
         +LR      P TW  H+ L++ +YNN   +   M PFE ++
Subjt:  DMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALY

Q9UR07 Transposon Tf2-11 polyprotein3.2e-0446.34Show/hide
Query:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM
        KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+
Subjt:  KKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATM

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.6e-0629.9Show/hide
Query:  MTFLG--HVVSSEGVYVDPAKIEAITSWSRSSTVSEVCSFLGLAGRGLLFV--------------------WSPACESSFQNFKQKLVTAPIFTVPD
        + +LG  H++S EGV  DPAK+EA+  W      +E+  FLGL G    FV                    W+     +F+  K  + T P+  +PD
Subjt:  MTFLG--HVVSSEGVYVDPAKIEAITSWSRSSTVSEVCSFLGLAGRGLLFV--------------------WSPACESSFQNFKQKLVTAPIFTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGGCATTATGCCTTAGCCTTGTTTGATTCTGGTTCCTCTCACTCTTTTATATCGTCGATATTTGTAACACATGCATGTTTAGAAGTAGAGCCCCTAGGTTATGT
TTTATCAGTGTCTACACCATCTGGGGAGATTATGTTGTCAAAAGAAAAGATTAAAGCATGCCAAATTGGGATATTTAATCGTGTGTTGGATGTAACCTTACTAGTATTGG
ATATACGTGATTTCGATGTGATTTTAGGCATGGATTGGTTAGCTACTAATCATGCTAGTATCGATTGTTCTAGCAAAGAGGTAGTTGTGGTGGATACTAAAGAAGTTGAG
ATTTTCTTGACATTAGAACCTGTGGTAAGGGAATACTCTGATGTGTTTCCAGAAGAGCTTCCAGGACTTCCGCCTCACAGAGAGATCGATTTTGCTGTAGAGTTGGAGCC
TGGTACCGCTCTCACTTTTAGAGACCCGTATAAAATGACTCCAACTGATTTGAAAGAATTGAAGAAAAAGGATGGGTCGATGCGTCTTTGCATTGACTATAGGGAACTTA
ATAAAGTAACAACCAAGAATAGGTATCCATTGCCCAAAATTGATGATTTGTTTGATCAGTTGCAAGGAGCCACTATGACTTTCCTTGGTCATGTGGTATCTAGTGAGGGA
GTTTATGTGGATCCTGCAAAGATTGAAGCTATTACCAGTTGGTCTCGATCCTCTACAGTTAGTGAGGTTTGTAGTTTTCTGGGTTTAGCAGGAAGGGGACTCCTTTTTGT
ATGGAGCCCAGCATGTGAGAGTAGTTTTCAGAACTTTAAGCAGAAGTTGGTTACTGCACCGATTTTTACAGTGCCAGATGGATTTGGAAGTTTTAGTCACGAGCAGAACT
ACCCTACACATGATCTAGAGTTGGCAGCAGATTATGACTATGAGATATTATACCATCTTGGTAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAAGTATCACATTCA
ACAGCGCTTATTACTAGACAGACACCGATACATCAAGACTTTAAGAGAGGTGAGATTGCAGTATCTGTGGGGGTAGTTACCTCACAATTGGATCAATTAACAGTGCAACC
AACTTTGAGGCAGAAGATTATTGATGCTCAGAGAAATGATCTTTGTTTGGTTGAGAAGTGTCACCTTATAGAAACAGGGCAAGCTGATGGGTTCTCCATATCCTCTGATG
GTGGACTTTTGTTTGAGAGGAGTTTGTGTGTGCCAGTAGACACTGTAGTTAAGACTGATCTATTGAATGAAGCTCATAGTTCCCCATTTTCTATGCATCCTGGTAGTACG
AAAATGTATCAGGATCTGAAACGGATGTATTGGTGGCGTAATATGAAGAGAGAAGTGGCAGAGTTTGTTAGTATATGCTTGGTGTGCCAGCAGGTTAAAGCACTTAGACA
GAAACCAACAGGTTTGTTGCAACCTTTGAGTGTGCCAAAATGGAAGTTGGAGAATGTGTCTATGGATTTTCTTACCGGACTGCTTACCCTAAAGGGTTACACAGTGATTT
GGGTTGTTGTCGACAAACTTACGAAATCGGCACAATTCGTTCCAGGAAAATCCACTTATACTGCTAGTAAGTGGGCACAATTATATTTGACTAAGATTGTGAGACTGCCG
GAGTACCTGTATCGATTGTTTCAGACAGAGATGCATACTGATGATCAAACTGAACGTTTGAACCAAGTTTTGGAGGATATGCTACGAGCTTGTGCATTAGAGTTTCCAGG
TACTTGGGACTCTCACTTGCATTTGATGGAGTTTGCTTATAATAACAACTTTCAGGCTACTATTGGCATGGTGCCATTTGAAGCTTTATATGGTAAATGTTGTAGATCTT
CTGTTTGCTGGGACGAGGTTGATGAACAAAGAATGATGAGTCCTGAGTTG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGGGCATTATGCCTTAGCCTTGTTTGATTCTGGTTCCTCTCACTCTTTTATATCGTCGATATTTGTAACACATGCATGTTTAGAAGTAGAGCCCCTAGGTTATGT
TTTATCAGTGTCTACACCATCTGGGGAGATTATGTTGTCAAAAGAAAAGATTAAAGCATGCCAAATTGGGATATTTAATCGTGTGTTGGATGTAACCTTACTAGTATTGG
ATATACGTGATTTCGATGTGATTTTAGGCATGGATTGGTTAGCTACTAATCATGCTAGTATCGATTGTTCTAGCAAAGAGGTAGTTGTGGTGGATACTAAAGAAGTTGAG
ATTTTCTTGACATTAGAACCTGTGGTAAGGGAATACTCTGATGTGTTTCCAGAAGAGCTTCCAGGACTTCCGCCTCACAGAGAGATCGATTTTGCTGTAGAGTTGGAGCC
TGGTACCGCTCTCACTTTTAGAGACCCGTATAAAATGACTCCAACTGATTTGAAAGAATTGAAGAAAAAGGATGGGTCGATGCGTCTTTGCATTGACTATAGGGAACTTA
ATAAAGTAACAACCAAGAATAGGTATCCATTGCCCAAAATTGATGATTTGTTTGATCAGTTGCAAGGAGCCACTATGACTTTCCTTGGTCATGTGGTATCTAGTGAGGGA
GTTTATGTGGATCCTGCAAAGATTGAAGCTATTACCAGTTGGTCTCGATCCTCTACAGTTAGTGAGGTTTGTAGTTTTCTGGGTTTAGCAGGAAGGGGACTCCTTTTTGT
ATGGAGCCCAGCATGTGAGAGTAGTTTTCAGAACTTTAAGCAGAAGTTGGTTACTGCACCGATTTTTACAGTGCCAGATGGATTTGGAAGTTTTAGTCACGAGCAGAACT
ACCCTACACATGATCTAGAGTTGGCAGCAGATTATGACTATGAGATATTATACCATCTTGGTAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAAGTATCACATTCA
ACAGCGCTTATTACTAGACAGACACCGATACATCAAGACTTTAAGAGAGGTGAGATTGCAGTATCTGTGGGGGTAGTTACCTCACAATTGGATCAATTAACAGTGCAACC
AACTTTGAGGCAGAAGATTATTGATGCTCAGAGAAATGATCTTTGTTTGGTTGAGAAGTGTCACCTTATAGAAACAGGGCAAGCTGATGGGTTCTCCATATCCTCTGATG
GTGGACTTTTGTTTGAGAGGAGTTTGTGTGTGCCAGTAGACACTGTAGTTAAGACTGATCTATTGAATGAAGCTCATAGTTCCCCATTTTCTATGCATCCTGGTAGTACG
AAAATGTATCAGGATCTGAAACGGATGTATTGGTGGCGTAATATGAAGAGAGAAGTGGCAGAGTTTGTTAGTATATGCTTGGTGTGCCAGCAGGTTAAAGCACTTAGACA
GAAACCAACAGGTTTGTTGCAACCTTTGAGTGTGCCAAAATGGAAGTTGGAGAATGTGTCTATGGATTTTCTTACCGGACTGCTTACCCTAAAGGGTTACACAGTGATTT
GGGTTGTTGTCGACAAACTTACGAAATCGGCACAATTCGTTCCAGGAAAATCCACTTATACTGCTAGTAAGTGGGCACAATTATATTTGACTAAGATTGTGAGACTGCCG
GAGTACCTGTATCGATTGTTTCAGACAGAGATGCATACTGATGATCAAACTGAACGTTTGAACCAAGTTTTGGAGGATATGCTACGAGCTTGTGCATTAGAGTTTCCAGG
TACTTGGGACTCTCACTTGCATTTGATGGAGTTTGCTTATAATAACAACTTTCAGGCTACTATTGGCATGGTGCCATTTGAAGCTTTATATGGTAAATGTTGTAGATCTT
CTGTTTGCTGGGACGAGGTTGATGAACAAAGAATGATGAGTCCTGAGTTG
Protein sequenceShow/hide protein sequence
MLGHYALALFDSGSSHSFISSIFVTHACLEVEPLGYVLSVSTPSGEIMLSKEKIKACQIGIFNRVLDVTLLVLDIRDFDVILGMDWLATNHASIDCSSKEVVVVDTKEVE
IFLTLEPVVREYSDVFPEELPGLPPHREIDFAVELEPGTALTFRDPYKMTPTDLKELKKKDGSMRLCIDYRELNKVTTKNRYPLPKIDDLFDQLQGATMTFLGHVVSSEG
VYVDPAKIEAITSWSRSSTVSEVCSFLGLAGRGLLFVWSPACESSFQNFKQKLVTAPIFTVPDGFGSFSHEQNYPTHDLELAADYDYEILYHLGKANVVADALSRKVSHS
TALITRQTPIHQDFKRGEIAVSVGVVTSQLDQLTVQPTLRQKIIDAQRNDLCLVEKCHLIETGQADGFSISSDGGLLFERSLCVPVDTVVKTDLLNEAHSSPFSMHPGST
KMYQDLKRMYWWRNMKREVAEFVSICLVCQQVKALRQKPTGLLQPLSVPKWKLENVSMDFLTGLLTLKGYTVIWVVVDKLTKSAQFVPGKSTYTASKWAQLYLTKIVRLP
EYLYRLFQTEMHTDDQTERLNQVLEDMLRACALEFPGTWDSHLHLMEFAYNNNFQATIGMVPFEALYGKCCRSSVCWDEVDEQRMMSPEL