; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G021790 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G021790
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr04:15932703..15934254
RNA-Seq ExpressionCmoCh04G021790
SyntenyCmoCh04G021790
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025998.1 pol protein [Cucumis melo var. makuwa]2.2e-18564.92Show/hide
Query:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK
        G SF+F+      + K I A+KA KL+       LASV  VR  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELK
Subjt:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK

Query:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY
        ELK+QLQELL++G                              +LN VT+KN+YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RY
Subjt:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK
        GHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK

Query:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN
        +EA+  W RP+TV+E+RSFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACE SFQELK++L +APVL VP+G+GN VIYSDASK GLGCVLMQ 
Subjt:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN

Query:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH
        G+V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AH
Subjt:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH

Query:  SSAMLTRQHNIQMEFE
        S+A++T+Q  +  +FE
Subjt:  SSAMLTRQHNIQMEFE

KAA0040547.1 pol protein [Cucumis melo var. makuwa]6.4e-18564.53Show/hide
Query:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK
        G SF+F+      + K I A+KA KL+       LASV  +R  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELK
Subjt:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK

Query:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY
        ELK+QLQELL++G                              +LN VT+KN+YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RY
Subjt:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK
        GHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK

Query:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN
        +EA+  W RP+TV+E++SFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACESSFQELK++L +APVL VP+G+GN VIYSDASK GLGCVLMQ 
Subjt:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN

Query:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH
        G+V+AYASRQLK YE++YPTHDLEL AV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AH
Subjt:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH

Query:  SSAMLTRQHNIQMEFE
        S+ ++T+Q  +  +FE
Subjt:  SSAMLTRQHNIQMEFE

KAA0060745.1 pol protein [Cucumis melo var. makuwa]6.4e-18564.73Show/hide
Query:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK
        G SF+F+      + K I A+KA KL+       LASV  +R  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELK
Subjt:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK

Query:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY
        ELK+QLQELL++G                              +LN VT+KN+YPLP+IDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RY
Subjt:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK
        GHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK

Query:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN
        +EA+  W RP+TV+E+RSFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACESSFQELK++L  APVL VP+G+GN VIYSDASK GLGCVLMQ 
Subjt:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN

Query:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH
        G+V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AH
Subjt:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH

Query:  SSAMLTRQHNIQMEFE
        S+A++T+Q  +  +FE
Subjt:  SSAMLTRQHNIQMEFE

TYK20443.1 pol protein [Cucumis melo var. makuwa]8.4e-18564.98Show/hide
Query:  SFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELKEL
        SF+F+      + K I A+KA KL+       LASV  VR  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELKEL
Subjt:  SFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELKEL

Query:  KLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGH
        K+QLQELL++G                              +LN VT+KN+YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RYGH
Subjt:  KLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGH

Query:  YEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAKVE
        YEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK+E
Subjt:  YEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAKVE

Query:  AIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR
        A+  W RP+TV+E+RSFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACE SFQELK++L +APVL VP+G+GN VIYSDASK GLGCVLMQ G+
Subjt:  AIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR

Query:  VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHSS
        V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AHS+
Subjt:  VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHSS

Query:  AMLTRQHNIQMEFE
        A++T+Q  +  +FE
Subjt:  AMLTRQHNIQMEFE

XP_022931734.1 uncharacterized protein LOC111437896 [Cucurbita moschata]2.8e-18866.15Show/hide
Query:  MGKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTEL
        +G++F F+     S  + I ALKARK++     A LASV KV +    VS VPVV EF DVF +E+PGLPP REV+  I+LEPG T  SK PYRMAP EL
Subjt:  MGKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTEL

Query:  KELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTR
        KELK+QLQELL++G                              +LN VTIKNKYPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R++E+D+PKTAFR+R
Subjt:  KELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTR

Query:  YGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV-----------------------FLGHVVSKDGITVDPA
        YGHYEF+VMSFGLTNAPAVFMELMNRVF++FLD+FVIVFIDDILVYSK+  EH  HLR+V+                       FLGHVVS  G+TVDPA
Subjt:  YGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV-----------------------FLGHVVSKDGITVDPA

Query:  KVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQ
        K+EA++ W RPTTVTEVRSFLGLAGYYRRFIKDF++++A LTQLT+KGK F W++ CE SF ELK+RL +APVL VP+G+G LV+YSDAS  GLGCVLMQ
Subjt:  KVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQ

Query:  NGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTA
         G+VIAYASRQLK+YER+YPTHDLELAAV +ALK WRHYLYGER+QV+TDHKSLKYLFTQKEL MRQRRWLELVKDYD+EIL+HPGKANVVADALSRKTA
Subjt:  NGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTA

Query:  HSSAMLTRQHNIQMEFE
        H+SA++TRQ  +Q E +
Subjt:  HSSAMLTRQHNIQMEFE

TrEMBL top hitse value%identityAlignment
A0A5A7SIJ5 Reverse transcriptase1.1e-18564.92Show/hide
Query:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK
        G SF+F+      + K I A+KA KL+       LASV  VR  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELK
Subjt:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK

Query:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY
        ELK+QLQELL++G                              +LN VT+KN+YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RY
Subjt:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK
        GHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK

Query:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN
        +EA+  W RP+TV+E+RSFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACE SFQELK++L +APVL VP+G+GN VIYSDASK GLGCVLMQ 
Subjt:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN

Query:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH
        G+V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AH
Subjt:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH

Query:  SSAMLTRQHNIQMEFE
        S+A++T+Q  +  +FE
Subjt:  SSAMLTRQHNIQMEFE

A0A5A7TG62 Reverse transcriptase3.1e-18564.53Show/hide
Query:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK
        G SF+F+      + K I A+KA KL+       LASV  +R  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELK
Subjt:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK

Query:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY
        ELK+QLQELL++G                              +LN VT+KN+YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RY
Subjt:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK
        GHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK

Query:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN
        +EA+  W RP+TV+E++SFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACESSFQELK++L +APVL VP+G+GN VIYSDASK GLGCVLMQ 
Subjt:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN

Query:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH
        G+V+AYASRQLK YE++YPTHDLEL AV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AH
Subjt:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH

Query:  SSAMLTRQHNIQMEFE
        S+ ++T+Q  +  +FE
Subjt:  SSAMLTRQHNIQMEFE

A0A5A7V4E4 Reverse transcriptase3.1e-18564.73Show/hide
Query:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK
        G SF+F+      + K I A+KA KL+       LASV  +R  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELK
Subjt:  GKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELK

Query:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY
        ELK+QLQELL++G                              +LN VT+KN+YPLP+IDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RY
Subjt:  ELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK
        GHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAK

Query:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN
        +EA+  W RP+TV+E+RSFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACESSFQELK++L  APVL VP+G+GN VIYSDASK GLGCVLMQ 
Subjt:  VEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQN

Query:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH
        G+V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AH
Subjt:  GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAH

Query:  SSAMLTRQHNIQMEFE
        S+A++T+Q  +  +FE
Subjt:  SSAMLTRQHNIQMEFE

A0A5D3BTN0 Reverse transcriptase4.1e-18564.98Show/hide
Query:  SFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELKEL
        SF+F+      + K I A+KA KL+       LASV  VR  +  +S  PVV E+ DVF DE+PGLPP REV+  IELEPG    S+APYRMAP ELKEL
Subjt:  SFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELKEL

Query:  KLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGH
        K+QLQELL++G                              +LN VT+KN+YPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R+++ D+PKTAFR+RYGH
Subjt:  KLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGH

Query:  YEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAKVE
        YEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EH                       LRKV FLGHVVS +G++VDPAK+E
Subjt:  YEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEH-----------------------LRKVVFLGHVVSKDGITVDPAKVE

Query:  AIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR
        A+  W RP+TV+E+RSFLGLAGYYRRF++DF+RIA+PLTQLTRKG  F WS ACE SFQELK++L +APVL VP+G+GN VIYSDASK GLGCVLMQ G+
Subjt:  AIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR

Query:  VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHSS
        V+AYASRQLK +E++YPTHDLELAAV FALK+WRHYLYGE+IQ++TDHKSLKY FTQKEL MRQRRWLELVKDYD EIL+HPGKANVVADALSRK AHS+
Subjt:  VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHSS

Query:  AMLTRQHNIQMEFE
        A++T+Q  +  +FE
Subjt:  AMLTRQHNIQMEFE

A0A6J1EV26 Reverse transcriptase1.4e-18866.15Show/hide
Query:  MGKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTEL
        +G++F F+     S  + I ALKARK++     A LASV KV +    VS VPVV EF DVF +E+PGLPP REV+  I+LEPG T  SK PYRMAP EL
Subjt:  MGKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTEL

Query:  KELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTR
        KELK+QLQELL++G                              +LN VTIKNKYPLPRIDDLFDQLQGA VFSKIDL SGYHQ+R++E+D+PKTAFR+R
Subjt:  KELKLQLQELLNQG------------------------------KLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTR

Query:  YGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV-----------------------FLGHVVSKDGITVDPA
        YGHYEF+VMSFGLTNAPAVFMELMNRVF++FLD+FVIVFIDDILVYSK+  EH  HLR+V+                       FLGHVVS  G+TVDPA
Subjt:  YGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVV-----------------------FLGHVVSKDGITVDPA

Query:  KVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQ
        K+EA++ W RPTTVTEVRSFLGLAGYYRRFIKDF++++A LTQLT+KGK F W++ CE SF ELK+RL +APVL VP+G+G LV+YSDAS  GLGCVLMQ
Subjt:  KVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQ

Query:  NGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTA
         G+VIAYASRQLK+YER+YPTHDLELAAV +ALK WRHYLYGER+QV+TDHKSLKYLFTQKEL MRQRRWLELVKDYD+EIL+HPGKANVVADALSRKTA
Subjt:  NGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTA

Query:  HSSAMLTRQHNIQMEFE
        H+SA++TRQ  +Q E +
Subjt:  HSSAMLTRQHNIQMEFE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.6e-7236.39Show/hide
Query:  EREVNLDIE--LEPGMTLNSKAPYR-----MAPTELKELKLQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPK
        E+EV   I+  L  G+   S +PY      +   +    K + + +++  KLN +T+ +++P+P +D++  +L     F+ IDL  G+HQI +  + V K
Subjt:  EREVNLDIE--LEPGMTLNSKAPYR-----MAPTELKELKLQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPK

Query:  TAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL-----------------------RKVVFLGHVVSKDG
        TAF T++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+V+S + DEH + L                       ++  FLGHV++ DG
Subjt:  TAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL-----------------------RKVVFLGHVVSKDG

Query:  ITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHG
        I  +P K+EAI  +  PT   E+++FLGL GYYR+FI +FA IA P+T+  +K  K D +    +S+F++LK  ++  P+L VP+ T    + +DAS   
Subjt:  ITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHG

Query:  LGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVAD
        LG VL Q+G  ++Y SR L ++E +Y T + EL A+ +A K +RHYL G   ++ +DH+ L +L+  K+   +  RW   + ++D +I +  GK N VAD
Subjt:  LGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVAD

Query:  ALSR
        ALSR
Subjt:  ALSR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy6.7e-6032.52Show/hide
Query:  VTKPGSVSK----TIIALKARKLIRHDVVAFLASV-AKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPERE-VNLDIE--LEPGMTLNSKAPYRMAPTEL
        +  P SV K    TII  K      ++ + F  +V A +R VD +            V+    P L    + VN +++  L+ G+   S++PY  +PT +
Subjt:  VTKPGSVSK----TIIALKARKLIRHDVVAFLASV-AKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPERE-VNLDIE--LEPGMTLNSKAPYRMAPTEL

Query:  KELK-------LQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL
         + K          + +++  KLN  TI ++YP+P I  +   L  A  F+ +DL SGYHQI + E D  KT+F    G YEF  + FGL NA ++F   
Subjt:  KELK-------LQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL

Query:  MNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL-----------------------RKVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGL
        ++ V ++ +     V++DD++++S+   +H  H+                         V +LG +VSKDG   DP KV+AI  +  P  V +VRSFLGL
Subjt:  MNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHL-----------------------RKVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGL

Query:  AGYYRRFIKDFARIAAPLTQLTR-----------KGKKFDWSRACESSFQELKERLASAPVLI-VPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQ
        A YYR FIKDFA IA P+T + +           K    +++    ++FQ L+  LAS  V++  P+      + +DAS  G+G VL Q GR I   SR 
Subjt:  AGYYRRFIKDFARIAAPLTQLTR-----------KGKKFDWSRACESSFQELKERLASAPVLI-VPNGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQ

Query:  LKDYERSYPTHDLELAAVGFALKVWRHYLYGER-IQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHS
        LK  E++Y T++ EL A+ +AL   +++LYG R I + TDH+ L +    +    + +RW   +  ++ ++ + PGK N VADALSR+  ++
Subjt:  LKDYERSYPTHDLELAAVGFALKVWRHYLYGER-IQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSRKTAHS

P20825 Retrovirus-related Pol polyprotein from transposon 2971.2e-6937.31Show/hide
Query:  LEPGMTLNSKAPYRMAPTELKELK------LQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHY
        L  G+   S +PY  +PT +   K       + + +++  KLN +TI ++YP+P +D++  +L     F+ IDL  G+HQI + E+ + KTAF T+ GHY
Subjt:  LEPGMTLNSKAPYRMAPTELKELK------LQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHY

Query:  EFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH----------------------AEHLRKVV-FLGHVVSKDGITVDPAKVEA
        E++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH                       E L+K   FLGH+V+ DGI  +P KV+A
Subjt:  EFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH----------------------AEHLRKVV-FLGHVVSKDGITVDPAKVEA

Query:  IIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR
        I+ +  PT   E+R+FLGL GYYR+FI ++A IA P+T   +K  K D  +     +F++LK  +   P+L +P+     V+ +DAS   LG VL QNG 
Subjt:  IIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR

Query:  VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR
         I++ SR L D+E +Y   + EL A+ +A K +RHYL G +  + +DH+ L++L   KE   +  RW   + +Y  +I +  GK N VADALSR
Subjt:  VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.9e-6736.93Show/hide
Query:  EREVNLDIELEPGMTLNSKAPYR----MAPTELK-ELKLQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTA
        E E  +D  L+ G+   S +PY     + P + K   + Q + +++  +LNTVTI + YP+P I+     L  A  F+ +DL SG+HQI +KE D+PKTA
Subjt:  EREVNLDIELEPGMTLNSKAPYR----MAPTELK-ELKLQLQELLNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTA

Query:  FRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLR-----------------------KVVFLGHVVSKDGIT
        F T  G YEF+ + FGL NAPA+F  +++ + ++ +     V+IDDI+V+S+  D H ++LR                       +V FLG++V+ DGI 
Subjt:  FRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLR-----------------------KVVFLGHVVSKDGIT

Query:  VDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTR-----------KGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVI
         DP KV AI     PT+V E++ FLG+  YYR+FI+D+A++A PLT LTR                        SF +LK  L S+ +L  P  T    +
Subjt:  VDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTR-----------KGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNLVI

Query:  YSDASKHGLGCVLMQN----GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGE-RIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVE
         +DAS   +G VL Q+     R IAY SR L   E +Y T + E+ A+ ++L   R YLYG   I+V+TDH+ L +    +    + +RW   +++Y+ E
Subjt:  YSDASKHGLGCVLMQN----GRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGE-RIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVE

Query:  ILHHPGKANVVADALSR
        +++ PGK+NVVADALSR
Subjt:  ILHHPGKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.3e-5832.44Show/hide
Query:  EFLDVFLDEMPGLPPERE---VNLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQ------------------------------GKLNTVTIKN
        ++ ++  +++P  P +     V  DIE++PG  L    PY +     +E+   +Q+LL+                                 LN  TI +
Subjt:  EFLDVFLDEMPGLPPERE---VNLDIELEPGMTLNSKAPYRMAPTELKELKLQLQELLNQ------------------------------GKLNTVTIKN

Query:  KYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH
         +PLPRID+L  ++  A +F+ +DL SGYHQI ++  D  KTAF T  G YE+ VM FGL NAP+ F   M   F+D    FV V++DDIL++S++ +EH
Subjt:  KYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH

Query:  AEHLRKVV-----------------------FLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDW
         +HL  V+                       FLG+ +    I     K  AI  +  P TV + + FLG+  YYRRFI + ++IA P+        K  W
Subjt:  AEHLRKVV-----------------------FLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDW

Query:  SRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR------VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQV
        +   + +  +LK+ L ++PVL+  N   N  + +DASK G+G VL +         V+ Y S+ L+  +++YP  +LEL  +  AL  +R+ L+G+   +
Subjt:  SRACESSFQELKERLASAPVLIVPNGTGNLVIYSDASKHGLGCVLMQNGR------VIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQV

Query:  HTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR
         TDH SL  L  + E   R +RWL+ +  YD  + +  G  NVVADA+SR
Subjt:  HTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.3e-2042.42Show/hide
Query:  KVVFLG--HVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPN
        ++ +LG  H++S +G++ DPAK+EA++GW  P   TE+R FLGL GYYRRF+K++ +I  PLT+L +K     W+     +F+ LK  + + PVL +P+
Subjt:  KVVFLG--HVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGAGCTTCCGGTTTCAGGTTACGAAGCCGGGAAGTGTTTCTAAGACAATAATTGCTCTAAAGGCTAGAAAGTTGATTAGGCATGATGTTGTGGCGTTCCTGGC
TAGTGTGGCAAAAGTGAGACGAGTTGATAAAGATGTATCAGTTGTACCTGTAGTGAATGAGTTCCTGGATGTCTTCCTGGATGAGATGCCTGGTTTGCCACCAGAGCGGG
AAGTCAACCTTGACATTGAACTTGAACCGGGGATGACACTTAACTCTAAAGCTCCTTATAGGATGGCCCCCACGGAATTGAAAGAGTTGAAATTGCAACTACAAGAATTG
TTGAACCAGGGGAAGCTGAATACGGTGACCATTAAAAATAAGTATCCCTTGCCGCGAATTGATGACTTGTTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGA
TCTTTGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAGTTTGTTGTGATGTCCTTTGGCTTGA
CTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAAC
GATGAACATGCAGAACATTTGAGGAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAATTATAGGTTGGGTTCG
ACCAACTACAGTTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAA
AAGGGAAGAAATTTGATTGGAGTCGAGCGTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTAACGGTACTGGAAACCTA
GTAATTTATAGTGATGCCTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCC
AACTCATGATTTAGAATTAGCTGCTGTGGGGTTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTACATACTGATCATAAGAGTCTTAAATATC
TGTTCACCCAGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGAT
GCCTTGAGTCGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAGAGCTTCCGGTTTCAGGTTACGAAGCCGGGAAGTGTTTCTAAGACAATAATTGCTCTAAAGGCTAGAAAGTTGATTAGGCATGATGTTGTGGCGTTCCTGGC
TAGTGTGGCAAAAGTGAGACGAGTTGATAAAGATGTATCAGTTGTACCTGTAGTGAATGAGTTCCTGGATGTCTTCCTGGATGAGATGCCTGGTTTGCCACCAGAGCGGG
AAGTCAACCTTGACATTGAACTTGAACCGGGGATGACACTTAACTCTAAAGCTCCTTATAGGATGGCCCCCACGGAATTGAAAGAGTTGAAATTGCAACTACAAGAATTG
TTGAACCAGGGGAAGCTGAATACGGTGACCATTAAAAATAAGTATCCCTTGCCGCGAATTGATGACTTGTTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGA
TCTTTGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAGTTTGTTGTGATGTCCTTTGGCTTGA
CTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAAC
GATGAACATGCAGAACATTTGAGGAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAATTATAGGTTGGGTTCG
ACCAACTACAGTTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAA
AAGGGAAGAAATTTGATTGGAGTCGAGCGTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTAACGGTACTGGAAACCTA
GTAATTTATAGTGATGCCTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCC
AACTCATGATTTAGAATTAGCTGCTGTGGGGTTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTACATACTGATCATAAGAGTCTTAAATATC
TGTTCACCCAGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGAT
GCCTTGAGTCGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAATGA
Protein sequenceShow/hide protein sequence
MGKSFRFQVTKPGSVSKTIIALKARKLIRHDVVAFLASVAKVRRVDKDVSVVPVVNEFLDVFLDEMPGLPPEREVNLDIELEPGMTLNSKAPYRMAPTELKELKLQLQEL
LNQGKLNTVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLCSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTN
DEHAEHLRKVVFLGHVVSKDGITVDPAKVEAIIGWVRPTTVTEVRSFLGLAGYYRRFIKDFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPNGTGNL
VIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVGFALKVWRHYLYGERIQVHTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVAD
ALSRKTAHSSAMLTRQHNIQMEFE