CuGenDBv2

CmoCh20G011190 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G011190
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Genome location: Cmo_Chr20:10676309..10678249
RNA-Seq Expression: CmoCh20G011190
Synteny: CmoCh20G011190
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
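The genome location field above uses a `seqid:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page), the span of the gene can be computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome interval (assumed 1-based, inclusive)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the one in this record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("Cmo_Chr20:10676309..10678249")
print(loc.seqid, loc.length)  # prints: Cmo_Chr20 1941
```

Under that assumption the gene spans 1,941 bp, which is a multiple of three (647 codons including the stop).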


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025998.1 pol protein [Cucumis melo var. makuwa]; e value 5.2e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

KAA0063793.1 pol protein [Cucumis melo var. makuwa]; e value 3.1e-193; identity 80.5%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPDKLPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

TYK01576.1 pol protein [Cucumis melo var. makuwa]; e value 5.2e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

TYK20443.1 pol protein [Cucumis melo var. makuwa]; e value 5.2e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

XP_022931734.1 uncharacterized protein LOC111437896 [Cucurbita moschata]; e value 5.2e-193; identity 81%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV EF DVFP++LPGLPP REV+F I+LEP TTPISK  YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDGT+RLCIDYRELNKVTIK
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        NKYPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R++E+D+PKTAFR+RYGHYEF+VMSFGLTNAPAVFMELMNRVF++FLD+FVIVFIDDILVYSK+  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H  HLR+VL +LR  +LYAKFSKCEFWLQ+V FLGHVVS  G+TVDPAK+EAV+ W RPTT+TEVRSFLGLAGYYR FIKDF++++A LTQLT+KGK F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        W++ CE SF ELK+RL +APVL VPDG+G LV+YSDAS  GLGCVLMQ G+VIAYASRQLK+YERNYPTHDLELAAVV+ALK WRHYLYGER+QVYTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SIJ5 Reverse transcriptase; e value 2.5e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

A0A5A7V2A0 Reverse transcriptase; e value 2.5e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

A0A5A7V6R2 Reverse transcriptase; e value 1.5e-193; identity 80.5%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPDKLPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

A0A5D3BTN0 Reverse transcriptase; e value 2.5e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

A0A5D3C6W3 Reverse transcriptase; e value 2.5e-193; identity 80.25%
Query:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK
        PVV E+ DVFPD+LPGLPP REV+F IELEP T PIS+A YRMAPAELKELK+QLQELL++GFIRPSVSPWGAPVLFVKKKDG++RLCIDYRELNKVT+K
Subjt:  PVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIK

Query:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE
        N+YPLPRIDDLFDQLQGA VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  E
Subjt:  NKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDE

Query:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD
        H EHL +VL  LR  +LYAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+T++E+RSFLGLAGYYR F++DF+RIA+PLTQLTRKG  F 
Subjt:  HAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFD

Query:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
        WS ACE SFQELK++L +APVL VPDG+GN VIYSDASK GLGCVLMQ G+V+AYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGE+IQ+YTDHK
Subjt:  WSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6; e value 1.7e-82; identity 42.16%
Query:  SKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGT-----LRLCIDYRELNKVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRSGYH
        SK SY  A  +  E++ Q+Q++LNQG IR S SP+ +P+  V KK         R+ IDYR+LN++T+ +++P+P +D++  +L     F+ IDL  G+H
Subjt:  SKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGT-----LRLCIDYRELNKVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRSGYH

Query:  QIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWLQKV
        QI +  + V KTAF T++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+V+S + DEH + L  V   L K  L  +  KCEF  Q+ 
Subjt:  QIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWLQKV

Query:  VFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPDGTGN
         FLGHV++ DGI  +P K+EA+  +  PT   E+++FLGL GYYR FI +FA IA P+T+  +K  K D +    +S+F++LK  ++  P+L VPD T  
Subjt:  VFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPDGTGN

Query:  LVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
          + +DAS   LG VL Q+G  ++Y SR L ++E NY T + EL A+V+A K +RHYL G   ++ +DH+
Subjt:  LVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

P20825 Retrovirus-related Pol polyprotein from transposon 297; e value 1.6e-80; identity 40.75%
Query:  TPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKD-----GTLRLCIDYRELNKVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRS
        +PI    Y +A     E++ Q+QE+LNQG IR S SP+ +P   V KK         R+ IDYR+LN++TI ++YP+P +D++  +L     F+ IDL  
Subjt:  TPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKD-----GTLRLCIDYRELNKVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRS

Query:  GYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWL
        G+HQI + E+ + KTAF T+ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH   ++ V   L    L  +  KCEF  
Subjt:  GYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWL

Query:  QKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPDG
        ++  FLGH+V+ DGI  +P KV+A++ +  PT   E+R+FLGL GYYR FI ++A IA P+T   +K  K D  +     +F++LK  +   P+L +PD 
Subjt:  QKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDWSR-ACESSFQELKERLASAPVLIVPDG

Query:  TGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK
            V+ +DAS   LG VL QNG  I++ SR L D+E NY   + EL A+V+A K +RHYL G +  + +DH+
Subjt:  TGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQVYTDHK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value 3.1e-71; identity 36.88%
Query:  EFLDVFPDKLPGLPPERE---VNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIKN
        ++ ++  + LP  P +     V   IE++P         Y +     +E+   +Q+LL+  FI PS SP  +PV+ V KKDGT RLC+DYR LNK TI +
Subjt:  EFLDVFPDKLPGLPPERE---VNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIKN

Query:  KYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH
         +PLPRID+L  ++  A +F+ +DL SGYHQI ++  D  KTAF T  G YE+ VM FGL NAP+ F   M   F+D    FV V++DDIL++S++ +EH
Subjt:  KYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH

Query:  AEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDW
         +HL  VL  L+ + L  K  KC+F  ++  FLG+ +    I     K  A+  +  P T+ + + FLG+  YYR FI + ++IA P+        K  W
Subjt:  AEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDW

Query:  SRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGR------VIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQV
        +   + + ++LK  L ++PVL+  +   N  + +DASK G+G VL +         V+ Y S+ L+  ++NYP  +LEL  ++ AL  +R+ L+G+   +
Subjt:  SRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGR------VIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQV

Query:  YTDH
         TDH
Subjt:  YTDH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus; e value 1.5e-73; identity 38.57%
Query:  VVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKK-----DGTLRLCIDYRELNK
        ++ EF  +F   L G+  E  V   I    +  PI   SY        E++ Q+ ELL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN 
Subjt:  VVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKK-----DGTLRLCIDYRELNK

Query:  VTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSK
        VTI + YP+P I+     L  A  F+ +DL SG+HQI +KE D+PKTAF T  G YEF+ + FGL NAPA+F  +++ + ++ +     V+IDDI+V+S+
Subjt:  VTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSK

Query:  TNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTR--
          D H ++LR VL  L K  L     K  F   +V FLG++V+ DGI  DP KV A+     PT++ E++ FLG+  YYR FI+D+A++A PLT LTR  
Subjt:  TNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTR--

Query:  ---------KGKKFDWSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQN----GRVIAYASRQLKDYERNYPTHDLELAAVVFAL
                              SF +LK  L S+ +L  P  T    + +DAS   +G VL Q+     R IAY SR L   E NY T + E+ A++++L
Subjt:  ---------KGKKFDWSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQN----GRVIAYASRQLKDYERNYPTHDLELAAVVFAL

Query:  KIWRHYLYGE-RIQVYTDHK
           R YLYG   I+VYTDH+
Subjt:  KIWRHYLYGE-RIQVYTDHK

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value 2.4e-71; identity 36.88%
Query:  EFLDVFPDKLPGLPPERE---VNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIKN
        ++ ++  + LP  P +     V   IE++P         Y +     +E+   +Q+LL+  FI PS SP  +PV+ V KKDGT RLC+DYR LNK TI +
Subjt:  EFLDVFPDKLPGLPPERE---VNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSPWGAPVLFVKKKDGTLRLCIDYRELNKVTIKN

Query:  KYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH
         +PLPRID+L  ++  A +F+ +DL SGYHQI ++  D  KTAF T  G YE+ VM FGL NAP+ F   M   F+D    FV V++DDIL++S++ +EH
Subjt:  KYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEH

Query:  AEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDW
         +HL  VL  L+ + L  K  KC+F  ++  FLG+ +    I     K  A+  +  P T+ + + FLG+  YYR FI + ++IA P+        K  W
Subjt:  AEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDW

Query:  SRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGR------VIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQV
        +   + +  +LK+ L ++PVL+  +   N  + +DASK G+G VL +         V+ Y S+ L+  ++NYP  +LEL  ++ AL  +R+ L+G+   +
Subjt:  SRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGR------VIAYASRQLKDYERNYPTHDLELAAVVFALKIWRHYLYGERIQV

Query:  YTDH
         TDH
Subjt:  YTDH

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value 5.0e-24; identity 40.8%
Query:  HLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLG--HVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDW
        HL  VL +  + + YA   KC F   ++ +LG  H++S +G++ DPAK+EA++GW  P   TE+R FLGL GYYR F+K++ +I  PLT+L +K     W
Subjt:  HLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLG--HVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIKDFARIAAPLTQLTRKGKKFDW

Query:  SRACESSFQELKERLASAPVLIVPD
        +     +F+ LK  + + PVL +PD
Subjt:  SRACESSFQELKERLASAPVLIVPD


Sequences
CDS sequence
ATGTATGTGACAGATATGGACGTAGTTAAGAATTTAGACCCTGGTACATCCAATGTGGACTTGGGAAATGAGGGGCAGCCTGTAGAGGAAATTGCTCCAGCGGAG
GCGGTTCCGGAGCCTGCTGCTCAGTCGGCATCCAGAGATCAGCCGACTGTTGTGATTACTTTGGAAGCATTACAATCATTGATTGAGAGTCGAGTAGATCAGGCA
ATGCAGAGCCGGGTGGATCAAGCGGTTCAGGCAGCCCTTGTTGGTCTTGGAAGCCAGGCGGCTCCAACAGTACCTGTATCGGGCCAGACGACATTGGTGTCTGAA
GCACCAGGAGTAGGTGTTCAGACAGTAATACCTCCAACACGGTTGACAGAACTACCTGGTACAGCTGTGGTGACAGAGGCACCATCGCGGGTAGTAACTTATGGC
CGACGATGTATGACAGAAGAGAGTGAGTACATACGAGATTTCATGAAACTTGGCCCGCCAACTTTTGGAGGAAAGGGGACTGATCCGGAGGCAGCTGAATGGTGG
TTGGAATGTGTTGAAACAAAATTTACATTCTACAACTGCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTA
CCTGTAGTGAATGAATTCCTGGATGTTTTTCCTGATAAATTACCTGGTTTACCACCAGAGCGGGAAGTCAACTTTGGTATTGAACTAGAACCAAGGACAACACCT
ATCTCTAAAGCTTCTTATAGGATGGCTCCCGCGGAATTGAAAGAGTTGAAGTTGCAATTACAGGAATTGTTGAACCAGGGATTCATACGACCTAGTGTGTCACCA
TGGGGAGCTCCAGTGTTGTTTGTGAAAAAGAAGGATGGCACACTTCGTCTCTGCATTGACTATAGGGAGCTGAATAAGGTGACCATAAAAAATAAGTATCCCTTG
CCACGAATTGATGACTTATTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGATCTTCGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTA
CCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAATTTGTCGTGATGTCTTTTGGCTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTA
TTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAAGTTTTGTGG
GTTCTACGTAAACAAAGATTATATGCCAAGTTCTCAAAATGCGAGTTTTGGCTTCAAAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTT
GATCCAGCAAAGGTGGAGGCAGTTATAGGTTGGGTTCGACCAACTACAATTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGTGCTTTATTAAA
GACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAAGGTAAGAAATTTGATTGGAGTCGAGCTTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGA
TTAGCGTCAGCCCCAGTGCTTATTGTACCTGACGGTACTGGGAACTTAGTAATTTATAGTGATGCTTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGG
AGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAATTACCCAACTCATGATTTAGAATTAGCTGCTGTGGTGTTTGCTCTGAAGATATGGAGA
CATTATCTGTACGGTGAGAGGATACAAGTATATACGGATCATAAGCTTTAG
mRNA sequence
ATGTATGTGACAGATATGGACGTAGTTAAGAATTTAGACCCTGGTACATCCAATGTGGACTTGGGAAATGAGGGGCAGCCTGTAGAGGAAATTGCTCCAGCGGAG
GCGGTTCCGGAGCCTGCTGCTCAGTCGGCATCCAGAGATCAGCCGACTGTTGTGATTACTTTGGAAGCATTACAATCATTGATTGAGAGTCGAGTAGATCAGGCA
ATGCAGAGCCGGGTGGATCAAGCGGTTCAGGCAGCCCTTGTTGGTCTTGGAAGCCAGGCGGCTCCAACAGTACCTGTATCGGGCCAGACGACATTGGTGTCTGAA
GCACCAGGAGTAGGTGTTCAGACAGTAATACCTCCAACACGGTTGACAGAACTACCTGGTACAGCTGTGGTGACAGAGGCACCATCGCGGGTAGTAACTTATGGC
CGACGATGTATGACAGAAGAGAGTGAGTACATACGAGATTTCATGAAACTTGGCCCGCCAACTTTTGGAGGAAAGGGGACTGATCCGGAGGCAGCTGAATGGTGG
TTGGAATGTGTTGAAACAAAATTTACATTCTACAACTGCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTA
CCTGTAGTGAATGAATTCCTGGATGTTTTTCCTGATAAATTACCTGGTTTACCACCAGAGCGGGAAGTCAACTTTGGTATTGAACTAGAACCAAGGACAACACCT
ATCTCTAAAGCTTCTTATAGGATGGCTCCCGCGGAATTGAAAGAGTTGAAGTTGCAATTACAGGAATTGTTGAACCAGGGATTCATACGACCTAGTGTGTCACCA
TGGGGAGCTCCAGTGTTGTTTGTGAAAAAGAAGGATGGCACACTTCGTCTCTGCATTGACTATAGGGAGCTGAATAAGGTGACCATAAAAAATAAGTATCCCTTG
CCACGAATTGATGACTTATTTGACCAGCTTCAAGGGGCAGCGGTATTTTCGAAGATTGATCTTCGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTA
CCGAAGACAGCTTTTCGTACTCGGTATGGGCATTATGAATTTGTCGTGATGTCTTTTGGCTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTA
TTCCAGGATTTTCTGGATTCTTTTGTCATTGTGTTCATTGATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAAGTTTTGTGG
GTTCTACGTAAACAAAGATTATATGCCAAGTTCTCAAAATGCGAGTTTTGGCTTCAAAAGGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTT
GATCCAGCAAAGGTGGAGGCAGTTATAGGTTGGGTTCGACCAACTACAATTACTGAGGTGAGAAGTTTTCTGGGTTTAGCCGGATATTACAGGTGCTTTATTAAA
GACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAAGGTAAGAAATTTGATTGGAGTCGAGCTTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGA
TTAGCGTCAGCCCCAGTGCTTATTGTACCTGACGGTACTGGGAACTTAGTAATTTATAGTGATGCTTCTAAGCATGGGTTGGGGTGCGTACTTATGCAAAACGGG
AGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAATTACCCAACTCATGATTTAGAATTAGCTGCTGTGGTGTTTGCTCTGAAGATATGGAGA
CATTATCTGTACGGTGAGAGGATACAAGTATATACGGATCATAAGCTTTAG
Protein sequence
MYVTDMDVVKNLDPGTSNVDLGNEGQPVEEIAPAEAVPEPAAQSASRDQPTVVITLEALQSLIESRVDQAMQSRVDQAVQAALVGLGSQAAPTVPVSGQTTLVSE
APGVGVQTVIPPTRLTELPGTAVVTEAPSRVVTYGRRCMTEESEYIRDFMKLGPPTFGGKGTDPEAAEWWLECVETKFTFYNCXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVVNEFLDVFPDKLPGLPPEREVNFGIELEPRTTPISKASYRMAPAELKELKLQLQELLNQGFIRPSVSP
WGAPVLFVKKKDGTLRLCIDYRELNKVTIKNKYPLPRIDDLFDQLQGAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRV
FQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLWVLRKQRLYAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWVRPTTITEVRSFLGLAGYYRCFIK
DFARIAAPLTQLTRKGKKFDWSRACESSFQELKERLASAPVLIVPDGTGNLVIYSDASKHGLGCVLMQNGRVIAYASRQLKDYERNYPTHDLELAAVVFALKIWR
HYLYGERIQVYTDHKL