CuGenDBv2

Pay0020663 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0020663
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr07:10182992..10191242
RNA-Seq Expression: Pay0020663
Synteny: Pay0020663
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR004242 - Transposon, En/Spm-like
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
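
The genome location given for this gene (chr07:10182992..10191242) can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, not part of the database record: the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval written as 'seqid:start..end'."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chr07:10182992..10191242'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr07:10182992..10191242")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval spans 8251 bp.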


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025917.1 pol protein [Cucumis melo var. makuwa] | e value: 2.5e-207 | % identity: 82.89
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +I+DL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEFIVMSFGL+NA AVFMDLMNRVFKDFLD FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRA+KLYAKFSKCEFWLKKVTFL HVVSSEGVSVD AK+EAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
        SSFQELKQKLV+ PVL   DGSGSFVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

KAA0058812.1 pol protein [Cucumis melo var. makuwa] | e value: 4.3e-207 | % identity: 82
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVT+KNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +IDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAF SRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDF+D+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
        SSFQELKQKLV+ PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELA VVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD+EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

KAA0060745.1 pol protein [Cucumis melo var. makuwa] | e value: 5.1e-208 | % identity: 83.11
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        KIDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
        SSFQELKQKLV  PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

KAA0063793.1 pol protein [Cucumis melo var. makuwa] | e value: 1.1e-207 | % identity: 82.67
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP +     P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +IDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
         SFQELKQKLV+ PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

TYK20443.1 pol protein [Cucumis melo var. makuwa] | e value: 1.6e-206 | % identity: 82.44
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +IDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+W RPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
         SFQELKQKLV+ PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TP01 Reverse transcriptase | e value: 1.2e-207 | % identity: 82.89
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +I+DL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEFIVMSFGL+NA AVFMDLMNRVFKDFLD FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRA+KLYAKFSKCEFWLKKVTFL HVVSSEGVSVD AK+EAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
        SSFQELKQKLV+ PVL   DGSGSFVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

A0A5A7USG7 Reverse transcriptase | e value: 2.1e-207 | % identity: 82
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVT+KNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +IDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAF SRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDF+D+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
        SSFQELKQKLV+ PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELA VVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD+EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

A0A5A7V4E4 Reverse transcriptase | e value: 2.5e-208 | % identity: 83.11
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        KIDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
        SSFQELKQKLV  PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

A0A5A7V6R2 Reverse transcriptase | e value: 5.5e-208 | % identity: 82.67
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP +     P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +IDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+WPRPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
         SFQELKQKLV+ PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

A0A5D3BTN0 Reverse transcriptase | e value: 8.0e-207 | % identity: 82.44
Query:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
        P  FP       P +E+DF IELEPGT PIS+APYRMAP ELKELKVQL            VSPWGAP+LFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
Subjt:  PRCFPRRASRTSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQL------------VSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLP

Query:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH
        +IDDL DQLQGATVFSKIDLRSGYHQLRI+D DIPKTAFRSRYGHYEF+VMSFGL+NA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHLH
Subjt:  KIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLH

Query:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE
        QVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEGVSVD AKIEAVT+W RPSTVSEIRSFLGLAGYYR FVEDFSRIASPLTQL  K TPFVWSP CE
Subjt:  QVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCE

Query:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF
         SFQELKQKLV+ PVL   DGSG+FVIYSDASKKGLG VLMQQG+V             NYPTHDLELAAVVF+LKIWRHYLYGE IQI+TDHKSLKYFF
Subjt:  SSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRV-------------NYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE
        TQKELNMRQR+WLELVKDYD EILYH GKANVVA+ALSRKVAHSA LI++
Subjt:  TQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISE

SwissProt top hits | e value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 1.9e-72 | % identity: 37.6
Query:  SPWGAPILFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSN
        SP+ +PI  V KK+D S     R+ IDYR+LN++TV +R+P+P +D+++ +L     F+ IDL  G+HQ+ +    + KTAF +++GHYE++ M FGL N
Subjt:  SPWGAPILFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSN

Query:  ASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVS
        A A F   MN + +  L+   +V++DDI+++S +  EH + L  V E L    L  +  KCEF  ++ TFL HV++ +G+  +  KIEA+  +P P+   
Subjt:  ASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVS

Query:  EIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPF-VWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQG------------
        EI++FLGL GYYR F+ +F+ IA P+T+ + K       +P  +S+F++LK  +   P+L   D +  F + +DAS   LG VL Q G            
Subjt:  EIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPF-VWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQG------------

Query:  -RVNYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFFTQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSR
          +NY T + EL A+V++ K +RHYL G   +I +DH+ L + +  K+ N +  +W   + ++D++I Y  GK N VA+ALSR
Subjt:  -RVNYPTHDLELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFFTQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSR

P0CT34 Transposon Tf2-1 polyprotein | e value: 4.0e-70 | % identity: 37.08
Query:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN
        P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G+S A A F   +N
Subjt:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN

Query:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG
         +  +  ++ V+ ++DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+ + +S +G +     I+ V  W +P    E+R FLG   
Subjt:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP
        Y R F+   S++  PL  L+ K   + W+P    + + +KQ LVSPPVL   D S   ++ +DAS   +G VL                  M + ++NY 
Subjt:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP

Query:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV
          D E+ A++ SLK WRHYL    E  +I TDH++L    T +    N R  +W   ++D+++EI Y  G AN +A+ALSR V
Subjt:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV

P0CT35 Transposon Tf2-2 polyprotein | e value: 4.0e-70 | % identity: 37.08
Query:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN
        P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G+S A A F   +N
Subjt:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN

Query:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG
         +  +  ++ V+ ++DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+ + +S +G +     I+ V  W +P    E+R FLG   
Subjt:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP
        Y R F+   S++  PL  L+ K   + W+P    + + +KQ LVSPPVL   D S   ++ +DAS   +G VL                  M + ++NY 
Subjt:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP

Query:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV
          D E+ A++ SLK WRHYL    E  +I TDH++L    T +    N R  +W   ++D+++EI Y  G AN +A+ALSR V
Subjt:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV

P0CT36 Transposon Tf2-3 polyprotein | e value: 4.0e-70 | % identity: 37.08
Query:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN
        P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G+S A A F   +N
Subjt:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN

Query:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG
         +  +  ++ V+ ++DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+ + +S +G +     I+ V  W +P    E+R FLG   
Subjt:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP
        Y R F+   S++  PL  L+ K   + W+P    + + +KQ LVSPPVL   D S   ++ +DAS   +G VL                  M + ++NY 
Subjt:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP

Query:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV
          D E+ A++ SLK WRHYL    E  +I TDH++L    T +    N R  +W   ++D+++EI Y  G AN +A+ALSR V
Subjt:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV

P0CT41 Transposon Tf2-12 polyprotein | e value: 4.0e-70 | % identity: 37.08
Query:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN
        P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G+S A A F   +N
Subjt:  PILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMN

Query:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG
         +  +  ++ V+ ++DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+ + +S +G +     I+ V  W +P    E+R FLG   
Subjt:  RVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP
        Y R F+   S++  PL  L+ K   + W+P    + + +KQ LVSPPVL   D S   ++ +DAS   +G VL                  M + ++NY 
Subjt:  YYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVL------------------MQQGRVNYP

Query:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV
          D E+ A++ SLK WRHYL    E  +I TDH++L    T +    N R  +W   ++D+++EI Y  G AN +A+ALSR V
Subjt:  THDLELAAVVFSLKIWRHYLYG--EMIQIFTDHKSLKYFFTQKE--LNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKV

Arabidopsis top hits | e value | % identity | Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 1.2e-21 | % identity: 40.5
Query:  HLHQVLETLRANKLYAKFSKCEFWLKKVTFLD--HVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVW
        HL  VL+    ++ YA   KC F   ++ +L   H++S EGVS D AK+EA+  WP P   +E+R FLGL GYYR FV+++ +I  PLT+L+ K+    W
Subjt:  HLHQVLETLRANKLYAKFSKCEFWLKKVTFLD--HVVSSEGVSVDSAKIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVW

Query:  SPVCESSFQELKQKLVSPPVL
        + +   +F+ LK  + + PVL
Subjt:  SPVCESSFQELKQKLVSPPVL


Sequences
CDS sequence
ATGATTGAAGTTGCTCATGAGGAGTATTCAAACGATCCAAATGAATTTGAGAAATTGCTTAATGATGCCGAAAAATCACTATATGAAGGATGCAAAAAATTCACCAAGTT
GTCCACATTAGTGAAGTTGTATAATTTAAAAGTTAGGTATGGATGGAGTGATATTAGCTTTTCGGAACTACTGAAAATTTTAAAGAAAATTTTGCCTACTTCCAATGAGA
TCCCAACATCCATGTATGAAGAGAAGAAAACTTTAGGTGCATTAGGAATGAGTTATGAAGAGATTCATGCATGCCCTAATGATTGTTGTTTGTATAGAAAAGAACACGTT
AATGCAACTGAATGTCCTGAATGTGATGAATCAAAGTGGAAGTTTGCTAATAATGCAAACGGGGGAAAAAAACAAATTCCTGAAAAAGTTGTATGGTATTTTCCACCGAT
TCCACGTTTCAAAAGATTGTTTCGAAGTATTAACAATGCTAAATTTTTGATTCGACATTCTAATGAACGAGTAATTGATGGAAAATTACGACATCCTGTAGACTCTCCAG
CTTGGAAATTAATAGATTTGAAGTGGCCAGACTTTGGCTCTGAACCTAGGAATATTCGTTTAGCATTGTCAGTGGATGGGATCAACCCACACGGTGAAATGAGTTCTAAA
TATAGTTGTTGGCCCGTAGTGATAGTTATTTACAATCTTCCACCATGGTTGTGCATGAAAAGAAAGTTCATGATGTTATCAATGTTGATATCGGGTCCAAGGCAACCAAG
GGATGACATTGGTACGTACTTAGCACCACCCATCGAGGATTTAAAACTCTTATGGGAAAGTGGTGTTGAATGTTATGATGCTAATCAAGAAGAAATATTCAATTTAAGGG
CTGTTTTGTTATGGACAATAAATGATTTTCCTGCATATGGAAATTTTAGTGGATTTAGTGTGAAAGGTGGTAAGGGAGTACCCCGATGTTTTCCCCGACGAGCTTCCAGG
ACTTCCCCCTCCCAGGAGATAGACTTCACCATCGAGTTAGAGCCAGGCACTACTCCTATCTCGAAGGCCCCTTACAGAATGGCTCCAACTGAGTTAAAGGAGCTGAAGGT
GCAGCTTGTGTCACCTTGGGGAGCACCAATATTGTTCGTAAAGAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTATAGAGAGTTGAACAAGGTGACAGTCAAGAACC
GCTATCCCTTACCCAAGATTGATGATTTAATCGATCAGTTACAAGGAGCCACCGTCTTTTCTAAGATCGACCTACGATCAGGCTATCACCAGCTGAGGATCAAGGATAGT
GATATTCCTAAGACGGCTTTCCGTTCCAGATATGGGCATTACGAGTTCATTGTGATGTCCTTTGGTTTGAGTAATGCTTCTGCAGTATTCATGGACTTGATGAACAGGGT
GTTTAAGGACTTCTTAGACACGTTTGTTATAGTTTTCATTGATGACATTTTGATTTATTCCAAGACTGAGGCTGAACATGAAGAGCATTTGCATCAGGTTTTGGAGACTC
TTCGAGCTAATAAGTTGTATGCCAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGACTTTTCTCGACCATGTGGTTTCTAGTGAGGGAGTTTCTGTGGACTCAGCA
AAGATCGAAGCGGTTACCAGTTGGCCTCGACCGTCTACAGTTAGTGAGATTCGTAGTTTCCTGGGTTTGGCAGGTTACTACAGAACGTTCGTGGAAGACTTCTCTCGTAT
AGCCAGTCCCTTAACTCAGTTGATCACGAAGAGGACTCCTTTTGTTTGGAGCCCAGTTTGTGAGAGTAGCTTCCAGGAGCTTAAGCAGAAGCTTGTGTCTCCACCAGTCC
TCATAGGGCTAGATGGATCCGGGAGCTTTGTAATCTACAGTGATGCCTCCAAGAAAGGACTGGGTTACGTGTTGATGCAGCAGGGTAGGGTAAACTATCCTACCCATGAC
CTAGAGTTGGCAGCAGTGGTTTTTTCACTGAAGATATGGAGACACTACCTGTATGGTGAGATGATACAGATTTTCACTGACCATAAGAGCCTAAAGTACTTCTTCACCCA
GAAGGAGTTGAACATGAGACAGAGAAAATGGCTTGAGTTGGTGAAGGATTATGACTACGAGATTCTGTATCACCTTGGTAAGGCAAATGTAGTAGCCAACGCGCTCAGTA
GGAAAGTTGCACATTCAGCAGTGCTTATCTCCGAGCTTGCTCAGAGATTTTGA
mRNA sequence
ATGATTGAAGTTGCTCATGAGGAGTATTCAAACGATCCAAATGAATTTGAGAAATTGCTTAATGATGCCGAAAAATCACTATATGAAGGATGCAAAAAATTCACCAAGTT
GTCCACATTAGTGAAGTTGTATAATTTAAAAGTTAGGTATGGATGGAGTGATATTAGCTTTTCGGAACTACTGAAAATTTTAAAGAAAATTTTGCCTACTTCCAATGAGA
TCCCAACATCCATGTATGAAGAGAAGAAAACTTTAGGTGCATTAGGAATGAGTTATGAAGAGATTCATGCATGCCCTAATGATTGTTGTTTGTATAGAAAAGAACACGTT
AATGCAACTGAATGTCCTGAATGTGATGAATCAAAGTGGAAGTTTGCTAATAATGCAAACGGGGGAAAAAAACAAATTCCTGAAAAAGTTGTATGGTATTTTCCACCGAT
TCCACGTTTCAAAAGATTGTTTCGAAGTATTAACAATGCTAAATTTTTGATTCGACATTCTAATGAACGAGTAATTGATGGAAAATTACGACATCCTGTAGACTCTCCAG
CTTGGAAATTAATAGATTTGAAGTGGCCAGACTTTGGCTCTGAACCTAGGAATATTCGTTTAGCATTGTCAGTGGATGGGATCAACCCACACGGTGAAATGAGTTCTAAA
TATAGTTGTTGGCCCGTAGTGATAGTTATTTACAATCTTCCACCATGGTTGTGCATGAAAAGAAAGTTCATGATGTTATCAATGTTGATATCGGGTCCAAGGCAACCAAG
GGATGACATTGGTACGTACTTAGCACCACCCATCGAGGATTTAAAACTCTTATGGGAAAGTGGTGTTGAATGTTATGATGCTAATCAAGAAGAAATATTCAATTTAAGGG
CTGTTTTGTTATGGACAATAAATGATTTTCCTGCATATGGAAATTTTAGTGGATTTAGTGTGAAAGGTGGTAAGGGAGTACCCCGATGTTTTCCCCGACGAGCTTCCAGG
ACTTCCCCCTCCCAGGAGATAGACTTCACCATCGAGTTAGAGCCAGGCACTACTCCTATCTCGAAGGCCCCTTACAGAATGGCTCCAACTGAGTTAAAGGAGCTGAAGGT
GCAGCTTGTGTCACCTTGGGGAGCACCAATATTGTTCGTAAAGAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTATAGAGAGTTGAACAAGGTGACAGTCAAGAACC
GCTATCCCTTACCCAAGATTGATGATTTAATCGATCAGTTACAAGGAGCCACCGTCTTTTCTAAGATCGACCTACGATCAGGCTATCACCAGCTGAGGATCAAGGATAGT
GATATTCCTAAGACGGCTTTCCGTTCCAGATATGGGCATTACGAGTTCATTGTGATGTCCTTTGGTTTGAGTAATGCTTCTGCAGTATTCATGGACTTGATGAACAGGGT
GTTTAAGGACTTCTTAGACACGTTTGTTATAGTTTTCATTGATGACATTTTGATTTATTCCAAGACTGAGGCTGAACATGAAGAGCATTTGCATCAGGTTTTGGAGACTC
TTCGAGCTAATAAGTTGTATGCCAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGACTTTTCTCGACCATGTGGTTTCTAGTGAGGGAGTTTCTGTGGACTCAGCA
AAGATCGAAGCGGTTACCAGTTGGCCTCGACCGTCTACAGTTAGTGAGATTCGTAGTTTCCTGGGTTTGGCAGGTTACTACAGAACGTTCGTGGAAGACTTCTCTCGTAT
AGCCAGTCCCTTAACTCAGTTGATCACGAAGAGGACTCCTTTTGTTTGGAGCCCAGTTTGTGAGAGTAGCTTCCAGGAGCTTAAGCAGAAGCTTGTGTCTCCACCAGTCC
TCATAGGGCTAGATGGATCCGGGAGCTTTGTAATCTACAGTGATGCCTCCAAGAAAGGACTGGGTTACGTGTTGATGCAGCAGGGTAGGGTAAACTATCCTACCCATGAC
CTAGAGTTGGCAGCAGTGGTTTTTTCACTGAAGATATGGAGACACTACCTGTATGGTGAGATGATACAGATTTTCACTGACCATAAGAGCCTAAAGTACTTCTTCACCCA
GAAGGAGTTGAACATGAGACAGAGAAAATGGCTTGAGTTGGTGAAGGATTATGACTACGAGATTCTGTATCACCTTGGTAAGGCAAATGTAGTAGCCAACGCGCTCAGTA
GGAAAGTTGCACATTCAGCAGTGCTTATCTCCGAGCTTGCTCAGAGATTTTGA
Protein sequence
MIEVAHEEYSNDPNEFEKLLNDAEKSLYEGCKKFTKLSTLVKLYNLKVRYGWSDISFSELLKILKKILPTSNEIPTSMYEEKKTLGALGMSYEEIHACPNDCCLYRKEHV
NATECPECDESKWKFANNANGGKKQIPEKVVWYFPPIPRFKRLFRSINNAKFLIRHSNERVIDGKLRHPVDSPAWKLIDLKWPDFGSEPRNIRLALSVDGINPHGEMSSK
YSCWPVVIVIYNLPPWLCMKRKFMMLSMLISGPRQPRDDIGTYLAPPIEDLKLLWESGVECYDANQEEIFNLRAVLLWTINDFPAYGNFSGFSVKGGKGVPRCFPRRASR
TSPSQEIDFTIELEPGTTPISKAPYRMAPTELKELKVQLVSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLIDQLQGATVFSKIDLRSGYHQLRIKDS
DIPKTAFRSRYGHYEFIVMSFGLSNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSEGVSVDSA
KIEAVTSWPRPSTVSEIRSFLGLAGYYRTFVEDFSRIASPLTQLITKRTPFVWSPVCESSFQELKQKLVSPPVLIGLDGSGSFVIYSDASKKGLGYVLMQQGRVNYPTHD
LELAAVVFSLKIWRHYLYGEMIQIFTDHKSLKYFFTQKELNMRQRKWLELVKDYDYEILYHLGKANVVANALSRKVAHSAVLISELAQRF
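
The CDS and protein sequences above can be cross-checked against each other by translation. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code, the helper name `translate` is hypothetical, and only the first line of the CDS and the matching start of the protein are included to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Trailing bases that do not fill a complete codon are ignored, and
    codons containing characters other than T, C, A, G become 'X'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the CDS sequence listed above (110 nt, 36 complete codons).
CDS_START = (
    "ATGATTGAAGTTGCTCATGAGGAGTATTCAAACGATCCAAATGAATTTGAGAAATTGCTT"
    "AATGATGCCGAAAAATCACTATATGAAGGATGCAAAAAATTCACCAAGTT"
)
# First 36 residues of the protein sequence listed above.
PROTEIN_START = "MIEVAHEEYSNDPNEFEKLLNDAEKSLYEGCKKFTK"

print(translate(CDS_START) == PROTEIN_START)
```

The same function can be applied to the full CDS after joining its lines into a single string.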