; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G010110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G010110
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr15:5807705..5809198
RNA-Seq ExpressionCmoCh15G010110
SyntenyCmoCh15G010110
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]1.0e-22078.91Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDATIN   +KST++DSGATHNFI++QEARRL LTI KD GKMK VN EALPIVGVSK V  K+G WTG +D VVVRMDDF+VVLGMEFL+EHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKC+++T  +PTV+  SIKQPG +RMISA+QLK GL REE TFMAIP++E+      VP EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELL AGFIRP KAP+GAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVM FGLTNAPATFCT+MNQVF+EYLDQFV+VYL+DIVVYS TL+EH++HL+LVFDKLRQ+QLY+KKEKCAFAQ  I FL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
        GHV+  GQISMD+DK+KAIQEW+VPTSV ELRSFLGLANYYRRF+EGFSR A P+TELLKKG  W WS + Q AFE+LK
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]7.3e-21176.62Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDA IN   +KS ++DSGATHNFI++QEA RL LTI KD GKMKAVNSEALPIVGVSK V  K+G WTG  D VVVRMDDF+VVLGMEFL+EHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKC+++T  +PTV+ ASIKQPG +RMISA+QLK GL REE TFM      +      VP EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA P+LAELRKQLDEL   GFIRP KAP+GA VLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCT+MNQVF+EYLDQFV+VYL+DIVVYS TL+EH++HL+LVFDKLRQ+QLY+KKEKCAFAQ  I FL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
        GHV+  GQISMD+DK+K IQEW+VPTSV+ELRSFLGLANYYRRF+EGFSR A P+TELL K     WS + Q AFE+LK
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]2.3e-26595.46Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDF+VVLGMEFLLEHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLK GLAREE TFMAIPLIEEATTEETVPEEIKEVLD+YTDIMPESLPQTLPPRRGIDHEIELLP VK
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELLKAGFIRP KAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        I EGDEPKTTC+TRYGAFEFLVMPFGLTNAPATF TLMNQVFYEYLDQFVIVYL+DIVVYSTTLEEHKVHLKLVFDKLRQ+QLY+KKEKCAFAQTCINFL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG
        GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSR AAPLTELLKK HPWSWSNDCQMAFENLK     G
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo]3.6e-26695.67Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDF+VVLGMEFLLEHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKCLVITDRNP VIPASIKQPGNLRMISAIQLK GLAREE TFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLP VK
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELLKAGFIRP KAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYL+DIVVYSTTLEEHKVHLKLVFDKLRQ+QLY+KKEKCAFAQTCI+FL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG
        GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSR AAPLTELLKK HPWSWSNDCQMAFENLK     G
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]1.6e-26695.67Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDF+VVLGMEFLLEHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLK GLAREE TFMAIPLIEEATTEETVPEEIK+VLDSYTDIMPESLPQTLPPRRGIDHEIELLP VK
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELLKAGFIRP KAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYL+DIVVYSTTLEEHKVHLKLVFDKLRQ+QLY+KKEKCAFAQTCI+FL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG
        GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSR AAPLTELLKK HPWSWSNDCQMAFENLK     G
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG

TrEMBL top hitse value%identityAlignment
A0A5D3BQE4 Reverse transcriptase2.5e-20471.96Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        M+VD  IN + +KST++DSGATHNFI + EA+RL L   KD G+MKAVNS ALPI+G+ K    ++G W+G +D VVV+MDDF+VVLGMEFLLEH+VIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKCLVIT   P+V+   ++QP  L+MISA+QLK GL+R+E TFMAIPL     + ETVP+EI  VL+ Y D+MP+SLP++LPPRR IDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+DLFD+L+GAK F+KLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCTLMNQVF+EYLD+FV+VYL+DIVVYSTT+EEH+ HL+ VF KL+++QLY+K+EKC+FAQ  INFL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG
        GHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFVEGFS+ A+PLTELLKK   W+W  +CQ AF+ LK    EG
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG

A0A5D3C8Z6 Reverse transcriptase1.9e-20471.96Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        M+VD  IN + +KST++DSGATHNFI + EA+RL L   KD G+MKAVNS ALPI+G+ K    ++G W+G +D VVV+MDDF+VVLGMEFLLEH+VIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKCLVIT   P+V+   ++QP  L+MISA+QLK GL+R+E TFMAIPL     + ETVP+EI  VL+ Y D+MP+SLP++LPPRR IDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+DLFD+L+GAK F+KLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCTLMNQVF+EYLD+FV+VYL+DIVVYSTT+EEH+ HL+ VF KL+++QLY+K+EKC+FAQ  INFL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG
        GHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFVEGFS+ A+PLTELLKK   W+W  +CQ AF+ LK    EG
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG

A0A5D3C9P8 Reverse transcriptase1.4e-20472.16Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        M+VD  IN + +KST++DSGATHNFI + EA+RL L   KD G+MKAVNS ALPI+G+ K    ++G W+G +D VVV+MDDF+VVLGMEFLLEH+VIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKCLVIT   P+V+   ++QP  L+MISA+QLK GL+R+E TFMAIPL     + ETVP+EI  VL+ Y D+MP+SLP++LPPRR IDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQKKKDG+LRLCIDYRALNK+TVRNKYPLPII+DLFD+L+GAK F+KLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCTLMNQVF+EYLD+FV+VYL+DIVVYSTT+EEH+ HL+ VF KL+++QLY+K+EKC+FAQ  INFL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG
        GHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFVEGFS+ A+PLTELLKK   W+W  +CQ AF+ LK    EG
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEG

A0A6J1D906 Reverse transcriptase4.9e-22178.91Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDATIN   +KST++DSGATHNFI++QEARRL LTI KD GKMK VN EALPIVGVSK V  K+G WTG +D VVVRMDDF+VVLGMEFL+EHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKC+++T  +PTV+  SIKQPG +RMISA+QLK GL REE TFMAIP++E+      VP EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA PELAELRKQLDELL AGFIRP KAP+GAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVM FGLTNAPATFCT+MNQVF+EYLDQFV+VYL+DIVVYS TL+EH++HL+LVFDKLRQ+QLY+KKEKCAFAQ  I FL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
        GHV+  GQISMD+DK+KAIQEW+VPTSV ELRSFLGLANYYRRF+EGFSR A P+TELLKKG  W WS + Q AFE+LK
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

A0A6J1DK29 uncharacterized protein LOC1110218293.5e-21176.62Show/hide
Query:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM
        MFVDA IN   +KS ++DSGATHNFI++QEA RL LTI KD GKMKAVNSEALPIVGVSK V  K+G WTG  D VVVRMDDF+VVLGMEFL+EHKVIPM
Subjt:  MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPM

Query:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK
        PLAKC+++T  +PTV+ ASIKQPG +RMISA+QLK GL REE TFM      +      VP EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL+P  K
Subjt:  PLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVK

Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR
        PPAKNAYRMA P+LAELRKQLDEL   GFIRP KAP+GA VLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQL+GAK FTKLDLRSGYYQVR
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL
        IAEGDEPKTTC+TRYGAFEFLVMPFGLTNAPATFCT+MNQVF+EYLDQFV+VYL+DIVVYS TL+EH++HL+LVFDKLRQ+QLY+KKEKCAFAQ  I FL
Subjt:  IAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFL

Query:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
        GHV+  GQISMD+DK+K IQEW+VPTSV+ELRSFLGLANYYRRF+EGFSR A P+TELL K     WS + Q AFE+LK
Subjt:  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.7e-5632.44Show/hide
Query:  ITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPW-VKPPAKNA
        I+ +   ++      P  +   +     + ++  + T   +  +     E  +P+  KE  D   +   E LP+   P +G++ E+EL     + P +N 
Subjt:  ITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPW-VKPPAKNA

Query:  YRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDE
        Y +   ++  +  ++++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE
Subjt:  YRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDE

Query:  PKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRC
         K       G FE+LVMP+G++ APA F   +N +  E  +  V+ Y++DI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +  
Subjt:  PKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRC

Query:  GQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
           +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLKK   W W+     A EN+K
Subjt:  GQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

P0CT35 Transposon Tf2-2 polyprotein1.7e-5632.44Show/hide
Query:  ITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPW-VKPPAKNA
        I+ +   ++      P  +   +     + ++  + T   +  +     E  +P+  KE  D   +   E LP+   P +G++ E+EL     + P +N 
Subjt:  ITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPW-VKPPAKNA

Query:  YRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDE
        Y +   ++  +  ++++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE
Subjt:  YRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDE

Query:  PKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRC
         K       G FE+LVMP+G++ APA F   +N +  E  +  V+ Y++DI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +  
Subjt:  PKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRC

Query:  GQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
           +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLKK   W W+     A EN+K
Subjt:  GQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

P0CT41 Transposon Tf2-12 polyprotein1.7e-5632.44Show/hide
Query:  ITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPW-VKPPAKNA
        I+ +   ++      P  +   +     + ++  + T   +  +     E  +P+  KE  D   +   E LP+   P +G++ E+EL     + P +N 
Subjt:  ITDRNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPW-VKPPAKNA

Query:  YRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDE
        Y +   ++  +  ++++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE
Subjt:  YRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDE

Query:  PKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRC
         K       G FE+LVMP+G++ APA F   +N +  E  +  V+ Y++DI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +  
Subjt:  PKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRC

Query:  GQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK
           +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLKK   W W+     A EN+K
Subjt:  GQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.5e-5738.83Show/hide
Query:  ESLPQTLPPRRG------IDHEIELLPWVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYP
        E +   LPPR        + H+IE+ P  + P    Y +      E+ K + +LL   FI P+K+P  +PV+   KKDGT RLC+DYR LNK T+ + +P
Subjt:  ESLPQTLPPRRG------IDHEIELLPWVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYP

Query:  LPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVH
        LP I +L  ++  A+ FT LDL SGY+Q+ +   D  KT  +T  G +E+ VMPFGL NAP+TF   M   F +   +FV VYL+DI+++S + EEH  H
Subjt:  LPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVH

Query:  LKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSND
        L  V ++L+ + L +KK+KC FA     FLG+ +   +I+    K  AI+++  P +V + + FLG+ NYYRRF+   S+ A P+   +     W+   D
Subjt:  LKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSND

Query:  CQMAFENLK
           A E LK
Subjt:  CQMAFENLK

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.3e-5738.39Show/hide
Query:  ESLPQTLPPRRG------IDHEIELLPWVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYP
        E +   LPPR        + H+IE+ P  + P    Y +      E+ K + +LL   FI P+K+P  +PV+   KKDGT RLC+DYR LNK T+ + +P
Subjt:  ESLPQTLPPRRG------IDHEIELLPWVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYP

Query:  LPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVH
        LP I +L  ++  A+ FT LDL SGY+Q+ +   D  KT  +T  G +E+ VMPFGL NAP+TF   M   F +   +FV VYL+DI+++S + EEH  H
Subjt:  LPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDEPKTTCITRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVH

Query:  LKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSND
        L  V ++L+ + L +KK+KC FA     FLG+ +   +I+    K  AI+++  P +V + + FLG+ NYYRRF+   S+ A P+   +     W+   D
Subjt:  LKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSWSND

Query:  CQMAFENLKN
           A + LK+
Subjt:  CQMAFENLKN

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-1841.96Show/hide
Query:  HLKLVFDKLRQDQLYLKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSW
        HL +V     Q Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P + +ELR FLGL  YYRRFV+ + +   PLTELLKK +   W
Subjt:  HLKLVFDKLRQDQLYLKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRPAAPLTELLKKGHPWSW

Query:  SNDCQMAFENLK
        +    +AF+ LK
Subjt:  SNDCQMAFENLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGTAGATGCCACAATAAACTCTCGGCTGAGCAAGAGCACTTTGATAGACTCTGGAGCAACCCACAACTTCATTGCCGATCAAGAAGCCAGAAGATTGGGACTCAC
CATAGGAAAAGACCCGGGGAAAATGAAAGCTGTCAACTCCGAGGCCTTGCCTATTGTGGGAGTTTCCAAAGGAGTCCCCTTCAAAATAGGGGATTGGACTGGAGAGCTAG
ACCTTGTCGTGGTTCGCATGGACGACTTCAACGTGGTACTTGGGATGGAGTTCCTCTTAGAACACAAGGTCATCCCAATGCCATTGGCAAAATGCTTGGTGATCACCGAC
CGTAACCCCACAGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAGCTGAAAATGGGACTCGCACGAGAGGAACTTACATTCATGGC
CATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCCAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACCACAAACACTAC
CACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCTGGGTTAAGCCACCAGCGAAGAACGCATACCGGATGGCTCTGCCCGAGCTAGCCGAATTGAGGAAACAA
TTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTA
TAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACCAACTTTACGGGGCCAAGAACTTCACGAAGTTGGACTTACGAT
CAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACAACGTGCATAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCC
CCAGCTACGTTTTGCACGTTGATGAACCAGGTTTTCTACGAATATCTGGATCAGTTCGTCATAGTATACCTCGAAGACATTGTTGTTTACAGCACAACTCTCGAGGAACA
CAAAGTGCACTTGAAGCTAGTGTTTGACAAGCTCCGGCAGGACCAGCTATATTTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCG
TCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTAC
TATAGGCGGTTCGTCGAAGGGTTTTCACGACCAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGGCCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAA
TCTGAAAAACAACCATGACGAGGGGTCCTATCCTCGGGTTGGTAGACATCACAAAGCCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGTAGATGCCACAATAAACTCTCGGCTGAGCAAGAGCACTTTGATAGACTCTGGAGCAACCCACAACTTCATTGCCGATCAAGAAGCCAGAAGATTGGGACTCAC
CATAGGAAAAGACCCGGGGAAAATGAAAGCTGTCAACTCCGAGGCCTTGCCTATTGTGGGAGTTTCCAAAGGAGTCCCCTTCAAAATAGGGGATTGGACTGGAGAGCTAG
ACCTTGTCGTGGTTCGCATGGACGACTTCAACGTGGTACTTGGGATGGAGTTCCTCTTAGAACACAAGGTCATCCCAATGCCATTGGCAAAATGCTTGGTGATCACCGAC
CGTAACCCCACAGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAGCTGAAAATGGGACTCGCACGAGAGGAACTTACATTCATGGC
CATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCCAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACCACAAACACTAC
CACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCTGGGTTAAGCCACCAGCGAAGAACGCATACCGGATGGCTCTGCCCGAGCTAGCCGAATTGAGGAAACAA
TTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTA
TAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACCAACTTTACGGGGCCAAGAACTTCACGAAGTTGGACTTACGAT
CAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACAACGTGCATAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCC
CCAGCTACGTTTTGCACGTTGATGAACCAGGTTTTCTACGAATATCTGGATCAGTTCGTCATAGTATACCTCGAAGACATTGTTGTTTACAGCACAACTCTCGAGGAACA
CAAAGTGCACTTGAAGCTAGTGTTTGACAAGCTCCGGCAGGACCAGCTATATTTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCG
TCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTAC
TATAGGCGGTTCGTCGAAGGGTTTTCACGACCAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGGCCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAA
TCTGAAAAACAACCATGACGAGGGGTCCTATCCTCGGGTTGGTAGACATCACAAAGCCATTTAA
Protein sequenceShow/hide protein sequence
MFVDATINSRLSKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITD
RNPTVIPASIKQPGNLRMISAIQLKMGLAREELTFMAIPLIEEATTEETVPEEIKEVLDSYTDIMPESLPQTLPPRRGIDHEIELLPWVKPPAKNAYRMALPELAELRKQ
LDELLKAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLYGAKNFTKLDLRSGYYQVRIAEGDEPKTTCITRYGAFEFLVMPFGLTNA
PATFCTLMNQVFYEYLDQFVIVYLEDIVVYSTTLEEHKVHLKLVFDKLRQDQLYLKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANY
YRRFVEGFSRPAAPLTELLKKGHPWSWSNDCQMAFENLKNNHDEGSYPRVGRHHKAI