; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0110441 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0110441
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCMiso1.1chr04:29709746..29711014
RNA-Seq ExpressionCmc04g0110441
SyntenyCmc04g0110441
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006886 - intracellular protein transport (biological process)
GO:0007033 - vacuole organization (biological process)
GO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0016020 - membrane (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004601 - peroxidase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052373.1 peroxidase 64 [Cucumis melo var. makuwa]3.0e-22992.42Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGA+DQGFLVECRT+ECGL EEHEQDR QG+EDEE IATLLKQFASVFEWPT LPP+RSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIR SKS YSSPVLLVRKKDG WRFCVDYRALNNVTIPNKFPIPVIEE FDELKGAS+FSKIDLK GY+QIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQG+ EH+QHLEVVLGLLKEKELYAN EKCSFAKPRISYLG+FISEQGIEADP+KIRAVSEWPT T
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVKDYG IA+PLTQ LKKG YKWDAE E AFDKLKK MMTLPVLAM DFNLPFEIESDASGFGVGA+LTQCRKPVAYFSKTL 
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRARPVYERELIVVVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

KAA0053639.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-24297.87Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEA+ATLLKQFASVFEWPTALPPQRSIDHHIYLKSGT+PVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPY+SPVLLVRKKDG WRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLK GYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG+FISEQGIEA PEKIRAVSEWPTPT
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAM DFNLPFEIESDASGFGVG VLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRARPVYERELIVVVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

KAA0067256.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.2e-23494.55Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGA+DQGFLVECRTIECG LEEHEQDR QG+EDEE IATLLKQFASVFEWPT LPPQRSI+HHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPYS PVLLVRKKDG WRFCVDYRALNNVTIP+KFPIPVIEELFD+LKGASVFSKIDLK GYHQIRMCP+DIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGI EH+QHLEVVLGLLKEKELYANLEKCSFAKPRISYLG+FISEQGIEADPEKIRAVSEWPTPT
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAE E AF KLKK MMTLPVLAM DFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRA PVYERELIV VLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

TYK00786.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-22589.81Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        +K+WGADDQGFLVECRT+ECG LE+ EQD+G+G+ D E IATLL+QFA VFEWP  LPPQRSI+HHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPYSSPVLLVRKKDG WRFCVDYRALNNVTIP+KFPIPVIEELFDELKGASVFSK+DLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDIL+YSQG++EH+QHLEVVLGLL+++ELY N+EKCSFAKPRISYLG+FISEQG+EADPEKIRAVS+WPTPT
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVK+YGAIAAPLTQLLKKGAYKWDAE E AF+KLKK MMTLPVL M DFNLPFEI+SDASG GVGAVLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRARPVYEREL+ VVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

TYK13876.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-22590.28Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGADDQGFLVECRTIECG LEEHEQDR QG  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDEML+
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPYSSPVLLVRKKDG WRFCVDYRALNNVTIP+KFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G+ EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLG+FISEQGIEADPEKIRAVSEWPTP 
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E ETAF KLK+ MMTLPVL M DF+LPFEIESDASGFG+GAVLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDR+RPVYERELI VVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

TrEMBL top hitse value%identityAlignment
A0A5A7UJK0 Ty3/gypsy retrotransposon protein8.7e-24397.87Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEA+ATLLKQFASVFEWPTALPPQRSIDHHIYLKSGT+PVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPY+SPVLLVRKKDG WRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLK GYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG+FISEQGIEA PEKIRAVSEWPTPT
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAM DFNLPFEIESDASGFGVG VLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRARPVYERELIVVVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

A0A5A7UX59 Ty3/gypsy retrotransposon protein2.1e-22589.81Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        +K+WGADDQGFLVECRT+ECG LE+ EQD+G+G+ D E IATLL+QFA VFEWP  LPPQRSI+HHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPYSSPVLLVRKKDG WRFCVDYRALNNVTIP+KFPIPVIEELFDELKGASVFSK+DLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDIL+YSQG++EH+QHLEVVLGLL+++ELY N+EKCSFAKPRISYLG+FISEQG+EADPEKIRAVS+WPTPT
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVK+YGAIAAPLTQLLKKGAYKWDAE E AF+KLKK MMTLPVL M DFNLPFEI+SDASG GVGAVLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRARPVYEREL+ VVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

A0A5A7VLC6 Ty3/gypsy retrotransposon protein2.5e-23494.55Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGA+DQGFLVECRTIECG LEEHEQDR QG+EDEE IATLLKQFASVFEWPT LPPQRSI+HHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPYS PVLLVRKKDG WRFCVDYRALNNVTIP+KFPIPVIEELFD+LKGASVFSKIDLK GYHQIRMCP+DIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGI EH+QHLEVVLGLLKEKELYANLEKCSFAKPRISYLG+FISEQGIEADPEKIRAVSEWPTPT
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAE E AF KLKK MMTLPVLAM DFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRA PVYERELIV VLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

A0A5D3CD19 Peroxidase 641.4e-22992.42Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGA+DQGFLVECRT+ECGL EEHEQDR QG+EDEE IATLLKQFASVFEWPT LPP+RSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIR SKS YSSPVLLVRKKDG WRFCVDYRALNNVTIPNKFPIPVIEE FDELKGAS+FSKIDLK GY+QIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQG+ EH+QHLEVVLGLLKEKELYAN EKCSFAKPRISYLG+FISEQGIEADP+KIRAVSEWPT T
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVKDYG IA+PLTQ LKKG YKWDAE E AFDKLKK MMTLPVLAM DFNLPFEIESDASGFGVGA+LTQCRKPVAYFSKTL 
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDRARPVYERELIVVVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

A0A5D3CU05 Ty3/gypsy retrotransposon protein1.6e-22590.28Show/hide
Query:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS
        MKSWGADDQGFLVECRTIECG LEEHEQDR QG  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDEML+
Subjt:  MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLS

Query:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
        SGIIRPSKSPYSSPVLLVRKKDG WRFCVDYRALNNVTIP+KFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG
Subjt:  SGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT
        LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G+ EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLG+FISEQGIEADPEKIRAVSEWPTP 
Subjt:  LTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPT

Query:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS
        NVREVRGFLGLTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E ETAF KLK+ MMTLPVL M DF+LPFEIESDASGFG+GAVLTQCRKPVAYFSKTLS
Subjt:  NVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLS

Query:  MRDRARPVYERELIVVVLAVQR
        MRDR+RPVYERELI VVLAVQR
Subjt:  MRDRARPVYERELIVVVLAVQR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.1e-8041.67Show/hide
Query:  YRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDG-----RWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRM
        Y Y    ++E+E  + +ML+ GIIR S SPY+SP+ +V KK       ++R  +DYR LN +T+ ++ PIP ++E+  +L   + F+ IDL  G+HQI M
Subjt:  YRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDG-----RWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRM

Query:  CPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG
         PE + KTAF T  GHYE+L MPFGL NAP+TFQ  MN + +P L +  LV+ DDI+V+S  ++EH+Q L +V   L +  L   L+KC F K   ++LG
Subjt:  CPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG

Query:  YFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAY--KWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIE
        + ++  GI+ +PEKI A+ ++P PT  +E++ FLGLTGYYR+F+ ++  IA P+T+ LKK       + E ++AF KLK ++   P+L + DF   F + 
Subjt:  YFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAY--KWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIE

Query:  SDASGFGVGAVLTQCRKPVAYFSKTLSMRDRARPVYERELIVVVLAVQ
        +DAS   +GAVL+Q   P++Y S+TL+  +      E+EL+ +V A +
Subjt:  SDASGFGVGAVLTQCRKPVAYFSKTLSMRDRARPVYERELIVVVLAVQ

P20825 Retrovirus-related Pol polyprotein from transposon 2971.8e-8040.66Show/hide
Query:  HIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKD-----GRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASV
        H+   +   P+  + Y  A   + E+E  V EML+ G+IR S SPY+SP  +V KK       ++R  +DYR LN +TIP+++PIP ++E+  +L     
Subjt:  HIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKD-----GRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASV

Query:  FSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYAN
        F+ IDL  G+HQI M  E I KTAF T  GHYE+L MPFGL NAP+TFQ  MN + +P L +  LV+ DDI+++S  + EH+  +++V   L +  L   
Subjt:  FSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYAN

Query:  LEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDA---ETETAFDKLKKVMMT
        L+KC F K   ++LG+ ++  GI+ +P K++A+  +P PT  +E+R FLGLTGYYR+F+ +Y  IA P+T  LKK   K D    E   AF+KLK +++ 
Subjt:  LEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDA---ETETAFDKLKKVMMT

Query:  LPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLSMRDRARPVYERELIVVVLAVQ
         P+L + DF   F + +DAS   +GAVL+Q   P+++ S+TL+  +      E+EL+ +V A +
Subjt:  LPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLSMRDRARPVYERELIVVVLAVQ

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.3e-7440.16Show/hide
Query:  LPPQRS------IDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEE
        LPP+ +      + H I +K G     ++PY      ++E+ ++V ++L +  I PSKSP SSPV+LV KKDG +R CVDYR LN  TI + FP+P I+ 
Subjt:  LPPQRS------IDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEE

Query:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLG
        L   +  A +F+ +DL +GYHQI M P+D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    RFV V+ DDIL++S+   EH +HL+ VL 
Subjt:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLG

Query:  LLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDK
         LK + L    +KC FA     +LGY I  Q I     K  A+ ++PTP  V++ + FLG+  YYRRF+ +   IA P+ QL      +W  + + A +K
Subjt:  LLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDK

Query:  LKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRDRARPVYERELIVVVLAV
        LK  +   PVL   +    + + +DAS  G+GAVL +          V YFSK+L    +  P  E EL+ ++ A+
Subjt:  LKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRDRARPVYERELIVVVLAV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.1e-8037.47Show/hide
Query:  EEAIATLLKQFASVFEWP-TALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKK-----DGRWRFCVD
        +E + +LL +F  +FE P + +  + ++   I   +  DP+  + Y Y  + + E+ER +DE+L  GIIRPS SPY+SP+ +V KK     + ++R  VD
Subjt:  EEAIATLLKQFASVFEWP-TALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKK-----DGRWRFCVD

Query:  YRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDD
        ++ LN VTIP+ +PIP I      L  A  F+ +DL +G+HQI M   DI KTAF T  G YEFL +PFGL NAP+ FQ +++ + + ++ +   V+ DD
Subjt:  YRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDD

Query:  ILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLT
        I+V+S+  + H ++L +VL  L +  L  NLEK  F   ++ +LGY ++  GI+ADP+K+RA+SE P PT+V+E++ FLG+T YYR+F++DY  +A PLT
Subjt:  ILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLT

Query:  QLLK------------KGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQ----CRKPVAYFSKTLSMRDRARPVYERELI
         L +            K     D     +F+ LK ++ +  +LA   F  PF + +DAS + +GAVL+Q      +P+AY S++L+  +      E+E++
Subjt:  QLLK------------KGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQ----CRKPVAYFSKTLSMRDRARPVYERELI

Query:  VVV
         ++
Subjt:  VVV

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-7440.43Show/hide
Query:  LPPQRS------IDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEE
        LPP+ +      + H I +K G     ++PY      ++E+ ++V ++L +  I PSKSP SSPV+LV KKDG +R CVDYR LN  TI + FP+P I+ 
Subjt:  LPPQRS------IDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEE

Query:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLG
        L   +  A +F+ +DL +GYHQI M P+D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    RFV V+ DDIL++S+   EH +HL+ VL 
Subjt:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGINEHIQHLEVVLG

Query:  LLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDK
         LK + L    +KC FA     +LGY I  Q I     K  A+ ++PTP  V++ + FLG+  YYRRF+ +   IA P+ QL      +W  + + A DK
Subjt:  LLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDAETETAFDK

Query:  LKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRDRARPVYERELIVVVLAV
        LK  +   PVL   +    + + +DAS  G+GAVL +          V YFSK+L    +  P  E EL+ ++ A+
Subjt:  LKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRDRARPVYERELIVVVLAV

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein8.9e-0656.41Show/hide
Query:  QKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGRW
        ++  ++  + EML + II+PS SPYSSPVLLV+KKDG W
Subjt:  QKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGRW

ATMG00860.1 DNA/RNA polymerases superfamily protein7.0e-3552.67Show/hide
Query:  IQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG--YFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYK
        + HL +VL + ++ + YAN +KC+F +P+I+YLG  + IS +G+ ADP K+ A+  WP P N  E+RGFLGLTGYYRRFVK+YG I  PLT+LLKK + K
Subjt:  IQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG--YFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLTQLLKKGAYK

Query:  WDAETETAFDKLKKVMMTLPVLAMLDFNLPF
        W      AF  LK  + TLPVLA+ D  LPF
Subjt:  WDAETETAFDKLKKVMMTLPVLAMLDFNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCCTGGGGAGCAGACGATCAGGGATTCCTTGTAGAATGTCGAACCATAGAATGTGGACTGTTAGAAGAACATGAACAAGACAGAGGGCAGGGGAGGGAAGATGA
AGAAGCAATAGCCACCCTGTTAAAGCAGTTTGCTAGCGTGTTCGAATGGCCCACAGCACTCCCACCACAGCGCAGTATTGATCATCACATCTACCTGAAGAGTGGAACGG
ACCCCGTGAATGTCAGGCCATACCGGTATGCGCACCATCAGAAGGAAGAGATGGAGCGATTGGTAGACGAAATGCTTTCCTCAGGGATCATACGACCGAGCAAAAGCCCT
TATTCCAGCCCGGTGTTGTTGGTAAGGAAGAAAGACGGGCGTTGGAGGTTTTGTGTAGACTACCGAGCATTGAATAACGTGACAATCCCAAACAAGTTCCCAATACCGGT
GATAGAAGAATTGTTCGACGAATTGAAGGGAGCTAGTGTATTTTCCAAAATAGATCTCAAAGCCGGATACCATCAGATTAGGATGTGCCCCGAGGACATCGAAAAAACCG
CGTTCAGAACTCATGAAGGGCACTATGAATTCTTAGTGATGCCATTCGGATTAACGAATGCTCCATCGACCTTTCAGGCACTGATGAATCAGGTATTTAAGCCATACTTG
AGACGGTTTGTGCTGGTATTCTTCGATGATATTTTGGTCTACAGCCAAGGGATAAACGAGCACATCCAGCACTTAGAGGTGGTCTTAGGACTGCTGAAAGAAAAGGAGTT
ATATGCGAATTTGGAGAAGTGTAGTTTTGCAAAGCCTCGGATCAGTTATTTGGGGTATTTCATTTCGGAACAGGGCATTGAAGCAGATCCGGAAAAGATAAGAGCGGTTA
GTGAATGGCCAACTCCGACCAATGTGAGGGAAGTTCGGGGATTCCTTGGGCTAACCGGCTACTACCGGCGCTTTGTCAAAGACTATGGAGCAATAGCAGCGCCACTCACC
CAACTGTTGAAGAAGGGGGCGTACAAGTGGGATGCTGAAACTGAGACTGCTTTTGATAAGTTGAAGAAGGTCATGATGACTCTACCGGTACTTGCCATGCTCGACTTCAA
TCTGCCCTTCGAAATCGAATCAGATGCTTCAGGATTTGGGGTTGGGGCGGTGTTGACTCAGTGCAGAAAGCCCGTAGCTTATTTCAGTAAGACACTAAGTATGCGAGACA
GAGCGCGGCCGGTGTATGAAAGAGAGTTGATTGTCGTAGTCCTTGCAGTACAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCCTGGGGAGCAGACGATCAGGGATTCCTTGTAGAATGTCGAACCATAGAATGTGGACTGTTAGAAGAACATGAACAAGACAGAGGGCAGGGGAGGGAAGATGA
AGAAGCAATAGCCACCCTGTTAAAGCAGTTTGCTAGCGTGTTCGAATGGCCCACAGCACTCCCACCACAGCGCAGTATTGATCATCACATCTACCTGAAGAGTGGAACGG
ACCCCGTGAATGTCAGGCCATACCGGTATGCGCACCATCAGAAGGAAGAGATGGAGCGATTGGTAGACGAAATGCTTTCCTCAGGGATCATACGACCGAGCAAAAGCCCT
TATTCCAGCCCGGTGTTGTTGGTAAGGAAGAAAGACGGGCGTTGGAGGTTTTGTGTAGACTACCGAGCATTGAATAACGTGACAATCCCAAACAAGTTCCCAATACCGGT
GATAGAAGAATTGTTCGACGAATTGAAGGGAGCTAGTGTATTTTCCAAAATAGATCTCAAAGCCGGATACCATCAGATTAGGATGTGCCCCGAGGACATCGAAAAAACCG
CGTTCAGAACTCATGAAGGGCACTATGAATTCTTAGTGATGCCATTCGGATTAACGAATGCTCCATCGACCTTTCAGGCACTGATGAATCAGGTATTTAAGCCATACTTG
AGACGGTTTGTGCTGGTATTCTTCGATGATATTTTGGTCTACAGCCAAGGGATAAACGAGCACATCCAGCACTTAGAGGTGGTCTTAGGACTGCTGAAAGAAAAGGAGTT
ATATGCGAATTTGGAGAAGTGTAGTTTTGCAAAGCCTCGGATCAGTTATTTGGGGTATTTCATTTCGGAACAGGGCATTGAAGCAGATCCGGAAAAGATAAGAGCGGTTA
GTGAATGGCCAACTCCGACCAATGTGAGGGAAGTTCGGGGATTCCTTGGGCTAACCGGCTACTACCGGCGCTTTGTCAAAGACTATGGAGCAATAGCAGCGCCACTCACC
CAACTGTTGAAGAAGGGGGCGTACAAGTGGGATGCTGAAACTGAGACTGCTTTTGATAAGTTGAAGAAGGTCATGATGACTCTACCGGTACTTGCCATGCTCGACTTCAA
TCTGCCCTTCGAAATCGAATCAGATGCTTCAGGATTTGGGGTTGGGGCGGTGTTGACTCAGTGCAGAAAGCCCGTAGCTTATTTCAGTAAGACACTAAGTATGCGAGACA
GAGCGCGGCCGGTGTATGAAAGAGAGTTGATTGTCGTAGTCCTTGCAGTACAAAGATGA
Protein sequenceShow/hide protein sequence
MKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGTDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSP
YSSPVLLVRKKDGRWRFCVDYRALNNVTIPNKFPIPVIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYL
RRFVLVFFDDILVYSQGINEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGYFISEQGIEADPEKIRAVSEWPTPTNVREVRGFLGLTGYYRRFVKDYGAIAAPLT
QLLKKGAYKWDAETETAFDKLKKVMMTLPVLAMLDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLSMRDRARPVYERELIVVVLAVQR