; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0012338 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0012338
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr09:11999410..12004892
RNA-Seq ExpressionIVF0012338
SyntenyIVF0012338
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR007650 - Zf-FLZ domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035714.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.96e-12684.75Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I  SAP+SLYSHATSIIKFNGLNF D                  LSEKPAAIT ASSDED+SFYKAW+RSNRLSLMFMRMTVANNIKSTIKN EDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        K VEKCSQSESADKSLAGTL STLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLVTFI NSLPS+Y PFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        ARLKKPIIHS NLMGHKGAGKKP KKNGKGNH QLK
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK

KAA0035949.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.20e-12786.44Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I ASAPVSLYSHATSIIKFNGLNFSD                  LSEKPAAIT ASSD+DRSFYKAWERSNRLSLMFMRMTVANNIKS IKNTEDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLV FI NSLPS+YGPFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        ARLKKPIIHS NLM HKG GKKP KKNGKGNH QLK
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK

KAA0041280.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]9.45e-12787.77Show/hide
Query:  VSLYSHATSIIKFNGLNFSD---------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFMKSVEKCS
        VSL  HATSIIKFNGLNFSD                 LSEK AAIT ASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFMKSVEKCS
Subjt:  VSLYSHATSIIKFNGLNFSD---------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFMKSVEKCS

Query:  QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEEARLKKPI
        QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLVTFI NSLPS YGPFHMNYNTLKDKWNVHEL+SMLIQEEARLKKPI
Subjt:  QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEEARLKKPI

Query:  IHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        IHS NLMGHKGAGKKP KKNGKGNH QLK
Subjt:  IHSVNLMGHKGAGKKPEKKNGKGNHRQLK

KAA0056423.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]8.32e-12584.45Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I ASAPVSLYSHATSIIKFN LNFSD                  L+EKPAAIT ASSD+DRSFYK WERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        K VEKCSQSESADKSLAGTLM+TLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLV FI NSLPS+YGPFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSL
        ARLKKPIIH  NLMGHKGAGKKP KKNGKGNH QLK  
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSL

KAA0067409.1 UBN2 domain-containing protein [Cucumis melo var. makuwa]3.02e-166100Show/hide
Query:  MTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMN
        MTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMN
Subjt:  MTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMN

Query:  YNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPPRPSCPARARHHHRPLCPARARHHRRHSSLSSNTVML
        YNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPPRPSCPARARHHHRPLCPARARHHRRHSSLSSNTVML
Subjt:  YNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPPRPSCPARARHHHRPLCPARARHHRRHSSLSSNTVML

Query:  ATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEA
        ATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEA
Subjt:  ATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEA

TrEMBL top hitse value%identityAlignment
A0A5A7T3G8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-10286.44Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I ASAPVSLYSHATSIIKFNGLNFSD                  LSEKPAAIT ASSD+DRSFYKAWERSNRLSLMFMRMTVANNIKS IKNTEDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLV FI NSLPS+YGPFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        ARLKKPIIHS NLM HKG GKKP KKNGKGNH QLK
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK

A0A5A7TTF5 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-10285.59Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I  SAPVSLYSHATSIIKFNGLNFSD                  LSEKP AIT ASSDEDRSFYKAW+RSNRLSLMFM+MTVANNIKSTIKNTEDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        KSVEKCSQSESADKSL GTLMSTLTNIKFDGSRTIHEHILE+ NLAARLKTMGMEVNENFLVTFI NSLPS+YGPFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        ARLKKPIIHS NLMGHKGA KKP KKNGKGNH QLK
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK

A0A5A7UG95 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-10286.02Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I ASAPVSLYSHATSIIKFNGLNFSD                  LSEKP AIT A+SDEDRSFYKAWERSNRLSLMF+RMTVANNIK TIKNTEDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        KSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLV FI NSLPS+YGPFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        ARLKKPIIHSVNLMGHKGAGKKP KKNGKGNH QLK
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK

A0A5D3BWW5 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-10386.44Show/hide
Query:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM
        I ASAPVSLYSHATSIIKFNGLNFSD                  LSEKP AIT A+SDEDRSFYKAWERSNRLSLMF+RMTVANNIK TIKNTEDAKEFM
Subjt:  INASAPVSLYSHATSIIKFNGLNFSD----------------CTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFM

Query:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE
        KSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEM NLAARLKTMGMEVNENFLVTFI NSLPS+YGPFHMNYNTLKDKWNVHEL+SMLIQEE
Subjt:  KSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEE

Query:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK
        ARLKKPIIHSVNLMGHKGAGKKP KKNGKGNH QLK
Subjt:  ARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLK

A0A5D3DS74 UBN2 domain-containing protein5.2e-131100Show/hide
Query:  MTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMN
        MTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMN
Subjt:  MTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMN

Query:  YNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPPRPSCPARARHHHRPLCPARARHHRRHSSLSSNTVML
        YNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPPRPSCPARARHHHRPLCPARARHHRRHSSLSSNTVML
Subjt:  YNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPPRPSCPARARHHHRPLCPARARHHRRHSSLSSNTVML

Query:  ATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEA
        ATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEA
Subjt:  ATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEA

SwissProt top hitse value%identityAlignment
F4JW68 FCS-Like Zinc finger 74.1e-0848.48Show/hide
Query:  LSPRNNRRHSDEF---PWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK
        LSP N++R+  ++      S  L     C+R L  GRDIYMYKG++AFCS ECR+QQM  DE K +
Subjt:  LSPRNNRRHSDEF---PWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK

Q8GRN0 FCS-Like Zinc finger 132.7e-0744.12Show/hide
Query:  LRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA
        L + CLC+++ L G+DIYMYKGE  FCSAECR  Q+  DE +E+C T   + +  ++S      ++SA
Subjt:  LRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA

Q8VY80 FCS-Like Zinc finger 51.6e-1250.56Show/hide
Query:  MLSPR-NNRRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA
        M+SPR   RRHS ++  S   LR+  LC+R L+ GRDIYMY+G+ AFCS ECRQQQ+  DE KEK    S + +  +A+  TT  +VSA
Subjt:  MLSPR-NNRRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA

Q8VZM9 FCS-Like Zinc finger 22.4e-0845.59Show/hide
Query:  SPRNNRRHSDEFPWS------SHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK
        SPR+ + H   F  S       H L +  LC++RL   RDI+MY+G++ FCS ECR++Q+ +DEAKEK
Subjt:  SPRNNRRHSDEFPWS------SHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK

Q9SGZ8 FCS-Like Zinc finger 62.0e-1553.16Show/hide
Query:  MLSPRNN-RRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIAS
        M++PR N RRHS +F  + H LR+  LC+R L+ GRDIYMY+G+ AFCS+ECRQ+QM QDE KEK  +A+     A+ +
Subjt:  MLSPRNN-RRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIAS

Arabidopsis top hitse value%identityAlignment
AT1G22160.1 Protein of unknown function (DUF581)1.1e-1350.56Show/hide
Query:  MLSPR-NNRRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA
        M+SPR   RRHS ++  S   LR+  LC+R L+ GRDIYMY+G+ AFCS ECRQQQ+  DE KEK    S + +  +A+  TT  +VSA
Subjt:  MLSPR-NNRRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA

AT1G74940.1 Protein of unknown function (DUF581)1.9e-0844.12Show/hide
Query:  LRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA
        L + CLC+++ L G+DIYMYKGE  FCSAECR  Q+  DE +E+C T   + +  ++S      ++SA
Subjt:  LRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSA

AT1G78020.1 Protein of unknown function (DUF581)1.4e-1653.16Show/hide
Query:  MLSPRNN-RRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIAS
        M++PR N RRHS +F  + H LR+  LC+R L+ GRDIYMY+G+ AFCS+ECRQ+QM QDE KEK  +A+     A+ +
Subjt:  MLSPRNN-RRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIAS

AT4G17670.1 Protein of unknown function (DUF581)1.7e-0945.59Show/hide
Query:  SPRNNRRHSDEFPWS------SHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK
        SPR+ + H   F  S       H L +  LC++RL   RDI+MY+G++ FCS ECR++Q+ +DEAKEK
Subjt:  SPRNNRRHSDEFPWS------SHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK

AT4G39795.1 Protein of unknown function (DUF581)2.9e-0948.48Show/hide
Query:  LSPRNNRRHSDEF---PWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK
        LSP N++R+  ++      S  L     C+R L  GRDIYMYKG++AFCS ECR+QQM  DE K +
Subjt:  LSPRNNRRHSDEF---PWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCCCCCAGAAACAATCGAAGGCATTCCGATGAGTTTCCCTGGTCTTCTCACCACCTCCGTGCTTTCTGCCTCTGCCAACGCCGCCTCCTCGCCGGCCGA
GATATTTACATGTACAAGGGAGAAAGCGCTTTTTGTAGTGCAGAGTGCCGGCAACAGCAGATGAATCAAGACGAGGCTAAGGAGAAATGTTTGACGGCGTCGAAG
AAAGGATCGACGGCAATCGCATCGGCTCCGACCACCGTGACAAAAGTCTCTGCTATTAATGCATCCGCCCCTGTTTCTCTTTATTCGCATGCTACATCTATAATA
AAGTTTAACGGACTCAATTTCTCTGATTGCACTTTAAGTGAGAAACCTGCTGCAATTACTCTTGCTAGCAGTGATGAGGATAGATCTTTCTATAAAGCTTGGGAA
AGATCAAATAGATTGAGCTTAATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCTACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTG
GAAAAATGTTCTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCAT
ATCCTTGAAATGATGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAACATTTATCCCTAATTCCTTACCTTCAGATTAT
GGTCCATTTCACATGAACTATAACACTCTAAAAGATAAATGGAATGTGCATGAATTAAAAAGTATGCTCATTCAAGAGGAAGCGAGACTTAAGAAACCAATAATT
CACTCTGTCAATCTCATGGGTCATAAAGGAGCTGGAAAGAAACCTGAAAAAAAGAATGGCAAGGGCAATCATAGACAATTAAAGTCGCTGCCACCACCACCACCT
CGTCCCTCGTGCCCAGCCCGTGCCCGCCACCACCATCGTCCTTTGTGCCCAGCTCGTGCCCGCCACCACCGTCGTCACTCCTCCCTTTCTTCAAACACTGTTATG
CTTGCCACCACCGTCGGGAGTCTTTTTCTTCAACCATGTTCATGTCTAGCCCTCGCCACCACCGTCGCACGTCTGCGTCTATATCACAAGCTCCGATCTGGTTGG
CTTGTGGCCAGATCTGTTGCAGAAGCATTATGCTGGGAAAAGTTTGGGTGGGAGAACTCGAAGGCAGATGAATTGCTTTTGCCAGTCCTGAAAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCCCCCAGAAACAATCGAAGGCATTCCGATGAGTTTCCCTGGTCTTCTCACCACCTCCGTGCTTTCTGCCTCTGCCAACGCCGCCTCCTCGCCGGCCGA
GATATTTACATGTACAAGGGAGAAAGCGCTTTTTGTAGTGCAGAGTGCCGGCAACAGCAGATGAATCAAGACGAGGCTAAGGAGAAATGTTTGACGGCGTCGAAG
AAAGGATCGACGGCAATCGCATCGGCTCCGACCACCGTGACAAAAGTCTCTGCTATTAATGCATCCGCCCCTGTTTCTCTTTATTCGCATGCTACATCTATAATA
AAGTTTAACGGACTCAATTTCTCTGATTGCACTTTAAGTGAGAAACCTGCTGCAATTACTCTTGCTAGCAGTGATGAGGATAGATCTTTCTATAAAGCTTGGGAA
AGATCAAATAGATTGAGCTTAATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCTACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTG
GAAAAATGTTCTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCAT
ATCCTTGAAATGATGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAACATTTATCCCTAATTCCTTACCTTCAGATTAT
GGTCCATTTCACATGAACTATAACACTCTAAAAGATAAATGGAATGTGCATGAATTAAAAAGTATGCTCATTCAAGAGGAAGCGAGACTTAAGAAACCAATAATT
CACTCTGTCAATCTCATGGGTCATAAAGGAGCTGGAAAGAAACCTGAAAAAAAGAATGGCAAGGGCAATCATAGACAATTAAAGTCGCTGCCACCACCACCACCT
CGTCCCTCGTGCCCAGCCCGTGCCCGCCACCACCATCGTCCTTTGTGCCCAGCTCGTGCCCGCCACCACCGTCGTCACTCCTCCCTTTCTTCAAACACTGTTATG
CTTGCCACCACCGTCGGGAGTCTTTTTCTTCAACCATGTTCATGTCTAGCCCTCGCCACCACCGTCGCACGTCTGCGTCTATATCACAAGCTCCGATCTGGTTGG
CTTGTGGCCAGATCTGTTGCAGAAGCATTATGCTGGGAAAAGTTTGGGTGGGAGAACTCGAAGGCAGATGAATTGCTTTTGCCAGTCCTGAAAGAGTAA
Protein sequenceShow/hide protein sequence
MLSPRNNRRHSDEFPWSSHHLRAFCLCQRRLLAGRDIYMYKGESAFCSAECRQQQMNQDEAKEKCLTASKKGSTAIASAPTTVTKVSAINASAPVSLYSHATSII
KFNGLNFSDCTLSEKPAAITLASSDEDRSFYKAWERSNRLSLMFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEH
ILEMMNLAARLKTMGMEVNENFLVTFIPNSLPSDYGPFHMNYNTLKDKWNVHELKSMLIQEEARLKKPIIHSVNLMGHKGAGKKPEKKNGKGNHRQLKSLPPPPP
RPSCPARARHHHRPLCPARARHHRRHSSLSSNTVMLATTVGSLFLQPCSCLALATTVARLRLYHKLRSGWLVARSVAEALCWEKFGWENSKADELLLPVLKE