; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0011596 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0011596
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionZf-CCHC domain-containing protein/UBN2 domain-containing protein
Genome locationchr06:16135209..16142463
RNA-Seq ExpressionPay0011596
SyntenyPay0011596
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]2.4e-12170.68Show/hide
Query:  RKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQ-LEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQL
        RKI   LPKTW+AKVTAIQ+ KDLTK+ LEELIGSLMTHEII+E+ LEDESKKKKSIAL TISLE  LEDEDDLD DDI YFSRKYKNFIKRKK FK+ L
Subjt:  RKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQ-LEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQL

Query:  STQKESKGEKSKKDEVICYACKKSGHIRTNFSLL-----------KATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQN
        STQK SKGEKSKKDEVICY CKK  HIRT+   L           KATWDDSSESESEVEE  NLGLM  S+KEDEHDDEVTLEPPSI+ELFE FE++QN
Subjt:  STQKESKGEKSKKDEVICYACKKSGHIRTNFSLL-----------KATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQN

Query:  DLENL-----------NVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------T
        DLE L           NVL+SENKSLLDKIACFKEN N  QI+ELNVS+DKH+  CNEKDALLDKV FLEHDSCEKDNLIKVLKENELN          T
Subjt:  DLENL-----------NVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------T

Query:  IKKLTIGVQRLGKMIEVGKSYGDKRG-GYIDESSTPS------------------------ILYLYVIIVVLKVTLDLNALS
        IKKLTIG QRL K+IEVGKSYGDKR  GYIDESST S                        +L LYVIIVVLKVTLDLNAL+
Subjt:  IKKLTIGVQRLGKMIEVGKSYGDKRG-GYIDESSTPS------------------------ILYLYVIIVVLKVTLDLNALS

XP_022931810.1 uncharacterized protein LOC111438099 [Cucurbita moschata]8.6e-11170.33Show/hide
Query:  MANL-LANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFN
        MANL + N   E Q T RPPYFD +NY  WK RMKIYLQS+DY LWL V+ G Y+ +K V+N++ PK+E E+DE+ MK CS NA AINCLYCALS DEFN
Subjt:  MANL-LANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFN

Query:  RISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLT
        R+ MCS A EIW  LEVTH+GTNQVKE+KISM+VHNYELFKM+ NE I DMFTRFTNI+NALK  GKVY+TS+NVRKI R LPK+WEAKVTAIQ+ KDLT
Subjt:  RISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLT

Query:  KILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSG
        K+ L+EL+GSLMTHEI +   +E+ESKKKKSIALK  S++VD EDED LD DD+AYF+RKYKNFIKRKK FK+  + QKESKGEKSK DEVICY CKK G
Subjt:  KILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSG

XP_031739764.1 uncharacterized protein LOC116403291 [Cucumis sativus]4.0e-10885.54Show/hide
Query:  MKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNRISMCSFAQEIWNALEVTHKGTNQVKESKISMI
        MKIYLQSIDYNLWLIVAKG YV MK VDNVD PK+EEEYDEN MK CSFNAKAINCLYCALSKDEFNRISMCS AQEIWN LE+TH+GTNQVKESKISM 
Subjt:  MKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNRISMCSFAQEIWNALEVTHKGTNQVKESKISMI

Query:  VHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIII-EQLEDESKKKKSIA
        VHNYELFKMDANETI DMFTRFTNIINALKG GKVYTTS+NVRKI R LPKTWEAKVTAIQ+ KDLTK+ LEELIGSLMTHEII+ E LEDESKKKKSIA
Subjt:  VHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIII-EQLEDESKKKKSIA

Query:  LKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESK
        LKTISLEVD EDED LD DDIAYFSRKYKNFIKRKK F+E     K  K
Subjt:  LKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESK

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]4.7e-21881.6Show/hide
Query:  MANLLANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNR
        MANLLAN IVE Q T RPPYFD SNYAYWK RMKIYLQSIDYNLWLIVAKG YV MK VDNVD PK+EEEYDEN MK CSFNAKAINCLYCALSKDEFNR
Subjt:  MANLLANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTK
        ISMCS AQEIWN LE+TH+GTNQVKESKISM VHNYELFKMDANETI DMFTRFTNIINALKG GKVYTTS+NVRKI R LPKTWEAKVTAIQ+ KDLTK
Subjt:  ISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTK

Query:  ILLEELIGSLMTHEIII-EQLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGH
        + LEELIGSLMTHEII+ E LEDESKKKKSIALKTISLEVD EDED LD DDIAYFSRKYKNFIKRKKYFK+ LSTQKESKGEKSKKDEVICY CK+SGH
Subjt:  ILLEELIGSLMTHEIII-EQLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGH

Query:  IRTNFSLL-----------KATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQNDLENL-----------NVLTSENKSL
        IRT+  LL           KATWDDSSESESEVEEM NLGLMAHS+K+DEHDD+VTLEP SIDELFE FESMQNDLE L           NVL SENKSL
Subjt:  IRTNFSLL-----------KATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQNDLENL-----------NVLTSENKSL

Query:  LDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------TIKKLTIGVQRLGKMIEVGKSYGDKRG
        LD IACFKENEN  QI+ELNVS+DKHVC   EKDALLDKV FLEHDSCEKDNLIKVLKENEL+          TIKKLTIG QRL K+IEVGKSYGDKRG
Subjt:  LDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------TIKKLTIGVQRLGKMIEVGKSYGDKRG

Query:  -GYIDESSTPS
         GYIDESSTPS
Subjt:  -GYIDESSTPS

XP_038895919.1 uncharacterized protein LOC120084093 [Benincasa hispida]8.7e-9554.4Show/hide
Query:  MANLLANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNR
        MA    N   E Q T RPP FD +NYA+WKTRM+IYL SIDYNLW IV  G  +  K VDN D PK E++ ++   K  S NAKA+NCL+C L  +EFN+
Subjt:  MANLLANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTK
        IS C+ A+EIW+ L+VTH+GTNQVKESKISM+VHNY+LFKMDANETI +MFTRFTNI+N LKG GK YTTS+NVRKI R LPK+WEAKVT IQ+ KDL+K
Subjt:  ISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTK

Query:  ILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGH
        + LEEL+GSLM HEII++  +E++ KKKK++ LK+  ++ D E E +L+ ++ AY ++K+K    RK+ F ++++ Q E KGEKS +D +ICY CKK GH
Subjt:  ILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGH

Query:  IRTNF-----------SLLKATWDDSSESESEVEE-MENLGLMAHSNKEDEHDDEVTLEPPSID
        +  ++             +KAT D+S ESE E EE + NL +MA  + +D+ DDEV+ E  + D
Subjt:  IRTNF-----------SLLKATWDDSSESESEVEE-MENLGLMAHSNKEDEHDDEVTLEPPSID

TrEMBL top hitse value%identityAlignment
A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein1.2e-12170.68Show/hide
Query:  RKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQ-LEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQL
        RKI   LPKTW+AKVTAIQ+ KDLTK+ LEELIGSLMTHEII+E+ LEDESKKKKSIAL TISLE  LEDEDDLD DDI YFSRKYKNFIKRKK FK+ L
Subjt:  RKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQ-LEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQL

Query:  STQKESKGEKSKKDEVICYACKKSGHIRTNFSLL-----------KATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQN
        STQK SKGEKSKKDEVICY CKK  HIRT+   L           KATWDDSSESESEVEE  NLGLM  S+KEDEHDDEVTLEPPSI+ELFE FE++QN
Subjt:  STQKESKGEKSKKDEVICYACKKSGHIRTNFSLL-----------KATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQN

Query:  DLENL-----------NVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------T
        DLE L           NVL+SENKSLLDKIACFKEN N  QI+ELNVS+DKH+  CNEKDALLDKV FLEHDSCEKDNLIKVLKENELN          T
Subjt:  DLENL-----------NVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------T

Query:  IKKLTIGVQRLGKMIEVGKSYGDKRG-GYIDESSTPS------------------------ILYLYVIIVVLKVTLDLNALS
        IKKLTIG QRL K+IEVGKSYGDKR  GYIDESST S                        +L LYVIIVVLKVTLDLNAL+
Subjt:  IKKLTIGVQRLGKMIEVGKSYGDKRG-GYIDESSTPS------------------------ILYLYVIIVVLKVTLDLNALS

A0A5A7U923 Zf-CCHC domain-containing protein/UBN2 domain-containing protein5.1e-8561.16Show/hide
Query:  GFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKN
        G GKVYTT +N RKI R LPKTW AKVTAIQ+ KDLTK+  EELIGSLMTHEII++  LEDESKK KS+ALKTI LEVD +DEDDLD +DIAYFSRKYKN
Subjt:  GFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKN

Query:  FIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGHIRTNFSLLKATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQ
        FIKRKKYFK+ +S+QKESK EKSKKDE                  +KATWDDSS SESEVE+M +LGLMAHSNKEDEHDDE                   
Subjt:  FIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGHIRTNFSLLKATWDDSSESESEVEEMENLGLMAHSNKEDEHDDEVTLEPPSIDELFEMFESMQ

Query:  NDLENLNVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------TIKKLTIGVQR
                                           NVS+ KHVC CNEK+ALLDKV FLEHD CEKDNLIKVLKENELN          TIKKLTI  QR
Subjt:  NDLENLNVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN----------TIKKLTIGVQR

Query:  LGKMIEVGKSYGDKRG-GYIDESSTPS
        LG++IEVGKSYGDKRG GYIDE STPS
Subjt:  LGKMIEVGKSYGDKRG-GYIDESSTPS

A0A5D3DLU8 UBN2 domain-containing protein4.3e-9263.1Show/hide
Query:  FKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQ-LEDESKKKKSIALKTISL
        + MDANETI D+FTRFTNIINALK  GK+YTTS+N RKI R LPKTWEAKV AIQ+ K   K+ LEELIGSLMTHEIII++ LEDESKKKKSIALKTISL
Subjt:  FKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQ-LEDESKKKKSIALKTISL

Query:  EVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGHIRTNFSLLKATWDDSSESESEVEEMENLGLMAHSNKED
        EVDL+DEDDLD DDIAYFS+KYKNFIK K   +     +K  K +  KK +      KK+         +KATWDDSSES  EVE+M  LGLMAH     
Subjt:  EVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGHIRTNFSLLKATWDDSSESESEVEEMENLGLMAHSNKED

Query:  EHDDEVTLEPPSIDELFEMFESMQNDLENLNVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKEN
        E+   V L+                  +  NVLTSENKSLL K ACFKENENV+QI+ELNVS+DKHVC CNEKDALLDKV FL+HD CEKDNLIKVLKEN
Subjt:  EHDDEVTLEPPSIDELFEMFESMQNDLENLNVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKEN

Query:  ELN----------TIKKLTIGVQRLGKMIEVGKSYG
        ELN          TI+KLTI  +RL K+I VGKSYG
Subjt:  ELN----------TIKKLTIGVQRLGKMIEVGKSYG

A0A6J1F0H1 uncharacterized protein LOC1114380994.2e-11170.33Show/hide
Query:  MANL-LANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFN
        MANL + N   E Q T RPPYFD +NY  WK RMKIYLQS+DY LWL V+ G Y+ +K V+N++ PK+E E+DE+ MK CS NA AINCLYCALS DEFN
Subjt:  MANL-LANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFN

Query:  RISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLT
        R+ MCS A EIW  LEVTH+GTNQVKE+KISM+VHNYELFKM+ NE I DMFTRFTNI+NALK  GKVY+TS+NVRKI R LPK+WEAKVTAIQ+ KDLT
Subjt:  RISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLT

Query:  KILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSG
        K+ L+EL+GSLMTHEI +   +E+ESKKKKSIALK  S++VD EDED LD DD+AYF+RKYKNFIKRKK FK+  + QKESKGEKSK DEVICY CKK G
Subjt:  KILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSG

A0A6J1I2X4 uncharacterized protein LOC1114704651.0e-9368.97Show/hide
Query:  MANL-LANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFN
        MANL + N   E Q T RPPYFD +NY  WK RMKIYLQS+D+ LWL V+ G Y+ +K V+N++ PK+E E+DE+ MK CS NA AINCLYCALS DEFN
Subjt:  MANL-LANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFN

Query:  RISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLT
        R+ MCS A EIW  LEVTH+GTNQVKE+KISM+VHNYELFKM+ NE I DMFTRFTNI+NALK  GKVY+TS+NVRKI R LPK+WEAKVTAIQ+ KDLT
Subjt:  RISMCSFAQEIWNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLT

Query:  KILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKY
        K+ L+EL+GSLMTHEI +   +E+ESKKKKSIALK  S++VD EDED LD DD+AYF+RKY
Subjt:  KILLEELIGSLMTHEIIIE-QLEDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACCTATTGGCAAATGAGATTGTTGAAAGCCAATGTACTCCTAGACCTCCTTATTTTGATAGTTCAAATTATGCATATTGGAAAACTAGAATGAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGCCTTTATGTTCTCATGAAAAAGGTTGATAATGTTGATAAGCCTAAAGTAGAAGAAGAGTATGACGAAA
ATGGAATGAAAAATTGTTCTTTTAATGCTAAAGCTATTAATTGTTTATATTGTGCTTTGAGTAAAGATGAATTCAATAGAATTTCCATGTGTTCTTTTGCTCAAGAAATT
TGGAATGCTCTTGAAGTTACTCATAAAGGAACAAATCAAGTTAAAGAGTCTAAAATTAGCATGATTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTAT
CATTGATATGTTTACTAGATTTACTAACATTATAAATGCTTTGAAGGGTTTTGGTAAAGTCTATACAACTTCGAAAAATGTTAGAAAAATTCCAAGGATTTTACCTAAGA
CTTGGGAAGCTAAGGTAACGGCAATCCAAAAAACAAAGGATCTCACCAAAATTCTACTAGAGGAGCTTATTGGCTCACTCATGACTCATGAGATCATTATTGAGCAGTTA
GAGGATGAGTCCAAAAAGAAGAAGAGCATTGCATTAAAGACCATCTCCTTGGAGGTTGATCTCGAAGATGAGGATGACCTTGATGTAGATGACATTGCTTATTTCTCACG
TAAGTACAAAAATTTCATAAAAAGGAAGAAATATTTCAAGGAACAGCTATCAACTCAAAAAGAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATGCAT
GCAAAAAGTCGGGTCACATAAGAACAAATTTCTCTCTCTTGAAGGCTACTTGGGATGATAGTAGTGAAAGTGAAAGTGAAGTTGAAGAAATGGAAAACCTTGGTCTCATG
GCTCATAGTAACAAAGAAGATGAACATGATGATGAGGTAACTCTAGAACCTCCTTCTATTGATGAATTGTTTGAAATGTTTGAAAGCATGCAAAATGACCTAGAAAACTT
AAATGTTTTAACTAGTGAAAATAAGTCTTTACTCGATAAAATTGCTTGCTTTAAAGAGAATGAAAATGTTATACAAATTAAAGAATTAAATGTCTCTAATGATAAGCATG
TTTGTTACTGTAACGAGAAAGATGCTTTGCTTGACAAAGTTACATTTCTTGAGCATGATAGTTGTGAAAAGGATAACTTGATTAAAGTACTTAAAGAAAATGAACTAAAT
ACGATTAAAAAGTTAACAATAGGTGTTCAAAGATTGGGCAAAATGATTGAAGTAGGAAAATCTTATGGTGATAAGAGAGGTGGCTATATTGATGAATCATCTACTCCTTC
AATTTTGTATCTATATGTCATAATTGTGGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAACCTATTGGCAAATGAGATTGTTGAAAGCCAATGTACTCCTAGACCTCCTTATTTTGATAGTTCAAATTATGCATATTGGAAAACTAGAATGAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGCCTTTATGTTCTCATGAAAAAGGTTGATAATGTTGATAAGCCTAAAGTAGAAGAAGAGTATGACGAAA
ATGGAATGAAAAATTGTTCTTTTAATGCTAAAGCTATTAATTGTTTATATTGTGCTTTGAGTAAAGATGAATTCAATAGAATTTCCATGTGTTCTTTTGCTCAAGAAATT
TGGAATGCTCTTGAAGTTACTCATAAAGGAACAAATCAAGTTAAAGAGTCTAAAATTAGCATGATTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTAT
CATTGATATGTTTACTAGATTTACTAACATTATAAATGCTTTGAAGGGTTTTGGTAAAGTCTATACAACTTCGAAAAATGTTAGAAAAATTCCAAGGATTTTACCTAAGA
CTTGGGAAGCTAAGGTAACGGCAATCCAAAAAACAAAGGATCTCACCAAAATTCTACTAGAGGAGCTTATTGGCTCACTCATGACTCATGAGATCATTATTGAGCAGTTA
GAGGATGAGTCCAAAAAGAAGAAGAGCATTGCATTAAAGACCATCTCCTTGGAGGTTGATCTCGAAGATGAGGATGACCTTGATGTAGATGACATTGCTTATTTCTCACG
TAAGTACAAAAATTTCATAAAAAGGAAGAAATATTTCAAGGAACAGCTATCAACTCAAAAAGAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATGCAT
GCAAAAAGTCGGGTCACATAAGAACAAATTTCTCTCTCTTGAAGGCTACTTGGGATGATAGTAGTGAAAGTGAAAGTGAAGTTGAAGAAATGGAAAACCTTGGTCTCATG
GCTCATAGTAACAAAGAAGATGAACATGATGATGAGGTAACTCTAGAACCTCCTTCTATTGATGAATTGTTTGAAATGTTTGAAAGCATGCAAAATGACCTAGAAAACTT
AAATGTTTTAACTAGTGAAAATAAGTCTTTACTCGATAAAATTGCTTGCTTTAAAGAGAATGAAAATGTTATACAAATTAAAGAATTAAATGTCTCTAATGATAAGCATG
TTTGTTACTGTAACGAGAAAGATGCTTTGCTTGACAAAGTTACATTTCTTGAGCATGATAGTTGTGAAAAGGATAACTTGATTAAAGTACTTAAAGAAAATGAACTAAAT
ACGATTAAAAAGTTAACAATAGGTGTTCAAAGATTGGGCAAAATGATTGAAGTAGGAAAATCTTATGGTGATAAGAGAGGTGGCTATATTGATGAATCATCTACTCCTTC
AATTTTGTATCTATATGTCATAATTGTGGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAGTTGA
Protein sequenceShow/hide protein sequence
MANLLANEIVESQCTPRPPYFDSSNYAYWKTRMKIYLQSIDYNLWLIVAKGLYVLMKKVDNVDKPKVEEEYDENGMKNCSFNAKAINCLYCALSKDEFNRISMCSFAQEI
WNALEVTHKGTNQVKESKISMIVHNYELFKMDANETIIDMFTRFTNIINALKGFGKVYTTSKNVRKIPRILPKTWEAKVTAIQKTKDLTKILLEELIGSLMTHEIIIEQL
EDESKKKKSIALKTISLEVDLEDEDDLDVDDIAYFSRKYKNFIKRKKYFKEQLSTQKESKGEKSKKDEVICYACKKSGHIRTNFSLLKATWDDSSESESEVEEMENLGLM
AHSNKEDEHDDEVTLEPPSIDELFEMFESMQNDLENLNVLTSENKSLLDKIACFKENENVIQIKELNVSNDKHVCYCNEKDALLDKVTFLEHDSCEKDNLIKVLKENELN
TIKKLTIGVQRLGKMIEVGKSYGDKRGGYIDESSTPSILYLYVIIVVLKVTLDLNALS