CuGenDBv2

Pay0007207 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0007207
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein
Genome location: chr10:12405827..12408633
RNA-Seq Expression: Pay0007207
Synteny: Pay0007207
Gene Ontology terms:
  GO:0016020 - membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa] | e value: 1.1e-123 | % identity: 72.95
Query:  RKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLST
        RKIL  LPKTW+AKVT IQEAKDLTKLPLEELIGSLMTHEII ++HLEDESK KKSIALNTISLE+EDEDDLDEDDI+YFS KYKNFIKRKK FKK+LST
Subjt:  RKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLST

Query:  QKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHDDEVTLEPP-IDKLFGNLESIQNYL
        QK SKGEKSKKDEVICYECKK  HIRTDCP LKSSKKSK+KAMKATWDDS ESESEVEE ANLGLM  SDKEDEHDDEVTLEPP I++LF N E++QN L
Subjt:  QKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHDDEVTLEPP-IDKLFGNLESIQNYL

Query:  EKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENM--------------------LCLIKLDFLS----MIDNLIKVLKENELNVLQDLDKAKETIKK
        EKLSSKYVVLKKKYNVL+S+NKSLLDKI CFKEN N                       L K+ FL       DNLIKVLKENELNVLQDLDKAKETIKK
Subjt:  EKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENM--------------------LCLIKLDFLS----MIDNLIKVLKENELNVLQDLDKAKETIKK

Query:  LTIGAQRLDKIIEVGKSYGDKR-------------------ASPNVPKLNMPNDVPNQVKSSFVSI
        LTIGAQRLDKIIEVGKSYGDKR                   ASP VPK NM   + N V   +V I
Subjt:  LTIGAQRLDKIIEVGKSYGDKR-------------------ASPNVPKLNMPNDVPNQVKSSFVSI

TYK02592.1 zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa] | e value: 1.4e-94 | % identity: 78.6
Query:  MDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV
        MDANETITDMFTRFTNIINALKG GKVYTTSENVRKILRSLPKTWEA                              K+HL+DESK KKSIALNTISLE+
Subjt:  MDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV

Query:  EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLM
        EDEDDLDEDDI YFS KY+NFIKRKKYFKKHLSTQKESKGEK+KKDEVI YECKK G+IRTDCP LKSSKKSKKKA+KATWDDS +SESEVEEMANLGLM
Subjt:  EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLM

Query:  AHSDKEDEHDDEVTLE-PPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKS
        AHSDKEDEHDDEVTLE P ID+LF N ES+QN LEKLSSK VVLKKKYNVLTS+NKS
Subjt:  AHSDKEDEHDDEVTLE-PPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKS

XP_022931810.1 uncharacterized protein LOC111438099 [Cucurbita moschata] | e value: 2.5e-107 | % identity: 68.79
Query:  MENLLV-NEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFN
        M NL V N   E QSTSRPPYFDG+NY  WK R+KIYLQS+DY LWL V+ G Y+P+K V+N++ PKLE ++ E++MKKCS NA AINCLYCALSNDEFN
Subjt:  MENLLV-NEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFN

Query:  RISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLT
        R+ M                     K++KISMLVHNYELFKM+ NE I DMFTRFTNI+NALK  GKVY+TSENVRKILRSLPK+WEAKVT IQEAKDLT
Subjt:  RISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLT

Query:  KLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPG
        KLPL+EL+GSLMTHEI    H+E+ESK KKSIAL +I ++ EDED LDEDD+ YF+ KYKNFIKRKK FKKH + QKESKGEKSK DEVICYECKKPG
Subjt:  KLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPG

XP_031739764.1 uncharacterized protein LOC116403291 [Cucumis sativus] | e value: 2.6e-96 | % identity: 81.67
Query:  IKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFNRISM--------------------FKKSKISML
        +KIYLQSIDYNLWLIVAKG YVPMK VDNVD PKLEE+Y ENEMKKCSFNAKAINCLYCALS DEFNRISM                     K+SKISM 
Subjt:  IKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFNRISM--------------------FKKSKISML

Query:  VHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIA
        VHNYELFKMDANETITDMFTRFTNIINALKG GKVYTTSENVRKILRSLPKTWEAKVT IQEAKDLTKLPLEELIGSLMTHEII K+HLEDESK KKSIA
Subjt:  VHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIA

Query:  LNTISLEV--EDEDDLDEDDILYFSHKYKNFIKRKKYFKK
        L TISLEV  EDED LDEDDI YFS KYKNFIKRKK F++
Subjt:  LNTISLEV--EDEDDLDEDDILYFSHKYKNFIKRKKYFKK

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus] | e value: 1.8e-243 | % identity: 77.23
Query:  MENLLVNEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFNR
        M NLL N I+E QSTSRPPYFDGSNYAYWK R+KIYLQSIDYNLWLIVAKG YVPMK VDNVD PKLEE+Y ENEMKKCSFNAKAINCLYCALS DEFNR
Subjt:  MENLLVNEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFNR

Query:  ISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTK
        ISM                     K+SKISM VHNYELFKMDANETITDMFTRFTNIINALKG GKVYTTSENVRKILRSLPKTWEAKVT IQEAKDLTK
Subjt:  ISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTK

Query:  LPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV--EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGH
        LPLEELIGSLMTHEII K+HLEDESK KKSIAL TISLEV  EDED LDEDDI YFS KYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECK+ GH
Subjt:  LPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV--EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGH

Query:  IRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHDDEVTLEP-PIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKSL
        IRTDCPLLKSSKKSKKKAMKATWDDS ESESEVEEMANLGLMAHSDK+DEHDD+VTLEP  ID+LF N ES+QN LEKLSSKYVVLKKKYNVL S+NKSL
Subjt:  IRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHDDEVTLEP-PIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKSL

Query:  LDKIDCFKENENM-------------LCLIKLDFLSMI----------DNLIKVLKENELNVLQDLDKAKETIKKLTIGAQRLDKIIEVGKSYGDKR---
        LD I CFKENEN              +C+ K   L  +          DNLIKVLKENEL+VLQ+LDKAKETIKKLTIGAQRLDKIIEVGKSYGDKR   
Subjt:  LDKIDCFKENENM-------------LCLIKLDFLSMI----------DNLIKVLKENELNVLQDLDKAKETIKKLTIGAQRLDKIIEVGKSYGDKR---

Query:  -----------------ASPNVPKLNMPNDVPNQVKSSFVSICHNCCVEGHIRPKCFKMKYAHTSSSRRNISQRAKLHNAPRKNFSKKSRVHKFVVKNKS
                         ASP VPK NM N V N VKSSFV ICHNC VEGHIRPKCFK+KYA  + SRRN SQRAK + APRKNFS KSRVHKFV+KNKS
Subjt:  -----------------ASPNVPKLNMPNDVPNQVKSSFVSICHNCCVEGHIRPKCFKMKYAHTSSSRRNISQRAKLHNAPRKNFSKKSRVHKFVVKNKS

Query:  FHNVVC
         HNVVC
Subjt:  FHNVVC

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein | e value: 5.4e-124 | % identity: 72.95
Query:  RKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLST
        RKIL  LPKTW+AKVT IQEAKDLTKLPLEELIGSLMTHEII ++HLEDESK KKSIALNTISLE+EDEDDLDEDDI+YFS KYKNFIKRKK FKK+LST
Subjt:  RKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLST

Query:  QKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHDDEVTLEPP-IDKLFGNLESIQNYL
        QK SKGEKSKKDEVICYECKK  HIRTDCP LKSSKKSK+KAMKATWDDS ESESEVEE ANLGLM  SDKEDEHDDEVTLEPP I++LF N E++QN L
Subjt:  QKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHDDEVTLEPP-IDKLFGNLESIQNYL

Query:  EKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENM--------------------LCLIKLDFLS----MIDNLIKVLKENELNVLQDLDKAKETIKK
        EKLSSKYVVLKKKYNVL+S+NKSLLDKI CFKEN N                       L K+ FL       DNLIKVLKENELNVLQDLDKAKETIKK
Subjt:  EKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENM--------------------LCLIKLDFLS----MIDNLIKVLKENELNVLQDLDKAKETIKK

Query:  LTIGAQRLDKIIEVGKSYGDKR-------------------ASPNVPKLNMPNDVPNQVKSSFVSI
        LTIGAQRLDKIIEVGKSYGDKR                   ASP VPK NM   + N V   +V I
Subjt:  LTIGAQRLDKIIEVGKSYGDKR-------------------ASPNVPKLNMPNDVPNQVKSSFVSI

A0A5D3BUV2 Zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein | e value: 6.8e-95 | % identity: 78.6
Query:  MDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV
        MDANETITDMFTRFTNIINALKG GKVYTTSENVRKILRSLPKTWEA                              K+HL+DESK KKSIALNTISLE+
Subjt:  MDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV

Query:  EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLM
        EDEDDLDEDDI YFS KY+NFIKRKKYFKKHLSTQKESKGEK+KKDEVI YECKK G+IRTDCP LKSSKKSKKKA+KATWDDS +SESEVEEMANLGLM
Subjt:  EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLM

Query:  AHSDKEDEHDDEVTLE-PPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKS
        AHSDKEDEHDDEVTLE P ID+LF N ES+QN LEKLSSK VVLKKKYNVLTS+NKS
Subjt:  AHSDKEDEHDDEVTLE-PPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKS

A0A5D3DLU8 UBN2 domain-containing protein | e value: 6.9e-87 | % identity: 60.22
Query:  FKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISL
        + MDANETITD+FTRFTNIINALK  GK+YTTSEN RKILRSLPKTWEAKV  IQEAK   KLPLEELIGSLMTHEII KKHLEDESK KKSIAL TISL
Subjt:  FKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISL

Query:  EVE--DEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMAN
        EV+  DEDDLDEDDI YFS KYKNFIK K   +     +K  K                          LK  KKSKKKAMKATWDDS ES  EVE+MA 
Subjt:  EVE--DEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMAN

Query:  LGLMAHSDKEDEHDDEVTLEPPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENML---------------------CLIK
        LGLMAH                               +KLSS+YVVLKKKYNVLTS+NKSLL K  CFKENEN++                      L K
Subjt:  LGLMAHSDKEDEHDDEVTLEPPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENML---------------------CLIK

Query:  LDFLS----MIDNLIKVLKENELNVLQDLDKAKETIKKLTIGAQRLDKIIEVGKSYG
        + FL       DNLIKVLKENELNVLQDL+KAKETI+KLTI A+RLDKII VGKSYG
Subjt:  LDFLS----MIDNLIKVLKENELNVLQDLDKAKETIKKLTIGAQRLDKIIEVGKSYG

A0A6J1F0H1 uncharacterized protein LOC111438099 | e value: 1.2e-107 | % identity: 68.79
Query:  MENLLV-NEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFN
        M NL V N   E QSTSRPPYFDG+NY  WK R+KIYLQS+DY LWL V+ G Y+P+K V+N++ PKLE ++ E++MKKCS NA AINCLYCALSNDEFN
Subjt:  MENLLV-NEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFN

Query:  RISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLT
        R+ M                     K++KISMLVHNYELFKM+ NE I DMFTRFTNI+NALK  GKVY+TSENVRKILRSLPK+WEAKVT IQEAKDLT
Subjt:  RISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLT

Query:  KLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPG
        KLPL+EL+GSLMTHEI    H+E+ESK KKSIAL +I ++ EDED LDEDD+ YF+ KYKNFIKRKK FKKH + QKESKGEKSK DEVICYECKKPG
Subjt:  KLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPG

A0A6J1I2X4 uncharacterized protein LOC111470465 | e value: 2.4e-87 | % identity: 65.64
Query:  MENLLV-NEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFN
        M NL V N   E QSTSRPPYFDG+NY  WK R+KIYLQS+D+ LWL V+ G Y+P+K V+N++ PKLE ++ E++MKKCS NA AINCLYCALSNDEFN
Subjt:  MENLLV-NEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFN

Query:  RISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLT
        R+ M                     K++KISMLVHNYELFKM+ NE I DMFTRFTNI+NALK  GKVY+TSENVRKILRSLPK+WEAKVT IQEAKDLT
Subjt:  RISM--------------------FKKSKISMLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLT

Query:  KLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKY
        KLPL+EL+GSLMTHEI    H+E+ESK KKSIAL +I ++ EDED LDEDD+ YF+ KY
Subjt:  KLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEVEDEDDLDEDDILYFSHKY

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAAACCTATTGGTAAATGAGATTATTGAAGACCAATCTACTTCTAGGCCTCCTTATTTTGATGGTTCAAATTATGCATATTGGAAAGGTAGAATAAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGCTCTTATGTACCCATGAAAAAGGTTGATAATGTTGATAAGCCTAAATTAGAAGAAGACTATGGTGAAA
ATGAAATGAAAAAGTGTTCTTTTAATGCTAAAGCTATTAATTGTTTATATTGTGCTTTGAGTAATGATGAATTTAATAGAATATCCATGTTTAAAAAGTCTAAAATTAGC
ATGCTTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTATCACCGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGTCATGGTAAAGT
CTATACAACTTCGGAAAATGTTAGAAAAATTCTAAGGTCTCTACCTAAGACTTGGGAAGCTAAGGTAACGACAATTCAAGAAGCAAAGGATCTCACTAAACTTCCACTAG
AGGAGCTTATTGGCTCACTCATGACCCATGAGATCATTACGAAGAAGCACTTAGAGGATGAGTCCAAAAATAAGAAAAGCATTGCATTAAATACCATCTCCTTGGAGGTT
GAAGATGAGGATGACCTTGATGAAGATGACATTCTTTATTTCTCACATAAGTACAAAAATTTCATAAAAAGGAAGAAATATTTCAAGAAACATCTATCAACCCAAAAAGA
GTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATGAATGCAAAAAGCCGGGTCACATAAGAACGGATTGCCCTCTTCTCAAATCATCTAAGAAATCCAAGA
AGAAGGCAATGAAGGCTACTTGGGATGATAGTAAGGAAAGTGAAAGTGAAGTTGAAGAAATGGCAAACCTCGGTCTCATGGCTCATAGTGACAAAGAAGACGAACATGAT
GATGAGGTAACTTTAGAACCTCCTATTGATAAATTGTTTGGAAATTTGGAAAGCATACAAAATTACCTAGAAAAACTTAGTTCTAAGTATGTTGTGCTTAAAAAGAAATA
CAATGTTTTAACTAGTAAAAATAAGTCTTTACTTGATAAAATTGATTGCTTTAAAGAGAATGAGAATATGCTTTGCTTGATAAAGTTAGATTTCTTGAGCATGATAGATA
ACTTGATTAAAGTGCTTAAAGAAAATGAACTAAATGTGTTACAAGATCTTGATAAAGCTAAAGAGACTATTAAAAAGTTGACAATAGGTGCTCAAAGATTGGACAAAATT
ATTGAAGTAGGAAAATCTTATGGTGATAAGAGAGCATCTCCTAATGTGCCTAAGCTTAATATGCCTAATGATGTGCCTAATCAGGTTAAATCTAGTTTTGTATCTATATG
TCATAATTGTTGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAAATGAAGTATGCTCACACTTCTTCTTCAAGAAGAAATATTTCACAAAGAGCAAAGCTTCATAATG
CTCCAAGGAAGAATTTCTCCAAGAAAAGTAGAGTACATAAATTTGTTGTGAAAAATAAATCCTTCCATAATGTTGTTTGTTTGTTTGAAAGCCTCCAAGAAAAACAAATG
ATATTTGGATACAGGTTGCTCAAGACACATGATGAGAGATCGATCCAAGTTATCTCTTTCTCCAAAAAGGAAAGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGTAA
AATAATTGTTTTTATTAGTTTTTCAAAAAGAGTTCAAAATGAAAAAGGATTTTTTATTTCTAAAATTAGGAGTGATCATGGAAGAGAATTTGATAATGATGCTTTTTAA
mRNA sequence
ATGGAAAACCTATTGGTAAATGAGATTATTGAAGACCAATCTACTTCTAGGCCTCCTTATTTTGATGGTTCAAATTATGCATATTGGAAAGGTAGAATAAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGCTCTTATGTACCCATGAAAAAGGTTGATAATGTTGATAAGCCTAAATTAGAAGAAGACTATGGTGAAA
ATGAAATGAAAAAGTGTTCTTTTAATGCTAAAGCTATTAATTGTTTATATTGTGCTTTGAGTAATGATGAATTTAATAGAATATCCATGTTTAAAAAGTCTAAAATTAGC
ATGCTTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTATCACCGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGTCATGGTAAAGT
CTATACAACTTCGGAAAATGTTAGAAAAATTCTAAGGTCTCTACCTAAGACTTGGGAAGCTAAGGTAACGACAATTCAAGAAGCAAAGGATCTCACTAAACTTCCACTAG
AGGAGCTTATTGGCTCACTCATGACCCATGAGATCATTACGAAGAAGCACTTAGAGGATGAGTCCAAAAATAAGAAAAGCATTGCATTAAATACCATCTCCTTGGAGGTT
GAAGATGAGGATGACCTTGATGAAGATGACATTCTTTATTTCTCACATAAGTACAAAAATTTCATAAAAAGGAAGAAATATTTCAAGAAACATCTATCAACCCAAAAAGA
GTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATGAATGCAAAAAGCCGGGTCACATAAGAACGGATTGCCCTCTTCTCAAATCATCTAAGAAATCCAAGA
AGAAGGCAATGAAGGCTACTTGGGATGATAGTAAGGAAAGTGAAAGTGAAGTTGAAGAAATGGCAAACCTCGGTCTCATGGCTCATAGTGACAAAGAAGACGAACATGAT
GATGAGGTAACTTTAGAACCTCCTATTGATAAATTGTTTGGAAATTTGGAAAGCATACAAAATTACCTAGAAAAACTTAGTTCTAAGTATGTTGTGCTTAAAAAGAAATA
CAATGTTTTAACTAGTAAAAATAAGTCTTTACTTGATAAAATTGATTGCTTTAAAGAGAATGAGAATATGCTTTGCTTGATAAAGTTAGATTTCTTGAGCATGATAGATA
ACTTGATTAAAGTGCTTAAAGAAAATGAACTAAATGTGTTACAAGATCTTGATAAAGCTAAAGAGACTATTAAAAAGTTGACAATAGGTGCTCAAAGATTGGACAAAATT
ATTGAAGTAGGAAAATCTTATGGTGATAAGAGAGCATCTCCTAATGTGCCTAAGCTTAATATGCCTAATGATGTGCCTAATCAGGTTAAATCTAGTTTTGTATCTATATG
TCATAATTGTTGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAAATGAAGTATGCTCACACTTCTTCTTCAAGAAGAAATATTTCACAAAGAGCAAAGCTTCATAATG
CTCCAAGGAAGAATTTCTCCAAGAAAAGTAGAGTACATAAATTTGTTGTGAAAAATAAATCCTTCCATAATGTTGTTTGTTTGTTTGAAAGCCTCCAAGAAAAACAAATG
ATATTTGGATACAGGTTGCTCAAGACACATGATGAGAGATCGATCCAAGTTATCTCTTTCTCCAAAAAGGAAAGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGTAA
AATAATTGTTTTTATTAGTTTTTCAAAAAGAGTTCAAAATGAAAAAGGATTTTTTATTTCTAAAATTAGGAGTGATCATGGAAGAGAATTTGATAATGATGCTTTTTAA
Protein sequence
MENLLVNEIIEDQSTSRPPYFDGSNYAYWKGRIKIYLQSIDYNLWLIVAKGSYVPMKKVDNVDKPKLEEDYGENEMKKCSFNAKAINCLYCALSNDEFNRISMFKKSKIS
MLVHNYELFKMDANETITDMFTRFTNIINALKGHGKVYTTSENVRKILRSLPKTWEAKVTTIQEAKDLTKLPLEELIGSLMTHEIITKKHLEDESKNKKSIALNTISLEV
EDEDDLDEDDILYFSHKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSKESESEVEEMANLGLMAHSDKEDEHD
DEVTLEPPIDKLFGNLESIQNYLEKLSSKYVVLKKKYNVLTSKNKSLLDKIDCFKENENMLCLIKLDFLSMIDNLIKVLKENELNVLQDLDKAKETIKKLTIGAQRLDKI
IEVGKSYGDKRASPNVPKLNMPNDVPNQVKSSFVSICHNCCVEGHIRPKCFKMKYAHTSSSRRNISQRAKLHNAPRKNFSKKSRVHKFVVKNKSFHNVVCLFESLQEKQM
IFGYRLLKTHDERSIQVISFSKKERGMVTFGDNKKGKIIVFISFSKRVQNEKGFFISKIRSDHGREFDNDAF
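
The CDS and protein sequences above are related by translation. The following is a minimal sketch, not part of the database record, of how that relationship could be checked in Python. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative only, and the example input is the first 27 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper: whitespace and line breaks are removed first;
    codons with characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS sequence listed above.
print(translate("ATGGAAAACCTATTGGTAAATGAGATT"))  # MENLLVNEI
```

Joining the CDS lines and passing the whole string to `translate` would be one way to compare the result with the protein sequence listed above.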