; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G030350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G030350
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionWIYLD domain-containing protein
Genome locationCmo_Chr04:21280346..21283011
RNA-Seq ExpressionCmoCh04G030350
SyntenyCmoCh04G030350
Gene Ontology termsGO:0034968 - histone lysine methylation (biological process)
GO:0016020 - membrane (cellular component)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602621.1 putative zinc metalloprotease EGY2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]5.4e-11699.52Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGR+ADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFAR
        PAQLPEEFAR
Subjt:  PAQLPEEFAR

KAG7033305.1 hypothetical protein SDJN02_07360 [Cucurbita argyrosperma subsp. argyrosperma]4.4e-11893.51Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSN   
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
                    VKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
        PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
Subjt:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES

XP_022953876.1 uncharacterized protein LOC111456280 isoform X1 [Cucurbita moschata]4.3e-129100Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
        PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
Subjt:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES

XP_022953890.1 uncharacterized protein LOC111456280 isoform X2 [Cucurbita moschata]4.1e-116100Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFAR
        PAQLPEEFAR
Subjt:  PAQLPEEFAR

XP_023531273.1 uncharacterized protein LOC111793562 isoform X1 [Cucurbita pepo subsp. pepo]2.1e-12899.57Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISS GDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
        PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
Subjt:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES

TrEMBL top hitse value%identityAlignment
A0A6J1BXH3 uncharacterized protein LOC111006514 isoform X21.8e-6465.09Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPR R +KRGNLRIDAALDAM PFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDT+L+K KDG    VHEE+ R    +ETS AGCS     
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGV-ISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHL
            SNP  E+TVK  + V IS Y DNE FRIT  +   DS+ RY + +D G G  D+HF RS  NQS  AAHTPKI RR PYHGWISS  D +EDLVHL
Subjt:  EASTSNPGAEITVKPTEGV-ISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHL

Query:  TPAQLPEEFARLLIPHAQRKRKNRWDVKPSES
         P     EFARLL+   QRKRK RWDVKP+ES
Subjt:  TPAQLPEEFARLLIPHAQRKRKNRWDVKPSES

A0A6J1BY42 uncharacterized protein LOC111006514 isoform X11.5e-6665.52Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPR R +KRGNLRIDAALDAM PFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDT+L+K KDG I +VHEE+ R    +ETS AGCS     
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGV-ISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHL
            SNP  E+TVK  + V IS Y DNE FRIT  +   DS+ RY + +D G G  D+HF RS  NQS  AAHTPKI RR PYHGWISS  D +EDLVHL
Subjt:  EASTSNPGAEITVKPTEGV-ISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHL

Query:  TPAQLPEEFARLLIPHAQRKRKNRWDVKPSES
         P     EFARLL+   QRKRK RWDVKP+ES
Subjt:  TPAQLPEEFARLLIPHAQRKRKNRWDVKPSES

A0A6J1GPA9 uncharacterized protein LOC111456280 isoform X12.1e-129100Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
        PAQLPEEFARLLIPHAQRKRKNRWDVKPSES
Subjt:  PAQLPEEFARLLIPHAQRKRKNRWDVKPSES

A0A6J1GQX1 uncharacterized protein LOC111456280 isoform X22.0e-116100Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFAR
        PAQLPEEFAR
Subjt:  PAQLPEEFAR

A0A6J1JR10 uncharacterized protein LOC1114875576.4e-11598.57Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN
        MAPRER KKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGR+ADLQETSEAGCSSNVVN
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVN

Query:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT
        EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAH PKIRRRKPYHGWISSGGDDREDLVHLT
Subjt:  EASTSNPGAEITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLT

Query:  PAQLPEEFAR
        PAQLPEEFAR
Subjt:  PAQLPEEFAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45248.2 Nucleolar histone methyltransferase-related protein1.1e-0543.06Show/hide
Query:  MAPRERIKKRGNL---RIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTL-LIDTLLEKR
        MAP E  K R N+   R DAA D M  FGF   ++  ++K++L VYG D+ W  IE+ +Y + LI  L  KR
Subjt:  MAPRERIKKRGNL---RIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTL-LIDTLLEKR

AT1G45248.3 Nucleolar histone methyltransferase-related protein1.1e-0543.06Show/hide
Query:  MAPRERIKKRGNL---RIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTL-LIDTLLEKR
        MAP E  K R N+   R DAA D M  FGF   ++  ++K++L VYG D+ W  IE+ +Y + LI  L  KR
Subjt:  MAPRERIKKRGNL---RIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTL-LIDTLLEKR

AT2G40020.1 Nucleolar histone methyltransferase-related protein1.2e-0724.89Show/hide
Query:  LRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIAD--LQETSEAGCSSNVVNEASTSNPGAE
        +R DAA D M  FGFH  ++ +++KELL VY  ++ W  IE+ SY  L+   LEK+++   +    +  ++ +   +E +E    + +  E        E
Subjt:  LRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIAD--LQETSEAGCSSNVVNEASTSNPGAE

Query:  ITV-------KPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHF------------IRSSMNQSLLAAHTPKIRRRKPYHGWISSGGD
          +       +  E  I+S   N+P    A +   ++   Y   +    GV + H               +   +       PK +  +P      S GD
Subjt:  ITV-------KPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHF------------IRSSMNQSLLAAHTPKIRRRKPYHGWISSGGD

Query:  DREDLVHLTPAQLPEEFARLLIP---HAQRKRKNRWD
        D ++++ LTP  L EE   LL       +RK++ RWD
Subjt:  DREDLVHLTPAQLPEEFARLLIP---HAQRKRKNRWD

AT2G40020.2 Nucleolar histone methyltransferase-related protein2.3e-1147.14Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKD
        MAPR R KK G +R DAA D M  FGFH  ++ +++KELL VY  ++ W  IE+ SY  L+   LEK+++
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKD

AT2G40020.3 Nucleolar histone methyltransferase-related protein1.7e-1126.91Show/hide
Query:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIAD--LQETSEAGCSSNV
        MAPR R KK G +R DAA D M  FGFH  ++ +++KELL VY  ++ W  IE+ SY  L+   LEK+++   +    +  ++ +   +E +E    + +
Subjt:  MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIAD--LQETSEAGCSSNV

Query:  VNEASTSNPGAEITV-------KPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHF------------IRSSMNQSLLAAHTPKIRRR
          E        E  +       +  E  I+S   N+P    A +   ++   Y   +    GV + H               +   +       PK +  
Subjt:  VNEASTSNPGAEITV-------KPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHF------------IRSSMNQSLLAAHTPKIRRR

Query:  KPYHGWISSGGDDREDLVHLTPAQLPEEFARLLIP---HAQRKRKNRWD
        +P      S GDD ++++ LTP  L EE   LL       +RK++ RWD
Subjt:  KPYHGWISSGGDDREDLVHLTPAQLPEEFARLLIP---HAQRKRKNRWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCAGGGAGCGAATTAAAAAGAGGGGGAACTTACGAATTGATGCTGCGCTGGATGCTATGAATCCTTTCGGATTTCATCCGAAGTTGGTTCGTGACACGGTCAA
GGAGCTCCTCAGTGTGTATGGAGGAGACGAAGGATGGGTATTCATTGAAGAAGGCTCTTATACTCTCTTGATTGATACCCTTCTTGAGAAACGGAAAGATGGTGCAATAG
AGAAGGTTCATGAAGAGGATGGAAGAATTGCAGATCTTCAGGAGACCTCTGAAGCGGGCTGTTCCTCGAATGTTGTTAATGAAGCTTCCACATCTAATCCTGGGGCTGAG
ATTACCGTGAAACCGACTGAGGGTGTAATCTCATCGTACGTGGACAATGAACCTTTCAGGATCACAGCCACAGTACCTGCAAATGATTCAGATGAAAGATACTGGAAGGA
GGAGGATATAGGTTCTGGTGTTGCTGATAACCATTTTATAAGGAGTTCTATGAACCAGTCTTTACTGGCGGCACATACCCCGAAAATAAGGAGGCGAAAACCTTATCACG
GGTGGATCTCGTCCGGTGGCGACGACAGGGAAGACTTGGTGCACTTGACACCAGCTCAATTGCCTGAGGAGTTTGCCAGGTTACTCATTCCTCATGCTCAAAGAAAAAGA
AAGAATCGTTGGGACGTGAAGCCTTCAGAATCATGA
mRNA sequenceShow/hide mRNA sequence
TGTTGCGCCAAAATCGTTCGTCTTCTTCCTGCATGCTCTTTCGTCTTCGTACCAGAAACCGCAAAATTTTCGACTGACATTCCTCTTTCTCTCTTATTTCTGAACAAAAT
CTCTCCCCATATCTCTTCATATTGATTCCCACTCCATCTAATTCTGCTCATCAAGATATGCTCCTTTTTCAAGCTTTCGATTCTCAGGTTTCTCGGTTACCCTAGCACCG
GAATTTTTTCTCAGTTGGAGCTTGTTCGACGTTTCTTTGTTGGGCTTCATAAATGGCTCCCAGGGAGCGAATTAAAAAGAGGGGGAACTTACGAATTGATGCTGCGCTGG
ATGCTATGAATCCTTTCGGATTTCATCCGAAGTTGGTTCGTGACACGGTCAAGGAGCTCCTCAGTGTGTATGGAGGAGACGAAGGATGGGTATTCATTGAAGAAGGCTCT
TATACTCTCTTGATTGATACCCTTCTTGAGAAACGGAAAGATGGTGCAATAGAGAAGGTTCATGAAGAGGATGGAAGAATTGCAGATCTTCAGGAGACCTCTGAAGCGGG
CTGTTCCTCGAATGTTGTTAATGAAGCTTCCACATCTAATCCTGGGGCTGAGATTACCGTGAAACCGACTGAGGGTGTAATCTCATCGTACGTGGACAATGAACCTTTCA
GGATCACAGCCACAGTACCTGCAAATGATTCAGATGAAAGATACTGGAAGGAGGAGGATATAGGTTCTGGTGTTGCTGATAACCATTTTATAAGGAGTTCTATGAACCAG
TCTTTACTGGCGGCACATACCCCGAAAATAAGGAGGCGAAAACCTTATCACGGGTGGATCTCGTCCGGTGGCGACGACAGGGAAGACTTGGTGCACTTGACACCAGCTCA
ATTGCCTGAGGAGTTTGCCAGGTTACTCATTCCTCATGCTCAAAGAAAAAGAAAGAATCGTTGGGACGTGAAGCCTTCAGAATCATGAGCTGCTTGTGATTAGTATAAAC
AGAAACAATCCCATTTGCTTGCGGTGTAAGGAAAAGAAATTTCACCCTCTCTTGGGTAACTTTGGAAGAAACGGTTAGGAATCTGACTTATAGTGTACTTTGTTCGAGGG
GAGGATTGTTGAGAATTATTGGGAGTGAGTCCTGATGGACAATATCATACCATTGTGGAGACCCGCGATTCCTAACGTTTCCCAAG
Protein sequenceShow/hide protein sequence
MAPRERIKKRGNLRIDAALDAMNPFGFHPKLVRDTVKELLSVYGGDEGWVFIEEGSYTLLIDTLLEKRKDGAIEKVHEEDGRIADLQETSEAGCSSNVVNEASTSNPGAE
ITVKPTEGVISSYVDNEPFRITATVPANDSDERYWKEEDIGSGVADNHFIRSSMNQSLLAAHTPKIRRRKPYHGWISSGGDDREDLVHLTPAQLPEEFARLLIPHAQRKR
KNRWDVKPSES