; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0018606 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0018606
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionDUF4228 domain-containing protein
Genome locationchr07:2929478..2930993
RNA-Seq ExpressionPay0018606
SyntenyPay0018606
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064682.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]7.5e-8897.79Show/hide
Query:  GHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNNLHLSHLPR
        GHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNNLHLSHLPR
Subjt:  GHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNNLHLSHLPR

Query:  NKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL
        NKGRRHRISPLFDLDSPNDQQ    HEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL
Subjt:  NKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL

XP_008453039.1 PREDICTED: uncharacterized protein LOC103493864 [Cucumis melo]9.4e-10798.12Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF
        LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQ    HEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF

Query:  KLRLSTIYEGTDL
        KLRLSTIYEGTDL
Subjt:  KLRLSTIYEGTDL

XP_011654294.1 uncharacterized protein LOC101220453 [Cucumis sativus]7.5e-9692.02Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MGGC SNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRL++DDFIPSLPLDHQLHPNQIYFILPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF
        LTAPDMAALAVKATLALQNASTNNL   HLP NKGRR RISPLFDLDSPNDQQ    +EHEHEHALS NSNSKNNT +SSSVKKLQRLTSRRAKMAVRSF
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF

Query:  KLRLSTIYEGTDL
        KLRLSTIYEGT L
Subjt:  KLRLSTIYEGTDL

XP_022981858.1 uncharacterized protein LOC111480876 [Cucurbita maxima]6.6e-6066.2Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MG CLS+CL  PK SS    PPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSS SDSFLCNSDRLY+DDFIP LPLD QL PNQIYF+LPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKN---NTASSSSVKKLQRLTSRRAKMAV
        L+A  MAALAVKA+LALQNAS         P ++ ++ R+SPL +L              + +H +S   + KN   +T++S SV+KLQRLTSRRAKMAV
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKN---NTASSSSVKKLQRLTSRRAKMAV

Query:  RSFKLRLSTIYEGTDL
        RSFKL+LSTIYEG  L
Subjt:  RSFKLRLSTIYEGTDL

XP_038896630.1 uncharacterized protein LOC120084892 [Benincasa hispida]4.0e-6572.09Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MG CLSNCLIIPK SS   PPPPPTAKVI+LQG LREYPVPISVSRVLQTE+SSSSTSDSFLCNSDRLY+DDFIP LPLDHQL PN+IYF+L SS LH R
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINS--NSKNNTASSSSVKKLQRLTSRRAKMAVR
        LTA DMAALAVKATLALQN STN+     L RNKG   RISP+      +  ++  +   + EHA SINS  NS + T++SSSV++LQRLTSRRAKMAVR
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINS--NSKNNTASSSSVKKLQRLTSRRAKMAVR

Query:  SFKLRLSTIYEGTDL
        SFKLRLSTIYEG  L
Subjt:  SFKLRLSTIYEGTDL

TrEMBL top hitse value%identityAlignment
A0A0A0L5Z9 Uncharacterized protein3.6e-9692.02Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MGGC SNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRL++DDFIPSLPLDHQLHPNQIYFILPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF
        LTAPDMAALAVKATLALQNASTNNL   HLP NKGRR RISPLFDLDSPNDQQ    +EHEHEHALS NSNSKNNT +SSSVKKLQRLTSRRAKMAVRSF
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF

Query:  KLRLSTIYEGTDL
        KLRLSTIYEGT L
Subjt:  KLRLSTIYEGTDL

A0A1S3BUP5 uncharacterized protein LOC1034938644.5e-10798.12Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF
        LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQ    HEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSF

Query:  KLRLSTIYEGTDL
        KLRLSTIYEGTDL
Subjt:  KLRLSTIYEGTDL

A0A5D3D8Q9 DUF4228 domain-containing protein3.6e-8897.79Show/hide
Query:  GHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNNLHLSHLPR
        GHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNNLHLSHLPR
Subjt:  GHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNNLHLSHLPR

Query:  NKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL
        NKGRRHRISPLFDLDSPNDQQ    HEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL
Subjt:  NKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL

A0A6J1FIQ7 uncharacterized protein LOC1114461017.1e-6065.74Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MG CLS+CL  PK SS    PPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSS SDSFLCNSDRLY+DDFIP LPLD QL PNQIYF+LPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKN---NTASSSSVKKLQRLTSRRAKMAV
        L+A  MAALAVKA+LALQNAS         P ++ ++ R+SPL +L              + +H +S   + KN   +T++S SV+KLQRLTS+RAKMAV
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKN---NTASSSSVKKLQRLTSRRAKMAV

Query:  RSFKLRLSTIYEGTDL
        RSFKL+LSTIYEG  L
Subjt:  RSFKLRLSTIYEGTDL

A0A6J1J381 uncharacterized protein LOC1114808763.2e-6066.2Show/hide
Query:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR
        MG CLS+CL  PK SS    PPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSS SDSFLCNSDRLY+DDFIP LPLD QL PNQIYF+LPSSNLHHR
Subjt:  MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHR

Query:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKN---NTASSSSVKKLQRLTSRRAKMAV
        L+A  MAALAVKA+LALQNAS         P ++ ++ R+SPL +L              + +H +S   + KN   +T++S SV+KLQRLTSRRAKMAV
Subjt:  LTAPDMAALAVKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKN---NTASSSSVKKLQRLTSRRAKMAV

Query:  RSFKLRLSTIYEGTDL
        RSFKL+LSTIYEG  L
Subjt:  RSFKLRLSTIYEGTDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21010.1 unknown protein4.4e-3043.5Show/hide
Query:  PTAKVISLQGHLREYPVPISVSRVLQTE-------NSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLA
        PT K++++ G LREY VP+  S+VL+ E       +SSS  S  F+C+SD LY+DDFIP++  +  L  +QIYF+LP S    RLTA DMAALAVKA++A
Subjt:  PTAKVISLQGHLREYPVPISVSRVLQTE-------NSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLA

Query:  LQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTAS-----SSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGT
        +QN+             + ++ RISP+  L   ND  + +  E   +      S +    AS     S SV+ L+R TS+RAK+AVRSF+L+LSTIYEG+
Subjt:  LQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTAS-----SSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGT

AT1G76600.1 unknown protein5.6e-3345.59Show/hide
Query:  TAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDS----FLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNA
        TAK++++ G LREY VP+  S+VL++E++SSS+S S    FLCNSD LY+DDFIP++  D  L  NQIYF+LP S   +RL+A DMAALAVKA++A++ A
Subjt:  TAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDS----FLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNA

Query:  STNNLHLSHLPRNKGRRH-RISPLFDLDSPNDQQHEHEHEHEHEHALSINS------------NSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTI
        +          +N+ RR  RISP+  L+  ND +    +      A ++                 N  + S SV+KL+R TS RAK+AVRSF+LRLSTI
Subjt:  STNNLHLSHLPRNKGRRH-RISPLFDLDSPNDQQHEHEHEHEHEHALSINS------------NSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTI

Query:  YEGT
        YEG+
Subjt:  YEGT

AT2G23690.1 unknown protein5.5e-1239.25Show/hide
Query:  VSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKA
        + SS       TAK+I   G + E+  P+ V  VLQ           F+CNSD + FD+ + ++  D +    Q+YF LP S+LHH L A +MAALAVKA
Subjt:  VSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKA

Query:  TLALQNA
        + AL  +
Subjt:  TLALQNA

AT3G50800.1 unknown protein2.9e-1341.58Show/hide
Query:  TAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNN
        TAK+I   G L+E+  P+ V ++LQ          SF+CNSD + FDD + ++P    L P ++YF+LP + L+H L A +MAALAVKA+ AL  +    
Subjt:  TAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLALQNASTNN

Query:  L
        L
Subjt:  L

AT5G66580.1 unknown protein1.1e-1243.01Show/hide
Query:  TAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLAL
        +AK+I L G L+E+  P+ V ++LQ          SF+CNSD + FDD + ++  + +L   Q+YF+LP + L+H L A +MAALAVKA+ AL
Subjt:  TAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALAVKATLAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGGGTGTTTGTCAAACTGCCTAATTATTCCAAAAGTCTCTTCGTCTGTTCCTCCACCTCCTCCTCCTACCGCCAAAGTTATCTCTTTACAAGGACATCTCCGGGA
ATACCCTGTTCCTATATCCGTCTCCCGTGTTCTTCAGACTGAAAATTCATCTTCTTCCACTTCCGACTCTTTTCTATGCAACTCCGACCGCTTATACTTTGATGATTTCA
TTCCGTCTTTGCCTCTCGACCACCAGCTCCACCCTAATCAGATCTATTTCATCCTTCCTTCCTCCAACCTCCACCACCGATTGACCGCCCCAGATATGGCTGCCTTAGCC
GTCAAAGCCACCCTCGCCCTCCAGAATGCCTCCACCAACAACCTCCATCTCTCCCATCTCCCTCGTAATAAGGGTCGTCGTCATCGTATTTCTCCCCTCTTTGATCTTGA
TAGCCCCAACGACCAACAACACGAACACGAACACGAACACGAACACGAACACGCCCTCTCCATTAACTCCAACTCCAAGAACAACACCGCCTCCTCCTCCTCCGTTAAAA
AATTGCAGAGATTGACATCCAGAAGGGCAAAAATGGCAGTTCGTTCCTTCAAACTCAGATTGAGCACCATCTATGAAGGCACCGATCTGTAG
mRNA sequenceShow/hide mRNA sequence
CTGACCGCAAGATCTCCTCCTTCCAGTTCCATTATCCCTCCTCATTCTTTCCCCAAATTAATTAAATTTGCCTGAAAAACGCGGATCGAGATACTCGGTCCCGGTCGTGA
TGGGCGGGTGTTTGTCAAACTGCCTAATTATTCCAAAAGTCTCTTCGTCTGTTCCTCCACCTCCTCCTCCTACCGCCAAAGTTATCTCTTTACAAGGACATCTCCGGGAA
TACCCTGTTCCTATATCCGTCTCCCGTGTTCTTCAGACTGAAAATTCATCTTCTTCCACTTCCGACTCTTTTCTATGCAACTCCGACCGCTTATACTTTGATGATTTCAT
TCCGTCTTTGCCTCTCGACCACCAGCTCCACCCTAATCAGATCTATTTCATCCTTCCTTCCTCCAACCTCCACCACCGATTGACCGCCCCAGATATGGCTGCCTTAGCCG
TCAAAGCCACCCTCGCCCTCCAGAATGCCTCCACCAACAACCTCCATCTCTCCCATCTCCCTCGTAATAAGGGTCGTCGTCATCGTATTTCTCCCCTCTTTGATCTTGAT
AGCCCCAACGACCAACAACACGAACACGAACACGAACACGAACACGAACACGCCCTCTCCATTAACTCCAACTCCAAGAACAACACCGCCTCCTCCTCCTCCGTTAAAAA
ATTGCAGAGATTGACATCCAGAAGGGCAAAAATGGCAGTTCGTTCCTTCAAACTCAGATTGAGCACCATCTATGAAGGCACCGATCTGTAGTCGTAGGGAATAGGGATTA
CTTCATCCACCGGTTTGCACGAGGACGGTTTCGGTAGGAGGGGTTTAAATTATCTTAACTCTTCCCAGTTCAATTCATGATCCATCTGATTCCAACTTCCCATTGATGAC
ATTGGTAACTGCTTTGTGCGTTCGGATATACACATAACATAAGTAGGGGCGCCGGATCTATTTAGGGTTCGAGGCACGTCCGTTCTACATTTGTATATATTACTGTTTAT
CATATACATCCATCAAATAACATACATGGAATGAACGGTTGAAGTTCTAGTCATGCCATTAAATTCAAAATACAATGTGAATGTGGCGAAGAATAATTCAGAATTTGAAG
TGGCGCAGCCAAAACTGACGGACCCCGAAGACCATGGCATCTGCCACTTTGTTTACCAGAACATACAATATCTCTTCTTTCTTCTCTTTTCTGTTCCGTTTGGTTCAACT
CGTTTCCAATTCCTAGTTGCATCTGTGTAATTTTTCAGATAATTACTACAGATTATTTATTTATTTATTATTTTTAATTAACACGGAAATGCAAATAAACTACTGTCCTT
ATAATTTTAATTATAGCTATCGATGAGTATATCATTTAATCAGCTTCAAAAACATGCTAAGAGGTTCCATTTGTTTATCCTTCTGTAATTCTCTATATTTTGTTGGTTTT
CAAAATTGAATTACAATTGTAAAACGTTCACACGCTTCTTCTGCGAGAGGAATTTCATTCCTTTAAATATGGCATATCTATGATAT
Protein sequenceShow/hide protein sequence
MGGCLSNCLIIPKVSSSVPPPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSTSDSFLCNSDRLYFDDFIPSLPLDHQLHPNQIYFILPSSNLHHRLTAPDMAALA
VKATLALQNASTNNLHLSHLPRNKGRRHRISPLFDLDSPNDQQHEHEHEHEHEHALSINSNSKNNTASSSSVKKLQRLTSRRAKMAVRSFKLRLSTIYEGTDL