; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G013530 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G013530
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationGy14Chr5:17527168..17527893
RNA-Seq ExpressionCsGy5G013530
SyntenyCsGy5G013530
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648356.1 hypothetical protein Csa_023118, partial [Cucumis sativus]9.15e-111100Show/hide
Query:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
        MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
Subjt:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPN
        LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPN
Subjt:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPN

KAE8650389.1 hypothetical protein Csa_011234 [Cucumis sativus]9.62e-7556.85Show/hide
Query:  PSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRF
        PS+Q    E++   +P   T N+  +   R +R R +AD +RIEPPYPW+T++ A++H+L+YLQ+NNI  IKGEV CK+C+ K EIEYDLM+K  E+ +F
Subjt:  PSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRF

Query:  FEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS
         E E  +MHDRAPNCWT P L NC FCN+EKCV P+I   +D+KINWLFL LG FLG L+L QLKHFC Q+ IHRTGAK+RL+Y +Y  L  QLQP+
Subjt:  FEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS

XP_008463189.1 PREDICTED: uncharacterized protein LOC103501397 [Cucumis melo]2.18e-12081.08Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE
        LDLSLR PS +PP  P+Y Q  +STLPS QI NEES A  +PN ETSN+QQQ RRR   RRTRADMTRIEPPYPW+TD+RAVVHELKYLQ NNIM IKGE
Subjt:  LDLSLRPPSAQPPPAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE

Query:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH
        VICKKCEMKYE+EYDLMNKVNEITRFFEEEIDSMHDRAP+CWT PNLPNC+ CNEEKCVMPV S+ED +KINWLFLFLGQFLGCL+L+QLK+FC Q+NIH
Subjt:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH

Query:  RTGAKNRLLYLSYRALFHQLQP
        RTGAKNRLLYLSYR LF QLQP
Subjt:  RTGAKNRLLYLSYRALFHQLQP

XP_011656206.1 uncharacterized protein LOC105435666 [Cucumis sativus]1.97e-176100Show/hide
Query:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
        MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
Subjt:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
        LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
Subjt:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK

Query:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
        QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
Subjt:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]2.11e-10265.87Show/hide
Query:  LDLSLR----PPSAQPPPAP---------------EYPQLSST--LPSSQIPNEESNATPQPNIETSNDQQQHRR------RLRRRRTRADMTRIEPPYP
        L+LSLR    PP   PPP P               EYP LS+T  L   + PNE      Q N ETSN QQQ         R RRRRTRADMTRIEPPYP
Subjt:  LDLSLR----PPSAQPPPAP---------------EYPQLSST--LPSSQIPNEESNATPQPNIETSNDQQQHRR------RLRRRRTRADMTRIEPPYP

Query:  WATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWL
        W+TD+RAV+HELKYLQSNNI+ IKGEV CKKCE KYE+EYDLMNK NEI RF E E DSMHDRAP CWTKP LPNCN CN+E+CV PVIS+ED +KINWL
Subjt:  WATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWL

Query:  FLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTL
        FL LG+FLGCL+LKQLK+FCAQ+NIHRTGAKNRLLYL Y  L +QLQPS  L
Subjt:  FLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTL

TrEMBL top hitse value%identityAlignment
A0A0A0KMQ2 Uncharacterized protein9.52e-177100Show/hide
Query:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
        MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
Subjt:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
        LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
Subjt:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK

Query:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
        QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
Subjt:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN

A0A0A0LAK2 Uncharacterized protein9.94e-7549.6Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQ--------IPNEESNATP--------------------QPNIETSNDQQQHRRRLRRRRTRADMTRIEPPY
        L LSL PP   PPP    P   S LPS+         +P++  +  P                    +P   T N+  +   R +R R +AD +RIEPPY
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQ--------IPNEESNATP--------------------QPNIETSNDQQQHRRRLRRRRTRADMTRIEPPY

Query:  PWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINW
        PW+T++ A++H+L+YLQ+NNI  IKGEV CK+C+ K EIEYDLM+K  E+ +F E E  +MHDRAPNCWT P L NC FCN+EKCV P+I   +D+KINW
Subjt:  PWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINW

Query:  LFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS
        LFL LG FLG L+L QLKHFC Q+ IHRTGAK+RL+Y +Y  L  QLQP+
Subjt:  LFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS

A0A1S3AZB1 protein PAF1 homolog5.19e-7461.96Show/hide
Query:  PQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAP
        P+P  +T N Q     + +RRRT+AD +RIEPPYPW+T+K AV+H+L+YL++NNI+ IKGEV CK+C+ K EIEY+L++K +EI RF E E D+MHDRAP
Subjt:  PQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAP

Query:  NCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS
        + W  P L NCNFCN+E+CV P+IS E +S INWLFL LG FLGCL+L QLK+FC Q+NIHRTGAK+RL+YL+Y AL  QLQP+
Subjt:  NCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS

A0A1S3CK70 uncharacterized protein LOC1035013971.05e-12081.08Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE
        LDLSLR PS +PP  P+Y Q  +STLPS QI NEES A  +PN ETSN+QQQ RRR   RRTRADMTRIEPPYPW+TD+RAVVHELKYLQ NNIM IKGE
Subjt:  LDLSLRPPSAQPPPAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE

Query:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH
        VICKKCEMKYE+EYDLMNKVNEITRFFEEEIDSMHDRAP+CWT PNLPNC+ CNEEKCVMPV S+ED +KINWLFLFLGQFLGCL+L+QLK+FC Q+NIH
Subjt:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH

Query:  RTGAKNRLLYLSYRALFHQLQP
        RTGAKNRLLYLSYR LF QLQP
Subjt:  RTGAKNRLLYLSYRALFHQLQP

A0A6J1GM83 mucin-16-like4.93e-7056.14Show/hide
Query:  SLRPPSAQPPPAPEYPQLSSTLPSS--QIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVI
        +L  P A P    E P  S+T+  +  Q PN+ S   PQ     +N     R R RR RTRAD  RIEPPYPW+ ++RA +H L+YLQSNNI+ IKG+V 
Subjt:  SLRPPSAQPPPAPEYPQLSSTLPSS--QIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVI

Query:  CKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDD----SKINWLFLFLGQFLGCLRLKQLKHFCAQSN
        CKKCE  YEIEY+LMNK +EI RF E E D+MHDRAP CW  P LPNC  C EE CV P+I  E+D    S+INWLFL LGQ +G L+LKQLK+FCA + 
Subjt:  CKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDD----SKINWLFLFLGQFLGCLRLKQLKHFCAQSN

Query:  IHRTGAKNRLLYLSYRALFHQLQPSPTL
         HRTGAK+RL++L+Y AL  QLQPS  L
Subjt:  IHRTGAKNRLLYLSYRALFHQLQPSPTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein2.5e-3741.38Show/hide
Query:  PPSAQPP--------PAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTR----IEPPYPWATDKRAVVHELKYLQSNNI
        PPS Q P          P  PQL +   P S +    SN TP P       ++     +R  R+R+ +++    I PP+PWAT++R  +  L+YL+SN I
Subjt:  PPSAQPP--------PAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTR----IEPPYPWATDKRAVVHELKYLQSNNI

Query:  MKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFC
          I GEV C+ CE  Y++ Y+L  +  E+ +F+  E   M DRA   W  P    C  C  EK V PVI+ E  S+INWLFL LGQ LG   L+QLK+FC
Subjt:  MKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFC

Query:  AQSNIHRTGAKNRLLYLSYRALFHQLQPSPTL
          S  HRTGAK+R+LYL+Y  L   LQP   L
Subjt:  AQSNIHRTGAKNRLLYLSYRALFHQLQPSPTL

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)3.0e-3035.93Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI
        L+ ++ PP+        Y      LP  Q+    + A   P        Q  R   R      R   D   I PPYPWAT K   +   + L SNNI  I
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI

Query:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQS
         G+V CK C+    +EY+L  K +E+  + +   + M  RAP  W+ P L  C  C  E  + PV+S E   +INWLFL LGQ LGC  L QL++FC  +
Subjt:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQS

Query:  NIHRTGAKNRLLYLSYRALFHQLQPSPTLNI
        + HRTG+K+R++Y++Y +L  QL P    N+
Subjt:  NIHRTGAKNRLLYLSYRALFHQLQPSPTLNI

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.4e-1935.23Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI
        L+ ++ PP+        Y      LP  Q+    + A   P        Q  R   R      R   D   I PPYPWAT K   +   + L SNNI  I
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI

Query:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQL
         G+V CK C+    +EY+L  K +E+  + +   + M  RAP  W+ P L  C  C  E  + PV+S E   +INWLFL LGQ LGC  L QL
Subjt:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCCGAAGATGGCAATAACGAAAGGAGCCCCCGTCTCGATCTTTCTCTCCGTCCGCCGTCTGCTCAACCTCCTCCAGCACCTGAATATCCCCAGCTGTCATCCAC
ATTGCCCTCGTCGCAAATTCCAAACGAAGAATCCAACGCAACCCCTCAACCCAACATTGAAACTTCAAATGACCAACAACAACACCGACGGAGACTGAGACGACGTAGAA
CGAGAGCAGACATGACAAGGATTGAGCCACCGTATCCATGGGCGACTGACAAACGAGCGGTAGTCCACGAACTCAAGTACCTTCAATCGAACAACATAATGAAAATCAAG
GGGGAAGTGATATGCAAAAAATGCGAGATGAAGTATGAAATTGAGTATGATCTAATGAATAAGGTTAATGAAATAACAAGATTCTTTGAAGAAGAAATAGATAGTATGCA
TGATAGAGCTCCAAATTGTTGGACAAAACCCAATTTACCAAATTGTAATTTCTGCAATGAAGAAAAATGTGTAATGCCAGTGATATCTAAAGAAGATGATTCAAAGATCA
ATTGGTTGTTCTTGTTCTTGGGGCAATTTCTTGGATGTTTGAGGCTCAAGCAACTCAAACATTTTTGTGCTCAATCAAATATTCATAGAACTGGGGCCAAGAATCGTCTT
CTTTATCTCAGTTATCGTGCTTTGTTTCATCAACTCCAACCCTCCCCAACACTCAACATTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGCCGAAGATGGCAATAACGAAAGGAGCCCCCGTCTCGATCTTTCTCTCCGTCCGCCGTCTGCTCAACCTCCTCCAGCACCTGAATATCCCCAGCTGTCATCCAC
ATTGCCCTCGTCGCAAATTCCAAACGAAGAATCCAACGCAACCCCTCAACCCAACATTGAAACTTCAAATGACCAACAACAACACCGACGGAGACTGAGACGACGTAGAA
CGAGAGCAGACATGACAAGGATTGAGCCACCGTATCCATGGGCGACTGACAAACGAGCGGTAGTCCACGAACTCAAGTACCTTCAATCGAACAACATAATGAAAATCAAG
GGGGAAGTGATATGCAAAAAATGCGAGATGAAGTATGAAATTGAGTATGATCTAATGAATAAGGTTAATGAAATAACAAGATTCTTTGAAGAAGAAATAGATAGTATGCA
TGATAGAGCTCCAAATTGTTGGACAAAACCCAATTTACCAAATTGTAATTTCTGCAATGAAGAAAAATGTGTAATGCCAGTGATATCTAAAGAAGATGATTCAAAGATCA
ATTGGTTGTTCTTGTTCTTGGGGCAATTTCTTGGATGTTTGAGGCTCAAGCAACTCAAACATTTTTGTGCTCAATCAAATATTCATAGAACTGGGGCCAAGAATCGTCTT
CTTTATCTCAGTTATCGTGCTTTGTTTCATCAACTCCAACCCTCCCCAACACTCAACATTAATTGA
Protein sequenceShow/hide protein sequence
MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIK
GEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRL
LYLSYRALFHQLQPSPTLNIN