; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G13590 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G13590
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationChr5:13395411..13396136
RNA-Seq ExpressionCSPI05G13590
SyntenyCSPI05G13590
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648356.1 hypothetical protein Csa_023118, partial [Cucumis sativus]2.0e-84100Show/hide
Query:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
        MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
Subjt:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPN
        LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPN
Subjt:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPN

XP_008463189.1 PREDICTED: uncharacterized protein LOC103501397 [Cucumis melo]6.7e-9381.08Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQ-LSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE
        LDLSLR PS +PP  P+Y Q  +STLP SQI NEES A  +PN ETSN+QQQ R   RRRRTRADMTRIEPPYPW+TD+RAVVHELKYLQ NNIM IKGE
Subjt:  LDLSLRPPSAQPPPAPEYPQ-LSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE

Query:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH
        VICKKCEMKYE+EYDLMNKVNEITRFFEEEIDSMHDRAP+CWT PNLPNC+ CNEEKCVMPV S+E D+KINWLFLFLGQFLGCL+L+QLK+FC Q+NIH
Subjt:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH

Query:  RTGAKNRLLYLSYRALFHQLQP
        RTGAKNRLLYLSYR LF QLQP
Subjt:  RTGAKNRLLYLSYRALFHQLQP

XP_011652763.1 uncharacterized protein LOC105435088 [Cucumis sativus]1.1e-5850.4Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPS------------SQIPN----------------EESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPY
        L LSL PP   PPP    P   S LPS            SQ P+                 E++   +P   T N+  +   R +R R +AD +RIEPPY
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPS------------SQIPN----------------EESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPY

Query:  PWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINW
        PW+T++ A++H+L+YLQ+NNI  IKGEV CK+C+ K EIEYDLM+K  E+ +F E E  +MHDRAPNCWT P L NC FCN+EKCV P+I   +D+KINW
Subjt:  PWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINW

Query:  LFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS
        LFL LG FLG L+L QLKHFC Q+ IHRTGAK+RL+Y +Y  L  QLQP+
Subjt:  LFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS

XP_011656206.1 uncharacterized protein LOC105435666 [Cucumis sativus]1.1e-135100Show/hide
Query:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
        MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
Subjt:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
        LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
Subjt:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK

Query:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
        QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
Subjt:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]8.5e-8070.18Show/hide
Query:  PPSAQPPPAP-EYPQLSST--LPSSQIPNEESNATPQPNIETSNDQQQ------HRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIK
        P  + PPP+P EYP LS+T  L   + PNE      Q N ETSN QQQ       + R RRRRTRADMTRIEPPYPW+TD+RAV+HELKYLQSNNI+ IK
Subjt:  PPSAQPPPAP-EYPQLSST--LPSSQIPNEESNATPQPNIETSNDQQQ------HRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIK

Query:  GEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSN
        GEV CKKCE KYE+EYDLMNK NEI RF E E DSMHDRAP CWTKP LPNCN CN+E+CV PVIS+ED +KINWLFL LG+FLGCL+LKQLK+FCAQ+N
Subjt:  GEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSN

Query:  IHRTGAKNRLLYLSYRALFHQLQPSPTL
        IHRTGAKNRLLYL Y  L +QLQPS  L
Subjt:  IHRTGAKNRLLYLSYRALFHQLQPSPTL

TrEMBL top hitse value%identityAlignment
A0A0A0KMQ2 Uncharacterized protein5.3e-136100Show/hide
Query:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
        MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY
Subjt:  MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
        LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK
Subjt:  LQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLK

Query:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
        QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN
Subjt:  QLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPSPTLNIN

A0A0A0LAK2 Uncharacterized protein5.2e-5950.4Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPS------------SQIPN----------------EESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPY
        L LSL PP   PPP    P   S LPS            SQ P+                 E++   +P   T N+  +   R +R R +AD +RIEPPY
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPS------------SQIPN----------------EESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPY

Query:  PWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINW
        PW+T++ A++H+L+YLQ+NNI  IKGEV CK+C+ K EIEYDLM+K  E+ +F E E  +MHDRAPNCWT P L NC FCN+EKCV P+I   +D+KINW
Subjt:  PWATDKRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINW

Query:  LFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS
        LFL LG FLG L+L QLKHFC Q+ IHRTGAK+RL+Y +Y  L  QLQP+
Subjt:  LFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRLLYLSYRALFHQLQPS

A0A1S3AZB1 protein PAF1 homolog2.6e-5856.19Show/hide
Query:  SPRLDLSLRPPSAQPPPAP-EYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI
        SP    S  P   Q PP P +Y Q   T  +   P E     P+P  +T N Q     + +RRRT+AD +RIEPPYPW+T+K AV+H+L+YL++NNI+ I
Subjt:  SPRLDLSLRPPSAQPPPAP-EYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI

Query:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQS
        KGEV CK+C+ K EIEY+L++K +EI RF E E D+MHDRAP+ W  P L NCNFCN+E+CV P+IS E +S INWLFL LG FLGCL+L QLK+FC Q+
Subjt:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQS

Query:  NIHRTGAKNRLLYLSYRALFHQLQPS
        NIHRTGAK+RL+YL+Y AL  QLQP+
Subjt:  NIHRTGAKNRLLYLSYRALFHQLQPS

A0A1S3CK70 uncharacterized protein LOC1035013973.3e-9381.08Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQ-LSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE
        LDLSLR PS +PP  P+Y Q  +STLP SQI NEES A  +PN ETSN+QQQ R   RRRRTRADMTRIEPPYPW+TD+RAVVHELKYLQ NNIM IKGE
Subjt:  LDLSLRPPSAQPPPAPEYPQ-LSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGE

Query:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH
        VICKKCEMKYE+EYDLMNKVNEITRFFEEEIDSMHDRAP+CWT PNLPNC+ CNEEKCVMPV S+E D+KINWLFLFLGQFLGCL+L+QLK+FC Q+NIH
Subjt:  VICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIH

Query:  RTGAKNRLLYLSYRALFHQLQP
        RTGAKNRLLYLSYR LF QLQP
Subjt:  RTGAKNRLLYLSYRALFHQLQP

A0A6J1GM83 mucin-16-like3.4e-5856.14Show/hide
Query:  SLRPPSAQPPPAPEYPQLSSTLPSS--QIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVI
        +L  P A P    E P  S+T+  +  Q PN +S   PQ     +N     R R RR RTRAD  RIEPPYPW+ ++RA +H L+YLQSNNI+ IKG+V 
Subjt:  SLRPPSAQPPPAPEYPQLSSTLPSS--QIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIKGEVI

Query:  CKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDD----SKINWLFLFLGQFLGCLRLKQLKHFCAQSN
        CKKCE  YEIEY+LMNK +EI RF E E D+MHDRAP CW  P LPNC  C EE CV P+I  E+D    S+INWLFL LGQ +G L+LKQLK+FCA + 
Subjt:  CKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDD----SKINWLFLFLGQFLGCLRLKQLKHFCAQSN

Query:  IHRTGAKNRLLYLSYRALFHQLQPSPTL
         HRTGAK+RL++L+Y AL  QLQPS  L
Subjt:  IHRTGAKNRLLYLSYRALFHQLQPSPTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein2.5e-3741.38Show/hide
Query:  PPSAQPP--------PAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTR----IEPPYPWATDKRAVVHELKYLQSNNI
        PPS Q P          P  PQL +   P S +    SN TP P       ++     +R  R+R+ +++    I PP+PWAT++R  +  L+YL+SN I
Subjt:  PPSAQPP--------PAPEYPQL-SSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTR----IEPPYPWATDKRAVVHELKYLQSNNI

Query:  MKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFC
          I GEV C+ CE  Y++ Y+L  +  E+ +F+  E   M DRA   W  P    C  C  EK V PVI+ E  S+INWLFL LGQ LG   L+QLK+FC
Subjt:  MKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFC

Query:  AQSNIHRTGAKNRLLYLSYRALFHQLQPSPTL
          S  HRTGAK+R+LYL+Y  L   LQP   L
Subjt:  AQSNIHRTGAKNRLLYLSYRALFHQLQPSPTL

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)3.0e-3035.93Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI
        L+ ++ PP+        Y      LP  Q+    + A   P        Q  R   R      R   D   I PPYPWAT K   +   + L SNNI  I
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI

Query:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQS
         G+V CK C+    +EY+L  K +E+  + +   + M  RAP  W+ P L  C  C  E  + PV+S E   +INWLFL LGQ LGC  L QL++FC  +
Subjt:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQS

Query:  NIHRTGAKNRLLYLSYRALFHQLQPSPTLNI
        + HRTG+K+R++Y++Y +L  QL P    N+
Subjt:  NIHRTGAKNRLLYLSYRALFHQLQPSPTLNI

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.4e-1935.23Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI
        L+ ++ PP+        Y      LP  Q+    + A   P        Q  R   R      R   D   I PPYPWAT K   +   + L SNNI  I
Subjt:  LDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRR----RRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKI

Query:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQL
         G+V CK C+    +EY+L  K +E+  + +   + M  RAP  W+ P L  C  C  E  + PV+S E   +INWLFL LGQ LGC  L QL
Subjt:  KGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCCGAAGATGGCAATAACGAAAGGAGCCCCCGTCTCGATCTTTCTCTCCGTCCGCCGTCTGCTCAACCTCCTCCAGCACCTGAATATCCCCAGCTGTCATCCAC
ATTGCCCTCGTCGCAAATTCCAAACGAAGAATCCAACGCAACCCCTCAACCCAACATTGAAACTTCAAATGACCAACAACAACACCGACGGAGACTGAGACGACGTAGAA
CGAGAGCAGACATGACAAGGATTGAGCCACCGTATCCATGGGCGACTGACAAGCGAGCGGTAGTCCACGAACTCAAGTACCTTCAATCGAACAACATAATGAAAATCAAG
GGGGAAGTGATATGCAAAAAATGCGAGATGAAGTATGAAATTGAGTATGATCTAATGAATAAGGTTAATGAAATAACAAGATTCTTTGAAGAAGAAATAGATAGTATGCA
TGATAGAGCTCCAAATTGTTGGACAAAACCCAATTTACCAAATTGTAATTTCTGCAATGAAGAAAAATGTGTAATGCCAGTGATATCTAAAGAAGATGATTCAAAGATCA
ATTGGTTGTTCTTGTTCTTGGGGCAATTTCTTGGATGTTTGAGGCTCAAGCAACTCAAACATTTTTGTGCTCAATCAAATATTCATAGAACTGGGGCCAAGAATCGTCTT
CTTTATCTCAGTTATCGTGCTTTGTTTCATCAACTCCAACCCTCCCCAACACTCAACATTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGCCGAAGATGGCAATAACGAAAGGAGCCCCCGTCTCGATCTTTCTCTCCGTCCGCCGTCTGCTCAACCTCCTCCAGCACCTGAATATCCCCAGCTGTCATCCAC
ATTGCCCTCGTCGCAAATTCCAAACGAAGAATCCAACGCAACCCCTCAACCCAACATTGAAACTTCAAATGACCAACAACAACACCGACGGAGACTGAGACGACGTAGAA
CGAGAGCAGACATGACAAGGATTGAGCCACCGTATCCATGGGCGACTGACAAGCGAGCGGTAGTCCACGAACTCAAGTACCTTCAATCGAACAACATAATGAAAATCAAG
GGGGAAGTGATATGCAAAAAATGCGAGATGAAGTATGAAATTGAGTATGATCTAATGAATAAGGTTAATGAAATAACAAGATTCTTTGAAGAAGAAATAGATAGTATGCA
TGATAGAGCTCCAAATTGTTGGACAAAACCCAATTTACCAAATTGTAATTTCTGCAATGAAGAAAAATGTGTAATGCCAGTGATATCTAAAGAAGATGATTCAAAGATCA
ATTGGTTGTTCTTGTTCTTGGGGCAATTTCTTGGATGTTTGAGGCTCAAGCAACTCAAACATTTTTGTGCTCAATCAAATATTCATAGAACTGGGGCCAAGAATCGTCTT
CTTTATCTCAGTTATCGTGCTTTGTTTCATCAACTCCAACCCTCCCCAACACTCAACATTAATTGA
Protein sequenceShow/hide protein sequence
MNAEDGNNERSPRLDLSLRPPSAQPPPAPEYPQLSSTLPSSQIPNEESNATPQPNIETSNDQQQHRRRLRRRRTRADMTRIEPPYPWATDKRAVVHELKYLQSNNIMKIK
GEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRAPNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSNIHRTGAKNRL
LYLSYRALFHQLQPSPTLNIN