CuGenDBv2

CmoCh10G003300 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh10G003300
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF789)
Genome location: Cmo_Chr10:1489137..1492985
RNA-Seq Expression: CmoCh10G003300
Synteny: CmoCh10G003300
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
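
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, the snippet below parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `GeneLocation` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class GeneLocation:
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Matches strings such as "Cmo_Chr10:1489137..1492985".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GeneLocation:
    """Parse a 'seqid:start..end' string into a GeneLocation."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return GeneLocation(match["seqid"], start, end)


loc = parse_location("Cmo_Chr10:1489137..1492985")
print(loc.seqid, loc.length)  # Cmo_Chr10 3849
```

Under that assumption the gene model spans 3849 bp on Cmo_Chr10, introns and UTRs included.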


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589615.1 hypothetical protein SDJN03_15038, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.3e-230; % identity: 99.5)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSV AQYFSKTTMRGWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTPIRGNGHGQAPAMIYPND DGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

XP_022134722.1 uncharacterized protein LOC111006925 [Momordica charantia] (e value: 9.7e-198; % identity: 85.79)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSE----VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP K+DETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSE----VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSS+DTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVH

Query:  HHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPT
          L CE+ + MR  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  HHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKEHQMANSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYW

Query:  R
        R
Subjt:  R

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata] (e value: 7.8e-232; % identity: 100)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

XP_022987436.1 uncharacterized protein LOC111484983 [Cucurbita maxima] (e value: 6.9e-228; % identity: 97.98)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK YNQQKPSRRPTKTDETETPSS+VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSS+DTSSEGSIDYEFGKSCNLSREQWVHHHLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        C+SA+T+RKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKE+QM NSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

XP_023516127.1 uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo] (e value: 3.3e-230; % identity: 99.24)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSS+VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSS+DTSSEGSIDYEFGKSCNLSREQWVHHHLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFAS+MTYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
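
In the alignments above, the unlabeled middle line marks each column: a letter means an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts those symbols for one alignment segment. It is illustrative only: the function name `midline_stats` is hypothetical, and the percentage it returns covers just the segment passed in, so it will not necessarily match the whole-alignment % identity reported for a hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarize one BLAST alignment segment from its query and midline rows."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    length = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())  # letters = identities
    positives = identical + midline.count("+")  # '+' = conservative change
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100.0 * identical / length, 2) if length else 0.0,
    }


# Excerpt from the Momordica charantia alignment (one V -> '+' column).
stats = midline_stats("IEFQPYFVLNDLWESFKEW", "IEFQPYF+LNDLWESFKEW")
print(stats["identical"], stats["length"], stats["percent_identity"])  # 18 19 94.74
```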

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BX10 uncharacterized protein LOC103494138 isoform X1 (e value: 1.0e-192; % identity: 83.88)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRPTKTDETE+ SS+VV  TT P + LTPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYF+LNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSS+DTSS+GSIDY+ GKS NLSREQW H HLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        CE+   MRKTSL DE    QEGF SDDGDA YPRSGLLFQFLEQDLPYQRVPLADKIF+LAYQFPGLKTLRSCDILPASW+SVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYKLKGSIW QN + +HQ ANSLMQAA+KWLR LQV+QPDFQFF+SH TYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

A0A5A7USF1 Uncharacterized protein (e value: 1.0e-192; % identity: 83.88)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRPTKTDETE+ SS+VV  TT P + LTPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYF+LNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSS+DTSS+GSIDY+ GKS NLSREQW H HLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        CE+   MRKTSL DE    QEGF SDDGDA YPRSGLLFQFLEQDLPYQRVPLADKIF+LAYQFPGLKTLRSCDILPASW+SVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYKLKGSIW QN + +HQ ANSLMQAA+KWLR LQV+QPDFQFF+SH TYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

A0A6J1C0E1 uncharacterized protein LOC111006925 (e value: 4.7e-198; % identity: 85.79)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSE----VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP K+DETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSE----VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSS+DTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVH

Query:  HHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPT
          L CE+ + MR  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  HHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKEHQMANSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYW

Query:  R
        R
Subjt:  R

A0A6J1E577 uncharacterized protein LOC111430050 (e value: 3.8e-232; % identity: 100)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

A0A6J1JE68 uncharacterized protein LOC111484983 (e value: 3.3e-228; % identity: 97.98)
Query:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK YNQQKPSRRPTKTDETETPSS+VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ

Query:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA
        PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSS+DTSSEGSIDYEFGKSCNLSREQWVHHHLA
Subjt:  PYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLA

Query:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
        C+SA+T+RKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL
Subjt:  CESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKE+QM NSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
Subjt:  DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value: 2.2e-91; % identity: 56.1)
Query:  SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ-PYFVLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLA
        S SN+ERFLD+  PSVPA Y SKT +R     D+E Q PYF+L D+WESF EWSAYG GVPL LN   D V QYYVP LSGIQ+Y +  AL S  ++R  
Subjt:  SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ-PYFVLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLA

Query:  NEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLA
         E+S+ D  +D+SSEGS   E  +    S+EQ          +  M K SLR EH   QE  SSDDG+    +  L+F++LE+DLPY R P ADK+ DLA
Subjt:  NEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLA

Query:  YQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKE
         +FP LKTLRSCD+LP+SW SVAWYPIY+IPTGPTLKDLDACFLTYHSL TP +G G     +M      + + K+ LPVFGLASYKL+GS+W       
Subjt:  YQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKE

Query:  HQMANSLMQAAEKWLRRLQVNQPDFQFF
        HQ+ANSL QAA+ WLR  QVN PDF FF
Subjt:  HQMANSLMQAAEKWLRRLQVNQPDFQFF

AT2G01260.1 Protein of unknown function (DUF789) (e value: 1.7e-91; % identity: 50.38)
Query:  MLGTALQF-GGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQ--SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD-
        MLG   Q   G  G+D FY   + R++ NQ+    R  ++D +  PSS    + +   + L P   S SNL+RFL++  PSVPAQ+ SKT +R  R  D 
Subjt:  MLGTALQF-GGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQ--SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD-

Query:  -IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQW
          +  PYFVL D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY  S AL S  KSR   + SD D  +D+SS+ S D         S  + 
Subjt:  -IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQW

Query:  VHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTG
        V   + C         SLRD+H   QE  SSDDG+    +  L+F++LE+DLPY R P ADK+ DLA QFP L TLRSCD+L +SW SVAWYPIYRIPTG
Subjt:  VHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTG

Query:  PTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFF
        PTLKDLDACFLTYHSL T   G G  Q+ ++  P +++   K+SLPVFGLASYK +GS+W      EHQ+ NSL QAA+KWL    V+ PDF FF
Subjt:  PTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFF

AT2G01260.2 Protein of unknown function (DUF789) (e value: 4.8e-70; % identity: 50.62)
Query:  MLGTALQF-GGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQ--SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD-
        MLG   Q   G  G+D FY   + R++ NQ+    R  ++D +  PSS    + +   + L P   S SNL+RFL++  PSVPAQ+ SKT +R  R  D 
Subjt:  MLGTALQF-GGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQ--SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCD-

Query:  -IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQW
          +  PYFVL D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY  S AL S  KSR   + SD D  +D+SS+ S D         S  + 
Subjt:  -IEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQW

Query:  VHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTG
        V   + C         SLRD+H   QE  SSDDG+    +  L+F++LE+DLPY R P ADK+ DLA QFP L TLRSCD+L +SW SVAWYPIYRIPTG
Subjt:  VHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTG

Query:  PTLKDLDACFLTYHSLSTPIRG
        PTLKDLDACFLTYHSL T   G
Subjt:  PTLKDLDACFLTYHSLSTPIRG

AT4G16100.1 Protein of unknown function (DUF789) (e value: 4.2e-82; % identity: 44.36)
Query:  IKGEDRFYIPVRARKSYNQQKPSR------------------RPTKTDETETPSSE-------VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFS
        I+GE+RFY P   RK   +++  R                  R  K +E E    E        V S  + +   T  + SNL RFLD T P V  Q+  
Subjt:  IKGEDRFYIPVRARKSYNQQKPSR------------------RPTKTDETETPSSE-------VVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFS

Query:  KTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGK
         T+ +GWRT + E++PYF+LNDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQ+Y + +  R+ +  R   E+SD DS +D SS+GS D     
Subjt:  KTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGK

Query:  SCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSG-LLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVA
         C              E +  + + SL ++      G SSD+ +A     G L+F++LE  +P+ R PL DKI +L+ QFP L+T RSCD+ P+SW+SVA
Subjt:  SCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSG-LLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVA

Query:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWA-QNCVKEHQMANSLMQAAEKWLRRLQV
        WYPIYRIP G +L++LDACFLT+HSLSTP RG  N  GQ+ +    +      K+ LP FGLASYK K S W+ ++ V E+Q   +L++ AE+WLRRL+V
Subjt:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWA-QNCVKEHQMANSLMQAAEKWLRRLQV

Query:  NQPDFQFFASHM-TYWR
          PDF+ F SH  + WR
Subjt:  NQPDFQFFASHM-TYWR

AT5G49220.1 Protein of unknown function (DUF789) (e value: 5.1e-72; % identity: 39.68)
Query:  GTALQFGGIKGEDRFYIPVRARKSYN----QQKPSRRPTKTDETET------PSSEVVASTTTPSKPLTPQSK---------------------------
        G ++    I+GE+RFY P   R+       QQ+   +  + DE E         +  VA  TT       +SK                           
Subjt:  GTALQFGGIKGEDRFYIPVRARKSYN----QQKPSRRPTKTDETET------PSSEVVASTTTPSKPLTPQSK---------------------------

Query:  -SNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSR
         SNL+RFL+ T P VPA+ F   +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++G DS VQYYVPYLSGIQ+Y        D   +
Subjt:  -SNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSR

Query:  LANEDSDLDSSKDTSSEG---SIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADK
          N   D + S + SS      +D   G+                     + + SL+D+  T     SS + +   P+  LLF++LE + P+ R PLA+K
Subjt:  LANEDSDLDSSKDTSSEG---SIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADK

Query:  IFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQ
        I DLA + P L T RSCD+LP+SW+SV+WYPIYRIP GPTL++LDACFLT+HSLST    +  G        +D+    K+ LP FGLASYKLK S+W Q
Subjt:  IFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQ

Query:  NCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
        N ++E Q   SL+QAA+KWL+RLQV+ PD++FF S+    R
Subjt:  NCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR


Sequences
CDS sequence
ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTGAGGGCACGGAAGAGTTATAATCAGCAAAAGCCATCGAGGAGACCTAC
CAAGACCGATGAAACTGAGACTCCATCGAGTGAAGTTGTGGCTTCTACTACAACGCCTTCTAAGCCACTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGACG
CCACAAAGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGAGGACTTGTGATATTGAATTTCAACCTTATTTCGTTCTGAATGATCTGTGGGAG
TCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTACTCAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATTCAAATATATGG
CGAATCTGCTGCATTGAGATCAGATTCTAAGTCCAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAAAGATACAAGCAGCGAGGGAAGCATTGACTACGAATTTG
GTAAAAGCTGTAACTTATCCAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAGCGCTATTACAATGAGAAAGACATCTTTAAGAGATGAACATAGCACAAGACAA
GAAGGTTTTTCGAGTGACGATGGGGATGCAGAATATCCTCGGAGTGGTTTGCTCTTTCAGTTTCTGGAGCAAGATCTTCCTTATCAACGTGTACCGTTGGCTGATAAAAT
ATTTGATCTTGCTTACCAATTTCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGATCTCTGTAGCATGGTACCCAATATACCGTATACCCACTG
GTCCCACATTGAAGGATTTGGATGCTTGCTTCCTAACATATCATTCCCTTTCCACACCCATAAGAGGTAATGGACATGGTCAGGCACCAGCAATGATATATCCAAATGAC
AACGATGGTATCCCAAAGGTCTCCTTGCCTGTCTTTGGATTGGCTTCTTATAAGCTGAAAGGCTCCATATGGGCGCAAAATTGTGTCAAAGAGCATCAAATGGCAAATTC
CCTCATGCAGGCGGCAGAGAAGTGGCTGAGGCGCCTTCAGGTCAATCAACCTGATTTTCAGTTCTTTGCATCACATATGACATACTGGAGATGA
mRNA sequence
TCTGGTTCAAGCTTCTTCCTCTCTCTTTCTCTCTTTCTTTCTGCGTTTTCCTTCTCGCCTGGTTTCTTAATTCACACAAAACGCATTTTGGTCATTCATTGCGTGCATTC
TTCAAATCTTTGCTGCGAAGTTTTTGAATTGATTTTCAGTTCTCTTCTTCATTTAATCGATTTTGAAACCCTCTGTGGTATTGATTTCTCTCTGATCTGATATTGTTTTG
CTAGCATCCTCCATTCAATTCAGTGTTCTTGAGAGATATTTCGAGTAATTTTTGGAAGAAGAAAGCTGAGTTTAAAGAATCTGCATACTATTTCTTTTAATCGCCATTTC
GTCGGCTTTATGCTACTTGGGAGCTTAAAGAAATCGATATTTCTTTTGATTCTGGTTGGTTGAGTTATTTTTTCTTCTTCTCAGTTTAGTGAACAAGGTAGAGAGCTTTG
GGGTTACCGATTATAATCTGTGTAATTTCCTCCGTTTTCCGTTGATACTCAACGGAGGATTCTTCTTTATTGTCGTTCTTTTGGTTTAGATATAGAAATGTTGGGAACTG
CGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTGAGGGCACGGAAGAGTTATAATCAGCAAAAGCCATCGAGGAGACCTACCAAGACCGATGAA
ACTGAGACTCCATCGAGTGAAGTTGTGGCTTCTACTACAACGCCTTCTAAGCCACTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGACGCCACAAAGCCTTC
AGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGAGGACTTGTGATATTGAATTTCAACCTTATTTCGTTCTGAATGATCTGTGGGAGTCTTTCAAGGAGT
GGAGTGCATACGGTGCTGGAGTTCCTTTAGTACTCAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATTCAAATATATGGCGAATCTGCTGCA
TTGAGATCAGATTCTAAGTCCAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAAAGATACAAGCAGCGAGGGAAGCATTGACTACGAATTTGGTAAAAGCTGTAA
CTTATCCAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAGCGCTATTACAATGAGAAAGACATCTTTAAGAGATGAACATAGCACAAGACAAGAAGGTTTTTCGA
GTGACGATGGGGATGCAGAATATCCTCGGAGTGGTTTGCTCTTTCAGTTTCTGGAGCAAGATCTTCCTTATCAACGTGTACCGTTGGCTGATAAAATATTTGATCTTGCT
TACCAATTTCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGATCTCTGTAGCATGGTACCCAATATACCGTATACCCACTGGTCCCACATTGAA
GGATTTGGATGCTTGCTTCCTAACATATCATTCCCTTTCCACACCCATAAGAGGTAATGGACATGGTCAGGCACCAGCAATGATATATCCAAATGACAACGATGGTATCC
CAAAGGTCTCCTTGCCTGTCTTTGGATTGGCTTCTTATAAGCTGAAAGGCTCCATATGGGCGCAAAATTGTGTCAAAGAGCATCAAATGGCAAATTCCCTCATGCAGGCG
GCAGAGAAGTGGCTGAGGCGCCTTCAGGTCAATCAACCTGATTTTCAGTTCTTTGCATCACATATGACATACTGGAGATGATAAGAAATGGCTCAAAACATGTCTCTACT
TCCATTCTGCCCACTCGAACTACCAAAATAACAAACTCGTGCTCTTATCGACTATTCGACCTGCACTCAATACCGGTAAGGATACATTTTGATTTTGGGTGGTTTGTTCT
TTTGGGGATTACATTTCATGGAGGCAGTCAAGAAAAATATTCACACCAACTGGCTAAAAACATGTGGAAAATGGGAGAGGGAGTAATAAAACTATGAAATGGGAGGGCAA
AAGCAATATGGAAAGTAATGGCAATCAAGATGAATGTTGCCTATAGCATTATACACTGCCCTTGGAAACTTGTGAGAGTCATCGTTTGTCGTCGTTGCATCGATATGGTC
TTTAGTATGGTCTTTTGTGAACTTGAAGGAAACTTGTAAAAGTCGTCGTTTGTCATTGTCGTTGCATCGATACGGTCTTCAGTGTGGTCTTTTGTGAACTTGAAGGAGTA
CAAATTTGTATGTATATTACGTACGATTTGATGTAAGGTATCTATTTTAGATTCACACAAAGTATAGATTAAAGATTATACTTCCAGCATCTTATGTTTCCGAATATGGG
CCATTCTGGATTGAACAAGGAACCATATTCTTCCCCATCCAACAATGGAGAATCAGTGTTCTTTTCCATTGGCAAACCAGATGCTTATTCTGAAAGGTTACTTTGTTTTT
GACATTCTCACTAGTTTAGACCCAACATTTTCTTGTCCAAAATGTATAGTCATATGGACACATAAGAAAGTTGAAGTGATATCGTTGTTGAGCACGTGGGTGAAGGAGTT
CTAGGGTCAGTGGTGACTCACTCACAACAAGAGAAGTAAGACTCGGATTACTTTGTTGCTTGAATATCTCAAGATAAGAATATTTGTTTGAGATTCGAATTATTGTACAA
ACAATATCGAATAATTCTAAACATGCAATCTAAACTACTTAC
Protein sequence
MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWE
SFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQ
EGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPND
NDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR
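
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under stated assumptions: the CDS is taken to be in frame from its first base, to use the standard nuclear code, and to contain only A, C, G and T. The helper names `translate` and `cds_matches_protein` are hypothetical. Only short excerpts of the CDS are used here; for a full check, join the sequence lines first.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Raises KeyError on ambiguous bases such as N.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def cds_matches_protein(cds: str, protein: str) -> bool:
    """True if the CDS encodes the protein followed by a single stop codon."""
    translated = translate(cds)
    return translated.endswith("*") and translated[:-1] == "".join(protein.split())


# First 27 and last 18 nucleotides of the CDS shown on this page.
print(translate("ATGTTGGGAACTGCGTTGCAGTTTGGG"))  # MLGTALQFG
print(translate("ATGACATACTGGAGATGA"))           # MTYWR*
```

Both excerpts agree with the start (MLGTALQFG) and end (MTYWR) of the protein sequence listed above.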