CuGenDBv2

MS015873 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS015873
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: N-(5-amino-5-carboxypentanoyl)-L-cysteinyl-D-valine synthase
Genome location: scaffold943_2:582527..584782
RNA-Seq Expression: MS015873
Synteny: MS015873
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
                  IPR035979 - RNA-binding domain superfamily
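
The genome location field follows a `scaffold:start..end` pattern. A minimal sketch of how such a string could be split into its parts is shown here; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed location (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "scaffold943_2:582527..584782".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold943_2:582527..584782")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 2256 positions on the scaffold.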


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022143979.1 uncharacterized protein LOC111013764 [Momordica charantia] | e value: 2.8e-123 | % identity: 94.98
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MSNLSLLRKEFGSHFLSRAPRTS GFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKT           GLVYGKLYGITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
        GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_022963065.1 uncharacterized protein LOC111463377 [Cucurbita moschata] | e value: 2.7e-102 | % identity: 80.75
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MS L LLRK   SHF++ + R + G PVFFR+SP  R FSTE EQPP E PAD FLDTSKT          +GLVYGKL GITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSRQAYDNA RVIGRKGRLYRLERADRSQWD+LSPY+GKTVLLQG+PRNA+ EDVERFL GC+YDATSINMFFRAS P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMR+ATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_022972751.1 uncharacterized protein LOC111471264 [Cucurbita maxima] | e value: 1.1e-106 | % identity: 83.26
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MS L LLRK  GSHF++ + R + G PVFFR+SP  R FSTE EQPP EPPADSFLDTSKT          +GLVYGKLYGITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSRQAYDNA RVIGRKGRLYRLERADRSQWD+LSPYNGKTVLLQG+PRNA+ EDVERFL GC+YDATSINMFFRAS P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_023518188.1 uncharacterized protein LOC111781730 [Cucurbita pepo subsp. pepo] | e value: 1.5e-105 | % identity: 82.43
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MS L LLRK   SHF++ + R + G PVFFR+SP  R FSTE EQPPSEPPAD FLDTSKT          +GLVYGKLYGITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSRQAYDNA RVIGRKGRLYRLERADRSQWD+LSPYNGKTVLLQG+PRNA+ EDVERFL GC+YDATSINMFFRAS P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMR+ATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_023544682.1 uncharacterized protein LOC111804194 [Cucurbita pepo subsp. pepo] | e value: 1.7e-101 | % identity: 79.5
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        M+NL+LLRK  GSHF+SR+   + G PVFFR+S   R FSTE EQ P E  ADSFLDTS            +GLVYGKLYGITRN LKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSR+AYDNA RVIGR+GR+YRLERADRSQWDLLSPYNGKTVLLQG+PRNA  +DVERFLSGC+YDATSINMFFRAS+P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EP+RMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
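
The % identity values in the hit lines can be related to the alignment midlines, the unlabeled line between each Query and Subjt row. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes identity is reported relative to the aligned length; the function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Estimate % identity from a BLAST-style midline.

    Assumptions: letters in the midline mark identical positions,
    while '+' and spaces mark non-identical positions.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Toy midline: 3 identical positions out of 5 aligned columns.
print(percent_identity("AC+ G", 5))

# 227 identical positions over 239 aligned columns.
print(percent_identity("A" * 227 + " " * 12, 239))
```

The second call gives 94.98, which is consistent with the top GenBank hit if its alignment covers all 239 residues of the query with 12 non-identical positions.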

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1CQY2 uncharacterized protein LOC111013764 | e value: 1.3e-123 | % identity: 94.98
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MSNLSLLRKEFGSHFLSRAPRTS GFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKT           GLVYGKLYGITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
        GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1GDZ1 uncharacterized protein LOC111453126 | e value: 4.6e-100 | % identity: 78.24
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        M+NL+LLRK  GSHF SR+   + G PVFFR+S   R FSTE EQ P E  A+SFLDTS+T          +GLVYGKLYGITRN LKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPS +AYDNA RVIGR+GR+YRLERADRSQWDLLSPYNGK +LLQG+PRNA  +DVERFLSGC+YDATSINMFFRAS+P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EP+RMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1HGY5 uncharacterized protein LOC111463377 | e value: 1.3e-102 | % identity: 80.75
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MS L LLRK   SHF++ + R + G PVFFR+SP  R FSTE EQPP E PAD FLDTSKT          +GLVYGKL GITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSRQAYDNA RVIGRKGRLYRLERADRSQWD+LSPY+GKTVLLQG+PRNA+ EDVERFL GC+YDATSINMFFRAS P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMR+ATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1I6U3 uncharacterized protein LOC111471264 | e value: 5.1e-107 | % identity: 83.26
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        MS L LLRK  GSHF++ + R + G PVFFR+SP  R FSTE EQPP EPPADSFLDTSKT          +GLVYGKLYGITRNTLKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSRQAYDNA RVIGRKGRLYRLERADRSQWD+LSPYNGKTVLLQG+PRNA+ EDVERFL GC+YDATSINMFFRAS P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1INA4 uncharacterized protein LOC111477976 | e value: 3.9e-99 | % identity: 77.41
Query:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL
        M+NL+LLRK  GSHF+S +   + G PVFFR+S   R FS E EQ P E  A+SFLDT +T          +GLVYGKLYGITRN LKTDIVNLLEGCNL
Subjt:  MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNL

Query:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP
         LDDVKV+YNRSFTPTSMMMQFPSR++YDNA RVIGR+GR+YRLERADRSQWDLLSPYNGKTVLLQG+PRNA  +DVERFLSGC+YDAT INMFFRAS+P
Subjt:  GLDDVKVDYNRSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVP

Query:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        EP+RMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  EPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT5G02740.1 Ribosomal protein S24e family protein | e value: 1.6e-44 | % identity: 45.85
Query:  RLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNLGLDDVKVDYNR--SFTPTSMMMQFPSRQAYDNAIRV
        +  ST  EQPP   P       S             G  YGK  G +++ LKTDI+N+LEGC+L  DD+K +Y R  + TP ++ +QFPS  AYD A+R 
Subjt:  RLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNLGLDDVKVDYNR--SFTPTSMMMQFPSRQAYDNAIRV

Query:  IGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVPEPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQIL
        I +KG+LYRLE+A R+QWD + PY GK V L G+P NA+ +D++RFLSGC Y   SI       +    R+A V F S TQAM+A++TKNR F LN +I 
Subjt:  IGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVPEPMRMATVLFPSPTQAMHAFLTKNRGFCLNNQIL

Query:  MRVLQ
        ++VLQ
Subjt:  MRVLQ

AT5G02740.2 Ribosomal protein S24e family protein | e value: 9.7e-34 | % identity: 46.5
Query:  RLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNLGLDDVKVDYNR--SFTPTSMMMQFPSRQAYDNAIRV
        +  ST  EQPP   P       S             G  YGK  G +++ LKTDI+N+LEGC+L  DD+K +Y R  + TP ++ +QFPS  AYD A+R 
Subjt:  RLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNLGLDDVKVDYNR--SFTPTSMMMQFPSRQAYDNAIRV

Query:  IGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSI
        I +KG+LYRLE+A R+QWD + PY GK V L G+P NA+ +D++RFLSGC Y   SI
Subjt:  IGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSI


Sequences
CDS sequence
ATGTCGAATCTGAGTCTCCTCCGGAAGGAATTTGGATCCCACTTCCTTTCTCGTGCGCCAAGGACAAGCCTAGGCTTCCCTGTCTTCTTCCGGGAATCTCCTACAATTAG
ATTGTTCTCGACGGAAGCGGAACAACCACCTTCGGAACCGCCCGCCGATTCTTTCCTTGATACATCGAAAACAGGCATGAATCTCTTTTTTGTTATACTCTTCTCAGGTT
TGGTTTATGGAAAATTGTATGGAATTACAAGGAATACACTAAAGACGGACATTGTCAATTTGCTTGAAGGATGTAATTTGGGTTTGGATGATGTCAAAGTTGATTACAAT
CGGAGTTTCACACCCACCTCTATGATGATGCAGTTCCCCTCCCGACAGGCTTATGATAATGCTATTCGAGTGATTGGGAGAAAAGGTCGCTTGTACAGATTGGAGCGGGC
TGATCGTTCACAGTGGGACCTTCTTTCACCTTACAATGGAAAAACTGTCCTTCTGCAAGGACTTCCTCGAAATGCAATGCAAGAAGATGTAGAACGCTTCCTATCTGGCT
GTAACTACGATGCAACCTCAATCAATATGTTTTTCAGGGCATCAGTTCCAGAACCCATGAGAATGGCAACAGTGCTATTCCCTTCACCAACACAAGCAATGCATGCATTC
CTTACAAAGAACAGAGGCTTTTGTCTGAACAACCAAATTTTGATGCGGGTTCTCCAA
mRNA sequence
ATGTCGAATCTGAGTCTCCTCCGGAAGGAATTTGGATCCCACTTCCTTTCTCGTGCGCCAAGGACAAGCCTAGGCTTCCCTGTCTTCTTCCGGGAATCTCCTACAATTAG
ATTGTTCTCGACGGAAGCGGAACAACCACCTTCGGAACCGCCCGCCGATTCTTTCCTTGATACATCGAAAACAGGCATGAATCTCTTTTTTGTTATACTCTTCTCAGGTT
TGGTTTATGGAAAATTGTATGGAATTACAAGGAATACACTAAAGACGGACATTGTCAATTTGCTTGAAGGATGTAATTTGGGTTTGGATGATGTCAAAGTTGATTACAAT
CGGAGTTTCACACCCACCTCTATGATGATGCAGTTCCCCTCCCGACAGGCTTATGATAATGCTATTCGAGTGATTGGGAGAAAAGGTCGCTTGTACAGATTGGAGCGGGC
TGATCGTTCACAGTGGGACCTTCTTTCACCTTACAATGGAAAAACTGTCCTTCTGCAAGGACTTCCTCGAAATGCAATGCAAGAAGATGTAGAACGCTTCCTATCTGGCT
GTAACTACGATGCAACCTCAATCAATATGTTTTTCAGGGCATCAGTTCCAGAACCCATGAGAATGGCAACAGTGCTATTCCCTTCACCAACACAAGCAATGCATGCATTC
CTTACAAAGAACAGAGGCTTTTGTCTGAACAACCAAATTTTGATGCGGGTTCTCCAA
Protein sequence
MSNLSLLRKEFGSHFLSRAPRTSLGFPVFFRESPTIRLFSTEAEQPPSEPPADSFLDTSKTGMNLFFVILFSGLVYGKLYGITRNTLKTDIVNLLEGCNLGLDDVKVDYN
RSFTPTSMMMQFPSRQAYDNAIRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGLPRNAMQEDVERFLSGCNYDATSINMFFRASVPEPMRMATVLFPSPTQAMHAF
LTKNRGFCLNNQILMRVLQ
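
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code and a CDS that starts in frame; `translate` is a hypothetical helper, not a CuGenDB function, and only the first six codons of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence to one-letter amino acids.

    Whitespace and line breaks are ignored; stop codons appear as '*'.
    Assumption: the sequence contains only A, C, G and T.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 18 nucleotides of the CDS listed above.
print(translate("ATGTCGAATCTGAGTCTC"))
```

This prints `MSNLSL`, matching the first six residues of the protein sequence. Passing the full CDS, with the line breaks left in, should give the complete protein if the record is internally consistent.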