; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012641 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012641
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTransmembrane protein
Genome locationscaffold63:2256630..2258387
RNA-Seq ExpressionMS012641
SyntenyMS012641
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600645.1 hypothetical protein SDJN03_05878, partial [Cucurbita argyrosperma subsp. sororia]4.9e-11777.05Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL
        MSL SQNLF CS R KFC F NS  R NTSFSLP+AS     L+QFH   H LQ P+NL+S RS C YGIGVFESE +  S DR GDFNLES+L FSEL 
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL

Query:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL
         LF+SAVFLV FVVNFVGS+SKKALWVLIGDRGLVWGFPLLVATVVLN WIRRRQWRRVC + A G L+VNLLDRIEKLEEDLRSST  IRALSR+LEKL
Subjt:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL

Query:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL
        GIRF+VTRKT+RD IAE+AALAQRNS+DTRTLAVQED+LEKELLE+QKVLLAMQEQQ+KQ+ELIIAIGEK KL+  KQ  DQE TR +R  SANE+SKEL
Subjt:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL

Query:  EAYGI
        EAYGI
Subjt:  EAYGI

KAG7031283.1 hypothetical protein SDJN02_05323 [Cucurbita argyrosperma subsp. argyrosperma]2.4e-11676.72Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL
        MSL SQNLF CS R KFC F NS  R NTSFSLP+AS     L+QFH   H LQ P+NL+  RS C YGIGVFESE +  S DR GDFNLES+L FSEL 
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL

Query:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL
         LF+SAVFLV FVVNFVGS+SKKALWVLIGDRGLVWGFPLLVATVVLN WIRRRQWRRVC + A G L+VNLLDRIEKLEEDLRSST  IRALSR+LEKL
Subjt:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL

Query:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL
        GIRF+VTRKT+RD IAE+AALAQRNS+DTRTLAVQED+LEKELLE+QKVLLAMQEQQ+KQ+ELIIAIGEK KL+  KQ  DQE TR +R  SANE+SKEL
Subjt:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL

Query:  EAYGI
        EAYGI
Subjt:  EAYGI

XP_022136776.1 uncharacterized protein LOC111008398 isoform X1 [Momordica charantia]2.7e-16099.01Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
        MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLY IGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF

Query:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
        SSAVFLVVFVVNFVG NSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
Subjt:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR

Query:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAY
        FMVTRKTLRD IAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAY
Subjt:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAY

Query:  GI
        GI
Subjt:  GI

XP_022136778.1 uncharacterized protein LOC111008398 isoform X2 [Momordica charantia]4.9e-13398.8Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
        MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLY IGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF

Query:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
        SSAVFLVVFVVNFVG NSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
Subjt:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR

Query:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQ
        FMVTRKTLRD IAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQ
Subjt:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQ

XP_038899386.1 uncharacterized protein LOC120086693 [Benincasa hispida]6.8e-11977.85Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPC---LHQFHFQKHNLQIPH-NLTSR-RSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSE
        MSLPSQNLF CS RFKFCCF NS  RNN  FS P++SP    LHQFHF  H L   H N TSR RS C YGIGV+ESE VA    R GDFNLES+L  SE
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPC---LHQFHFQKHNLQIPH-NLTSR-RSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSE

Query:  LLCLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLE
        LL LFSSAVFLVVF VNFVGS+SKKALWVLIGDRGLVWGFPLLVATVVLN+WIRRRQWRR+CW    G L+VNLLDRIEKLEEDLRS    IRALSR+LE
Subjt:  LLCLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLE

Query:  KLGIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSK
        KLGIRF VTRKTL+DPIAETAALAQRNSE++RTLAVQED+LEKELLEMQKVLLAMQEQQ+KQ+ELI+AIGEK KLM  KQ LDQER+R ERH SANE+SK
Subjt:  KLGIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSK

Query:  ELEAYGI
        ELEAY I
Subjt:  ELEAYGI

TrEMBL top hitse value%identityAlignment
A0A5A7TN19 Uncharacterized protein1.8e-10471.01Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPH-NLTSR-RSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSE
        M LPSQNLF CS R KFC F NS  RNN  FS  ++S     L+QFHF  H L   H N TSR R  C  GIGV+ESE  A    R GDFNLES+L  SE
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPH-NLTSR-RSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSE

Query:  LLCLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLE
         + LFSSAVFLVVFV+NFVGS+SKK +W+L+ DRGLVWGFPLLVATVVLN+WIRR QWRR+CW    G L+VNLLDR EKLEEDLRS    IR LSR+LE
Subjt:  LLCLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLE

Query:  KLGIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSK
        KLGIR+ VTRKTL+DPIAETAALAQRNSED +TLAVQEDILEKELLEMQKVLLAMQEQQ+KQ+ELI+AIGEK KLM  KQ  DQERTRI+R  SANE+ K
Subjt:  KLGIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSK

Query:  ELEAYGI
        ELEAY I
Subjt:  ELEAYGI

A0A6J1C4G5 uncharacterized protein LOC111008398 isoform X11.3e-16099.01Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
        MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLY IGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF

Query:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
        SSAVFLVVFVVNFVG NSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
Subjt:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR

Query:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAY
        FMVTRKTLRD IAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAY
Subjt:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAY

Query:  GI
        GI
Subjt:  GI

A0A6J1C8H6 uncharacterized protein LOC111008398 isoform X22.4e-13398.8Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
        MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLY IGVFESEQVAGSHDRDGDFNLESILSFSELLCLF
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLF

Query:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
        SSAVFLVVFVVNFVG NSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR
Subjt:  SSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIR

Query:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQ
        FMVTRKTLRD IAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQ
Subjt:  FMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQ

A0A6J1FNU5 uncharacterized protein LOC1114468912.0e-11676.72Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL
        MSL SQNLF CS R KFC F NS  R NTSFSLP+AS     L+QFH   H LQ P+NL+S RS C YGIGV ESE +  S DR GDFNLES+L FSEL 
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL

Query:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL
         LF+SAVFLV FVVNFVGS+SKKALWVLIGDRGLVWGFPLLVATVVLN WIRRRQWRRVC + A G L+VNLLDRIEKLEEDLRSST  IRALSR+LEKL
Subjt:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL

Query:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL
        GIRF+VTRKT+RD IAE+AALAQRNS+DTRTLAVQED+LEKELLE+QKVLLAMQEQQ+KQ+ELIIAIGEK KL+  KQ  DQE TR +R  SANE+SKEL
Subjt:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL

Query:  EAYGI
        EAYGI
Subjt:  EAYGI

A0A6J1J2E7 uncharacterized protein LOC1114806961.6e-11375.74Show/hide
Query:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL
        MSL SQNLF CS R KFC F NS  R +TS SLP+AS     L+QFH   H LQ P+NL+S RS C YGIGVFESE +  S DR GDFNLES+L FSEL 
Subjt:  MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVAS---PCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELL

Query:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL
         LF+SAVFLV FVVNFVG  SKKALWVLIGDR LVWGFPLLVATVVLN WIRRRQWRRVC + A G L+VNLLDRIEKLEEDLRSST  IR LSR+LEKL
Subjt:  CLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKL

Query:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL
        GIRF+VTRKT+RD IAE+AALAQRNSEDTRTLAVQED+LEKELLE+QKVLLAMQEQQ+KQ+ELIIAIGEK KL+  KQ  DQE TR +R  SANE+SKEL
Subjt:  GIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKEL

Query:  EAYGI
        EAYGI
Subjt:  EAYGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G65250.1 unknown protein4.6e-4945.48Show/hide
Query:  MSLPSQ-NLFTCS-----GRFKFCCFANSGLRNNTSFSLPVASPC-LHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFE---SEQVAGSHDRDGDFNLESI
        +SLPS+  LF+ S      R    CFA S  R +   S+    P  L      + N +I  N  +  S+    IG FE   S  +      DG F+L S 
Subjt:  MSLPSQ-NLFTCS-----GRFKFCCFANSGLRNNTSFSLPVASPC-LHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFE---SEQVAGSHDRDGDFNLESI

Query:  LSFSELLCLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRAL
        +SF+E LC+ SSAV  VV  VN+V           IG + L  GF  LV +V   +W+RRRQW R+C K A+ +   NL+ R+EKLE+DL+SST+ +R L
Subjt:  LSFSELLCLFSSAVFLVVFVVNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRAL

Query:  SRQLEKLGIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKS
        SR LEKLGIRF VTRK L++PI+ETAALAQ+NSE TR L  Q++ILEKEL E+QKVLLAMQEQQRKQ+ELI+ I +  KL     +  Q  +   ++K+
Subjt:  SRQLEKLGIRFMVTRKTLRDPIAETAALAQRNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTGCCTTCTCAAAATCTCTTCACATGTTCCGGCCGTTTCAAGTTCTGTTGCTTCGCCAACTCCGGTCTCAGAAACAACACTTCCTTCTCTCTCCCGGTTGCATC
TCCATGCCTCCATCAGTTCCACTTTCAGAAGCATAACCTACAAATACCTCACAATTTAACCTCCCGCCGGAGTAATTGTCTGTATGGCATTGGAGTTTTTGAATCCGAAC
AAGTTGCGGGGAGTCACGACAGGGATGGCGATTTCAACCTGGAATCGATTCTTTCGTTCTCTGAATTGCTTTGTCTCTTTTCCTCTGCTGTTTTCTTGGTTGTTTTTGTC
GTGAATTTCGTTGGTTCTAATTCCAAGAAGGCGCTCTGGGTATTGATAGGAGATAGGGGTTTGGTTTGGGGCTTCCCTTTGCTGGTAGCTACTGTTGTTCTTAACGCGTG
GATTCGAAGGCGGCAGTGGAGACGAGTATGTTGGAAAACAGCGAAAGGTGCATTGGAGGTGAATTTGTTGGATAGGATTGAGAAACTGGAGGAGGATTTAAGGAGCTCGA
CGGCCACGATTCGGGCCTTGTCTAGGCAGCTTGAGAAGTTGGGCATAAGGTTTATGGTCACACGAAAAACTCTCAGAGATCCAATTGCTGAGACTGCAGCATTAGCTCAA
AGAAATTCTGAGGACACTCGAACGTTGGCTGTGCAAGAAGATATTCTTGAAAAGGAACTCCTTGAAATGCAAAAGGTCTTACTAGCCATGCAGGAGCAGCAGCGAAAGCA
GATTGAGCTGATTATTGCGATTGGAGAAAAGGGGAAGCTGATGGCAAGAAAACAGGCACTTGATCAAGAACGAACAAGGATTGAAAGACACAAGTCTGCCAATGAAGATT
CAAAAGAACTTGAAGCTTATGGAATC
mRNA sequenceShow/hide mRNA sequence
ATGTCATTGCCTTCTCAAAATCTCTTCACATGTTCCGGCCGTTTCAAGTTCTGTTGCTTCGCCAACTCCGGTCTCAGAAACAACACTTCCTTCTCTCTCCCGGTTGCATC
TCCATGCCTCCATCAGTTCCACTTTCAGAAGCATAACCTACAAATACCTCACAATTTAACCTCCCGCCGGAGTAATTGTCTGTATGGCATTGGAGTTTTTGAATCCGAAC
AAGTTGCGGGGAGTCACGACAGGGATGGCGATTTCAACCTGGAATCGATTCTTTCGTTCTCTGAATTGCTTTGTCTCTTTTCCTCTGCTGTTTTCTTGGTTGTTTTTGTC
GTGAATTTCGTTGGTTCTAATTCCAAGAAGGCGCTCTGGGTATTGATAGGAGATAGGGGTTTGGTTTGGGGCTTCCCTTTGCTGGTAGCTACTGTTGTTCTTAACGCGTG
GATTCGAAGGCGGCAGTGGAGACGAGTATGTTGGAAAACAGCGAAAGGTGCATTGGAGGTGAATTTGTTGGATAGGATTGAGAAACTGGAGGAGGATTTAAGGAGCTCGA
CGGCCACGATTCGGGCCTTGTCTAGGCAGCTTGAGAAGTTGGGCATAAGGTTTATGGTCACACGAAAAACTCTCAGAGATCCAATTGCTGAGACTGCAGCATTAGCTCAA
AGAAATTCTGAGGACACTCGAACGTTGGCTGTGCAAGAAGATATTCTTGAAAAGGAACTCCTTGAAATGCAAAAGGTCTTACTAGCCATGCAGGAGCAGCAGCGAAAGCA
GATTGAGCTGATTATTGCGATTGGAGAAAAGGGGAAGCTGATGGCAAGAAAACAGGCACTTGATCAAGAACGAACAAGGATTGAAAGACACAAGTCTGCCAATGAAGATT
CAAAAGAACTTGAAGCTTATGGAATC
Protein sequenceShow/hide protein sequence
MSLPSQNLFTCSGRFKFCCFANSGLRNNTSFSLPVASPCLHQFHFQKHNLQIPHNLTSRRSNCLYGIGVFESEQVAGSHDRDGDFNLESILSFSELLCLFSSAVFLVVFV
VNFVGSNSKKALWVLIGDRGLVWGFPLLVATVVLNAWIRRRQWRRVCWKTAKGALEVNLLDRIEKLEEDLRSSTATIRALSRQLEKLGIRFMVTRKTLRDPIAETAALAQ
RNSEDTRTLAVQEDILEKELLEMQKVLLAMQEQQRKQIELIIAIGEKGKLMARKQALDQERTRIERHKSANEDSKELEAYGI