; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0857 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0857
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionExtracellular ligand-binding receptor
Genome locationMC09:9602462..9612633
RNA-Seq ExpressionMC09g0857
SyntenyMC09g0857
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576012.1 hypothetical protein SDJN03_26651, partial [Cucurbita argyrosperma subsp. sororia]7.38e-7453.08Show/hide
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+  CVL+S +     PA+A L A Q LA A++ V + T+EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS FG L++K K+S  E S+++QVREII  +S KI+D  LETASS AG +F F   +I  LLN+P SA+G+LV  +K  L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GVASSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++I SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_011659765.2 uncharacterized protein LOC105436268 [Cucumis sativus]2.10e-7250Show/hide
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +S   STFLL LLL+I SLKI+        NGFNSL T    LF K+ PC+++SF   IKLPA+A LSAFQ L +A++++ + TIEMG GII+SFV
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
        + VLE + N VFGS +ES  S FGGL+E TK S+   ++ +QVR IIES  + ++    E A+SFAG MF+F    +  + NEP S IG LVE +K +L 
Subjt:  LMVLEFLKNAVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA

Query:  GS--SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
            S M+GV+ IV+  + KM+   S VA+SS  GLFE VK   +L V+SG +VGGL+EK +  LE+L ME LRGII +I+KI ++++ +YLFG
Subjt:  GS--SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_022156757.1 uncharacterized protein LOC111023598 [Momordica charantia]5.64e-18399.66Show/hide
Query:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
        MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
Subjt:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF

Query:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
        VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSM DQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
Subjt:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA

Query:  GSSAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        GSSAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
Subjt:  GSSAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_022954027.1 uncharacterized protein LOC111456411 [Cucurbita moschata]2.97e-7352.05Show/hide
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S +     PA+A L A Q LA A++ V + ++EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS+FGGL++K ++S  E S+++ VREII  +S KI+D  LETA+S AG +F F   +I  LLN+P SA+G+LV  +K  L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GVASSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++  SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_022991981.1 uncharacterized protein LOC111488468 [Cucurbita maxima]2.28e-7553.08Show/hide
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S +     PA+A L A Q LA A++ V + T+EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS FGGL++K K+S  E S+++QVREII  +S KI+D  LETA+S AG +F F   +I   LN+P SA+G+LV  +K +L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GVASSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++I SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

TrEMBL top hitse value%identityAlignment
A0A0A0K6V2 Uncharacterized protein9.04e-7150.18Show/hide
Query:  STFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFVLMVLEFLKN
        STFLL LLL+I SLKI+        NGFNSL T    LF K+ PC+++SF   IKLPA+A LSAFQ L +A++++ + TIEMG GII+SFV+ VLE + N
Subjt:  STFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFVLMVLEFLKN

Query:  AVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAGS--SAMDG
         VFGS +ES  S FGGL+E TK S+   ++ +QVR IIES  + ++    E A+SFAG MF+F    +  + NEP S IG LVE +K +L     S M+G
Subjt:  AVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAGS--SAMDG

Query:  VREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        V+ IV+  + KM+   S VA+SS  GLFE VK   +L V+SG +VGGL+EK +  LE+L ME LRGII +I+KI ++++ +YLFG
Subjt:  VREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A5D3BCI5 Uncharacterized protein3.26e-4349.48Show/hide
Query:  VLEFLKNAVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAGS
        VLE + N VFGS +ES  S FGGL+E T   F   ++ +QVR IIES  K ++    E A+SFAG MF+F    I  + NEPSSA+G LV  +K++L   
Subjt:  VLEFLKNAVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAGS

Query:  --SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
          S+M+GVR IV++F+ KM+ A S V SSSA GLFE VK  ++L V+SG +VGGL+EK + +LE+L ME LR II +I+ I +++I +YL G
Subjt:  --SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A6J1DST0 uncharacterized protein LOC1110235982.73e-18399.66Show/hide
Query:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
        MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
Subjt:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF

Query:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
        VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSM DQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
Subjt:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA

Query:  GSSAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        GSSAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
Subjt:  GSSAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A6J1GPW1 uncharacterized protein LOC1114564111.44e-7352.05Show/hide
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S +     PA+A L A Q LA A++ V + ++EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS+FGGL++K ++S  E S+++ VREII  +S KI+D  LETA+S AG +F F   +I  LLN+P SA+G+LV  +K  L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GVASSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++  SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A6J1JSB1 uncharacterized protein LOC1114884681.10e-7553.08Show/hide
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S +     PA+A L A Q LA A++ V + T+EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKL---PAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS FGGL++K K+S  E S+++QVREII  +S KI+D  LETA+S AG +F F   +I   LN+P SA+G+LV  +K +L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GVASSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++I SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTGTTGTTCTTCTTCTCCCCTTTCTTCCAAGGCTTCAACATTTCTCCTCTACCTCCTCCTTGTAATCACATCTCTAAAAATCCTTAATCTCACTTTAAATCTCCT
TTCCAATGGCTTCAACAGCTTGCCCACTTTTATTTTCTCCCTATTCTTCAAATCCACACCTTGTGTTCTTATTTCCTTTATAAAGCTCCCTGCCCAAGCCCTCCTCAGCG
CATTTCAGCAGCTTGCACAAGCCATGAGGGCTGTTCTCATGTACACAATAGAGATGGGTTTGGGAATCATAACTTCTTTTGTGCTTATGGTGCTCGAGTTTCTCAAGAAT
GCCGTTTTCGGTTCGATTTTGGAGTCTGGTTCGATCTTTGGTGGCCTGGTGGAGAAGACAAAGTCGTCGTTTATGGAGAGTTCGATGATGGATCAAGTGCGGGAGATTAT
CGAGAGCATTTCGAAGAAGATCATCGACATGGCTCTGGAAACAGCAAGCTCTTTTGCAGGCAGCATGTTCGACTTTGTGAAGGACACCATCCTTGAGTTGTTGAATGAGC
CTAGTTCAGCCATCGGAGAGCTGGTAGAGAAGATGAAGGAGGCATTGGCAGGCAGCTCGGCCATGGACGGGGTGCGGGAGATTGTTCAAAACTTCTTAAGCAAGATGATC
GGTGCGGGATCGGGAGTGGCGAGTTCTTCGGCCAGTGGTTTGTTTGAATTTGTGAAGAATGCTTTGAGTTTGGCTGTTGAATCTGGTCTCACTGTTGGAGGCTTATTGGA
GAAGATGAAGGAATCTTTGGAGGTTCTAAGTATGGAAGGACTACGAGGGATAATTGAGAGTATTTCCAAGATAATTTTGGATGTCATTTTTAGCTATTTATTTGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATTTTTTTCTTTATTTAAAATTTCTTTTCTTTTAAACAAAATTCAACTTTACTTATTTTCTCCTTAAAGGAATTTAATTTAAATTTGTAGAATGAAATTTATTTACCAAA
ATATTTTACTTTATTTGAAAAAAAATTAAATCTTATTTTAAAAGGTTTGATTTTTTTTTTTTTTTTAAACTTGATTTAATTTACTTAAAATTCTTTTCCTTATTAAAAAG
AAATCTTCTTTCCATTTTTTTAAAACAACTTCATAATCCACTTTTTCTCATTTATGTTGTTCTAATGATGGAGTTCGTCGGAATCTGTTGTTATCGCCGCCGCCACTCAA
GATCTGATGTTGCTACTCGCAGCACAGGTTTGGAGGTTGCCGGAGTTCTCGTAGAAACGCGTTGGGATGACTGTTTGAACTGTCGAGTTCGGAGTGGGTCGGAATTTGCT
GCTGCATCAAAGTTCTTAGCTGCTGTCATTCATGTCGTTACTTGTGTTCATAGTTGTTGGGTGTCGTTGAAATTCGTAGCTGCACATGCCGGAGGTGTAGATCTGTTGTC
GGGAGCTGGTCGTCGGAAATTTGCTACTATAGACCTAAGTTCGTTGTGCTGTCACCAGGTTCTGCCGCTGTCGGATCTACTGCTGCCATTCGCGCCGCTGCCTGATCTAC
TGCCACCAGAGTGGGTCGCTTGAACTCTACTTTCGGGGTGGGTAAGGGAGAGTGAGAGTGAGAAGGATGAAGAGAAAAAGTGGTGGCTATGGAGGAAAAATGTGGAGTTC
TAGAGAGAGAAAGAATAATAAAGGGAAGAAAAAAGAGGCTCTAGGGTTTTGGATAGTTTTTTTTTTTTGACAAAAACGGAAATATCTTTATTAAAATAAAACACCACAAA
GGGCAAAATTAGACAGAGGGGAAAGACTCTCCTCCCTCAGCAACCACGAAGGGAAAACCCTAACCCAACACATCGATTCACGCCTCAACAACGCCGATTTCGCCACTGCA
TCTACTCGCCAATTCCGATCATGTTTGATTGCCTCTAAACAATCAGAGGATATTTCCAAAGATCCAATTTCTAAATTTGACGCCAAAACTAGGGCCTCCTTGATGACACA
CAACTCTGCGTGTAGCGGTGGCAATCCAGAAAGAAGAAATCTTACATTTGCTTGCCATAGCTTGCCCGAAGGTAAACAGAAAACAATCCCTAGGCCTGTTGTCAACCCAT
CTCGAGAACAGGCAGCGTTGGTAGAGATTCTCATGCATCCCGACAACTCCCTTGACTCTGTCGCCATCTCTGTAATGGGAGCCCCCATCGCCTGAGTTGAGCGCCTCCTT
GCATCTTACGATAGCCCAGCACACAACTAACGCCCATGCCAACTCACCCGCAGACAAAAGCTCACACAAGCGCCACCACGAGTCCTCGAACTTTGGAAAAAATATCGACT
GTAATTGAACAGTCGGCAGGCTCCGCAACTAAATCTCCTTGGCCCTTGAGCATTTAAAGAACAAGTAGGCAACATCCTCTCTGGACGAATCACACACTGGACAAAATTGG
GTAACATCCATACCTTTACTAGATAAATTATCTTTTGTAGGGATTATTTCATGTATAGCTCTCCATATTAAATGTTTTACTTTAGGAGGAATATTTAAATTCCACATAAT
CTTCCAAAATTTCTTATTTGCATCTGGGCTAGATGAGGACTGATGTAGCTTAGACTTTATAAACATCTTGTACCCACTCTTCACAGAATATCTTCCTTTCTTATCATAGT
GTCAAATCCAATTATCTTGGATATGGGATAGGCTGATAGGAATTCCAAGTATACATTGGACATCTTCTGGTCTGAACATAGCATTTAGCTTATCAACATCCCATTCCCTA
CCATGCTTGATAAACTCCGACACTCTAACGTTCATATTAGCCCCAGGGGGTGTAATTGGTTTGAACATTGATTCATGAGGTAACCATTTATCTTTGAAAACCTTTATTTT
CTCCCCATTACCAACTCGATACCGTAACCCCTGTACCAAAAGCTCACGGCCCCATAGACAACTTCTCCAAAAATACGATGAATTCCTTCCTCTACCTGCTTCTAGAATCG
ATGATTCAGGATAGTATCGAGCCTTGATAATCTTTGACACCAGCAAATCAGGAAATATAATCATGCGCCAAACTTGCTTTGCTAGTAAAGCTTTATTGAAGCCTTCCAAA
TCTCTAAAATTAAGCCCCCCCAATTCCTTCGGATAACACATCAAATCTCATCTCTTCCAGTGAATCTTCCTTTTCTCCTCTGATGTACCCCACCAAAAGCGGGCAAACAT
TTTTGATAAATCATCACAACAAACTTTTGGAATCCTGAATACACTTATCGCTTAAGTTGGTATAGCTTGCCCTACACTTTTAATTAGGATCTCCTTACCTGCCACTGAAA
AGAACTTCCTCTGCCAACCCTGCAAAGCTCTCCACACCTTGTCTTTTAAAAATTTCCAATCATCTGAGCGTTTTCTGGTAAAAGCTGAAGGCAAACCTAAATAATTCCCA
AGGTTATGCACCGGCCTCACTCCAAGAAGGGATTGAAGATAATGTCCCCTATCCGCCACAATATTAGGTGAAAAGCACACTACAGATTTGTCAAAATTTACACTTTGACC
TGAAGTGCGCTCATAAAGCCCTAAAATTCTCTTAAAAGCACAACACTCAGCTGCATCCGCCCGAAGAAAAACAAGACTGTCATCAGCAAAAAATAGGTGAGATACCATTG
GGCCTTGATCAGACAAAGCCAACCCCAATACCTGATTCTTGAGAGATGCATTCGTAATCAACGAGGACAGACCTTCCGTACATAGGAGAAATAAATATGGGGAGATAGGA
TCCCCCTGCCGAAGTCCTCTAGTAGGTATTATCTTCCCTAATGTTGTCACATTAATGTTAACTGCAAAAGATAGCGTAGATATGCAGTGCATAATACGTCCAACCCACTA
AGTAGCAAAACCCAACCTCAACATAACTTTTTCTAAGAAAATCCACCCCGCTCGATCATAAGCTTTGCTCATATCGAGTTTTAGTGCAGCGAAACCCTCTCTACCCAATT
TCTAATCCTTCAAAACATGAAGACACTCATACCCAATGATTATATTGTCAGAGATTGATCTACCTGGAATAACGTCGACTGAGATTCAGAAATCACTACATCTAGAACTT
CTTTCAGCCGATTAGCCAACACTTTTGTCCCAATTTTATAGGATACATTACACAAGCTAATAGGACGAAAATCAGTTACTCTACTAGGATCCTTACACTTCGGGATAAGG
ACTATCGTCGTCTTATTCCAATCTTTAATACTGCTTCTTTCATTAAGGATCTTAAGAACACACGCCACCGTCTTTGGTCCCACCACAGACCAATATCTCTGATAGAATAA
AGCTGGAAACTCGTCCGGCCCTGGCGCCTTTGTTGGAAACATCTCCCTTACTACCCGATAAACCTCTCCTTCAACAAAAGGAGCTAGCAGCTTTGCATTCATATCCGTTG
AAACCCTAGATTGTATAGTTTCCATCACTTTATCAATATATGCTAGATCCAGCCTAACCGACTGAAAAATAGAGTTAAAATAAGAGATTAAAGTGTCTCCTATTTCATTT
TCATCATTCGTCCAGCTTCCCGAAAAGGTGGTAATACCACCAATATACATCTTATTCCTCAATGAAGCTTGTTTATGGAACCATTTGGTGTTCTTATCCCCTCATTTCAG
CCAATTCTCCCTTGATCGTTGTTTCCAATAAATTTCTTCATCTTCCAACAACTTATCTAACTCTGTTTTAATGGCGTTGATCTTTGAGAAGTCTAAAGGGAACTGTCTCC
TGTACTCCTCATCAAGCACAACTTTAAGTTTCTCAATTTCACTGAACAATCTTTTCCTTTGCTTACCCCCTTACGGTCTTAATGCCAACTTTGATCCTTTTAGATGTGCT
CCCAACGACTTCTCCTTCCACTCCTGATCTAACCACTTACCTTCTGCCTCAATCAAAGCTCTGCATTCCTTGTTTCCCGTCCACATCTCCTCAAATCGAAAAATACGTGA
CCTTTTCCTCCCCAAAGGACTCCTTTTATTCTCAGCCAGGCTACACTCAATTGCCCTGTGATCCAACCTTGCCCAAGGAAGATGAAGAACTCTGTAACTGTCAAACATTT
CTAAAAACACGTTATTGCACAAAAACCTGTCAAGTCTCTCTTGAATCTGTGCCTCCTCCCTCCTATTATTCCACGTAAATAGATCCCCCACACAATCCATTTCAACTAGA
CCGCATGCATCCATCGTTTCACGAAAGTCAGAAATCAAAGAACTCCTCCTATGCCCACCTCCTATTTTCTCAAATTCCCAAAGAATTTCATTTAGATCGCCCCCAACCAA
CCAAGCCGTATTGTCCATATTGTGAATCCGCTTCAAAAGATTCCACGTCTCCCCCCCGTTTAGACGCATTAGGATGTCTATAGAACCTAGTGAACCTCCACCTAAAAGAC
TTCCAAACAATGTTAGCATCAATGTGATTAGATGAAAACGATCGAACGGAAACATCTACCTCAGAGTCCCACAATAAACACAACCCTCTACTAGAGTCTCTACTCCCTAC
TGAAAAACAACCCTGAAACTTCAACCTATCTCTCAGCCTTCCAATTAATGAAACATCGGACTTTGTCTCAGAAAGAAAAAGCACTTGGGCACTTTTGTGACTCAAAAAGT
CACACTTAGCCCGGAACGCCCTAGGATTCCCCATCCCTTGGAACGCAAGATTTTCATTGCTTCCGACAAGGCTGGACTTTAGCCTCTGCCGACCTCAAAGCAATTTCTAT
AAGTTTATTCCCAGCCTCAACTACAACCATTCCACTCACCTCACAACGTTTAGAACCCGACTCCTTGTCAACCTGGATATCGTCCATCTTCCTCTTACCATTCACTTGCT
TCACTCCACCCCCTACACCCAAATTCCTAGCTTTCCTCTTCCATCCTCGGACATTGACGGCCCTCCCCTCTTGAAACTCCCTATCTACCAACTGGGCCTTGACTGCACAA
CCCAAGTCTACAGCCATCAGATTTTGCAAACCATCAGCAAATTCATCTTGAACCTTCTTCTGTTCCAGCTGCCTCTCTTAAAGCATCACTCTGTCACTCCCACTAATCTG
ATTGCACACCATGGCATCCTTCCTAACCAAATCCTCTAGGCCCAAATCACCCCCACTACTACTTGATCCCCACCTCTTACCCTCAATTAGAACCTCCAACTTCGCCCCAC
CTTCTAGGGCACTTGTAGGGCCACTTACAGGCATATTCTCCTTCAAAGAGCCAACAGGCTCCCTCGAAACTACCTCCACCCGCCCCACAGAATCTGCCATCCCCTTCTCA
ATCTGCTTGTCTGTTAGGGCCCCATTATCCACCTCCCTCACACTTCCTCTGTTCAAAACCACCTCTGGCCCTTTCCTCCTCCTTTGTCATCCCCCTGATAGCAGAGCCAA
GGACCATATTGTAAATATGGACCCTCCCCTGAATTCTCACCTACAGCTTCATCGCAATCAGACTTGACATGCCCAACTCGACCACAACAAAAGCAGAAATCCGGCAGCCT
TTCATACCTGATCGGCACCCAACACCCCCCATAGGCCTTTCTATATTGACCTTGATACCACGCCTGAGTGGCTTAGTTATATTTAACAATACTCGCAAACGCAAACTGCT
CCCCCAGCAGAATTTATCCTCCGAACAATCCACCCCCTCAAACTGGCCGATAACATTACCCAATTGTTTAGCCATAGAAGTATTCAGGCAAAATAACGGCAAATCACAAA
AGCGATCCCAAAAAGCCGCCTTATTGAAATCAAAACCTGAAAGTTTTAAGGTTAACACAGGTTTCTCCAAGACAAGCAATGCCTTATCAAAAAACTAGGGACCCCCTCTC
AATACTCTATTACGAATGGCCGCTGACTCGAAACGGAAAAGAAAAATATTGTGCCCTAATGATTCCACCCCAAAATCCCCATTGACATACCAAACTTTCTTTAGAACTCT
CAGCAGAACCTCCCTTTGGATATTTCGCTTTAATAGAAGCTTTCCAACCAAACTTACATCCAGGCACTTTTCAACATCCAATGCCGCTACCCCGTCCACATCCACTGTAA
CTTCCTCTTCTTCGGAAGTCAATGATAGTTTCCTCCATTCATCAACCAAGGCCTCTATACCCATATTGATCTGTGCGAAAACTAAACAGACAAAAAGCCCGACCAACTCC
AAAGAAAGCTAGAGAAAAACCAGTCAAAGACTGGGAACAACGCTCATCCCTAGAGAGAGCGAAAAAAAAACCCTAGGATTTTGGATAGTAATTAGAGAAAAAGGGGAAGA
AGAACATGAAGAATATGCCACTGAGGTTTGACGAGGGGAAAAAAACAAAACACAATCCATTTTTTAAAAATAAAATCCTTTTTGTTTCTTAATTTTATTAATAAAATCGA
ATCCTTTTAAACAAAATCTAACTATTTAACTTAATTAAAACTTTACCATAATAAGATAAACTTTAGTTTTTTCTAAATAAAATAAATAAGAAACTAAAATAAAAACTAAT
TAAAATAAAGTAAAAATGAAAGAAAATAAAAATGAGAGTCGGACTAAATACGGTGTTTATAATACTATCGTGTCAAAAACCTACAAATAAAAAATGATAGCTTGAGGTTC
CATTGTGATATTTGTCCAAACAAAACTCCACAAATCAACAAAGAAGGTCGTGATGATAAATAATGGAATTTAACAAAGTGATCTTAGTGAAAAAAAATGACTAGAAAAAA
CGTACCTACGTCAACCACGAGATGATATTTATAAGCAACAATATAATGAGTCTAGTATGTATGTGTATATATACATACATTAGTACAATCTATCATAGTATTTTAAATTA
AAATTTAGTTTGATTGATTAACGGTGAGATATAGTCGCATAACCATGCTCAAGTCAAGGGATCGAAATCCCTATTTGGATGTATGTAAGTAATGTTATGAGTAGTTGAAT
GAAAACGCGTCCAAAACTATTGTTGGCACATAATTTTTGGCTAAGGTTGAGTTTGGATTATAAAATTGCCATATCAAAGCCTTGTTGAGAATAGACCATATGATATCAAT
TTTGTCCTTTCTATACTGCGTGGCTCTTTCTTGTGGCGTTTACATACTAAAATTCAGGTTGAAAGTAATTTAAAAGAGGGAAAAAAAAATGACCCAATGATTTTAAATCA
TTGGTATAATTTAATGTTTTGCCCATTGACTTGATAAACCCTATTAATAAAATCATGTATATCTTGTCGCTAATTACCAAACTATTGAACTCTTCAAACTTTTTTGCTCT
CTATTAAACAATAATAATCAACTAAAATATACTAAAAAAATCGAAATAACAACTCATTTCTCTTCTCTCCTTCACATTTTATCCCATCAACATCAATCAAATGAGAAAAA
AAATAATAAATTAAAGTTTAGTCCAAATTTTAAGTTTGTTTATTGATCATTGAACTTTGAAAGTGTCTGATAATTTCTTAGCTAAATTTTTAATTTTGTCGGTCTTTAAA
CTTTAGAAAAATGTTTAATAGGTCTCTAAAATTTGATTTTGTGTCTAACAAGTCTCTAAACTTTCAATTTTGTGTTAATAGTGTTCCTAGCTATTCGACATTTTAAAAAA
TTCATGGTCCTATTCGATACAAAATTGAAACTTTCACGATTCTATACAAAATTAAGTTTTACGCGTAATTAATCGGTTAATTTTAAAAAATAATTAAATAGGTCAGTGGC
CTATATTTGATACCAAATTGAAAGATTAGGGATTATTAGACACTTTGTGAGTTCGCCGGTCTATTAAATATAAAATTGAAAATTCAATGACCTATTAGGCACAAAATTGA
AAATTCAAAGATCTAAAATTGAAACTTTACGATTCTATTAGATACAAAATTAAATTTCACGCCTAATTGATCCGTTGATTTTTAAAAATAATAAAATAGGTCAGTGGCCT
GTATTTGATACCAAATTGAAAGATTAGGGATTATTAGATACTTTTTGAGTTTGCGGGACTATTAGACATGAAAATTCAATGACCTATTAGATACAAAATTGCAAATTCAG
GGATCATAGATCTAATAATAGTCTTCAACTAAACTTATAATTTAATCCAAAAAAAAAAGAAAAAAAAAATCCCTTCAACCCTATGAATACTCCTCGACCAATTGAATTCA
AACACATAAACCAAATCAAATGTCTTGTTGTTCTTCTTCTCCCCTTTCTTCCAAGGCTTCAACATTTCTCCTCTACCTCCTCCTTGTAATCACATCTCTAAAAATCCTTA
ATCTCACTTTAAATCTCCTTTCCAATGGCTTCAACAGCTTGCCCACTTTTATTTTCTCCCTATTCTTCAAATCCACACCTTGTGTTCTTATTTCCTTTATAAAGCTCCCT
GCCCAAGCCCTCCTCAGCGCATTTCAGCAGCTTGCACAAGCCATGAGGGCTGTTCTCATGTACACAATAGAGATGGGTTTGGGAATCATAACTTCTTTTGTGCTTATGGT
GCTCGAGTTTCTCAAGAATGCCGTTTTCGGTTCGATTTTGGAGTCTGGTTCGATCTTTGGTGGCCTGGTGGAGAAGACAAAGTCGTCGTTTATGGAGAGTTCGATGATGG
ATCAAGTGCGGGAGATTATCGAGAGCATTTCGAAGAAGATCATCGACATGGCTCTGGAAACAGCAAGCTCTTTTGCAGGCAGCATGTTCGACTTTGTGAAGGACACCATC
CTTGAGTTGTTGAATGAGCCTAGTTCAGCCATCGGAGAGCTGGTAGAGAAGATGAAGGAGGCATTGGCAGGCAGCTCGGCCATGGACGGGGTGCGGGAGATTGTTCAAAA
CTTCTTAAGCAAGATGATCGGTGCGGGATCGGGAGTGGCGAGTTCTTCGGCCAGTGGTTTGTTTGAATTTGTGAAGAATGCTTTGAGTTTGGCTGTTGAATCTGGTCTCA
CTGTTGGAGGCTTATTGGAGAAGATGAAGGAATCTTTGGAGGTTCTAAGTATGGAAGGACTACGAGGGATAATTGAGAGTATTTCCAAGATAATTTTGGATGTCATTTTT
AGCTATTTATTTGGCTAAATCTAAGATTTTTCTTCTATTTAAATTCTAACTTTATCATTTTTTTAAATTATAACTTTTACTTAATGATAAGTTTAGTGTACTATCTCAGT
GAGTTGACATAAATTTAGTAGAATTAAGTCAATTTGAGAAAAAGTAGTCAAAACTTTTAC
Protein sequenceShow/hide protein sequence
MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFVLMVLEFLKN
AVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAGSSAMDGVREIVQNFLSKMI
GAGSGVASSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG