; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0495 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0495
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUnknown protein
Genome locationMC01:11413628..11418867
RNA-Seq ExpressionMC01g0495
SyntenyMC01g0495
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142096.1 uncharacterized protein LOC101220441 [Cucumis sativus]6.22e-13482.53Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRCTLNFRVS+ FL+VS FL+C + GAESAVVTLDS++IYKTHEWLA+KPTVYF CQGGN+T LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHN----SEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN KEFNATF+CL+CTAYSNV+S+S+ T +     E+GM  A IIVIS + S VLIIG+VVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHN----SEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLGDVI
        RR+QDQARFLKLFEDGDDIEDELGL DVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLGDVI

XP_022143802.1 uncharacterized protein LOC111013628 [Momordica charantia]5.63e-161100Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGLGDVI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

XP_022963066.1 uncharacterized protein LOC111463378 isoform X1 [Cucurbita moschata]2.79e-13682.22Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRC+ +F V ++FL+VSA L+C + G ESAVVTLDS++IYKTHEWLASKPTVYF+CQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N+KEFNATF+CL+CTAYSNV+SSS+ +H++E+GMH AAII+IS +VSTVLI+G+V+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGL DVI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

XP_023518189.1 uncharacterized protein LOC111781731 [Cucurbita pepo subsp. pepo]7.66e-13481.78Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR + +F+V ++FL+VSA L+C +GG ESAVVTLDS++IYKTHEWLASKPTVYF+CQGGNKTKLPDVQK HVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N+KEFNATF+CL+CTAYSNV+SSS+ + ++E+GMH AAII+IS +VSTVLIIG+V+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGL DVI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

XP_038883476.1 uncharacterized protein LOC120074430 isoform X1 [Benincasa hispida]3.18e-14286.22Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRCTLNFRVS++FL+VS FL+C +GG ESAVVTLDS++IYKTHEWLAS+PTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN +EFNATF+CLQCTAYSNV+SSS P+++ E+GMH A IIVIS + STVLI+G+VVGYKYWQ+KRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGL +VI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

TrEMBL top hitse value%identityAlignment
A0A0A0KXD0 Uncharacterized protein3.01e-13482.53Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRCTLNFRVS+ FL+VS FL+C + GAESAVVTLDS++IYKTHEWLA+KPTVYF CQGGN+T LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHN----SEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN KEFNATF+CL+CTAYSNV+S+S+ T +     E+GM  A IIVIS + S VLIIG+VVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHN----SEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLGDVI
        RR+QDQARFLKLFEDGDDIEDELGL DVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLGDVI

A0A1S3BH72 uncharacterized protein LOC1034898063.51e-13381.66Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRCTL FRV + FL+   FL+C +GGAESAVVTLDS++IYKTHEWLA+KPTVYF C GGNKT LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNS----EQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFT+PAGKYVRFN KEFNATF+CLQCTAYSNV+S+S+ T +S    E+GMH A IIVIS + S VLI+G+VVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNS----EQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLGDVI
        RR+QDQARFLKLFEDGDDIEDELGL DVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLGDVI

A0A6J1CRX4 uncharacterized protein LOC1110136282.72e-161100Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGLGDVI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

A0A6J1HE82 uncharacterized protein LOC111463378 isoform X11.35e-13682.22Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRC+ +F V ++FL+VSA L+C + G ESAVVTLDS++IYKTHEWLASKPTVYF+CQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N+KEFNATF+CL+CTAYSNV+SSS+ +H++E+GMH AAII+IS +VSTVLI+G+V+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGL DVI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

A0A6J1ICG3 uncharacterized protein LOC111471262 isoform X13.04e-13381.33Show/hide
Query:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR + +F V ++FL+VS  L+C + G ESAVVTLDS++IYKTHEWLASKPTVYF+CQGGNKTKLPDVQK HVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N+KEFNATF+CL+CTAYSNV+SSS+ +H++E+GMH AAII+IS +VSTVLIIG+V+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLGDVI
        DQARFLKLFEDGDDIEDELGL DVI
Subjt:  DQARFLKLFEDGDDIEDELGLGDVI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09645.1 unknown protein1.1e-0540Show/hide
Query:  THNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLGD
        +H S        I+V+   V+  +    +  YK WQKK+R++  AR LKLFE+ D++E ELGL D
Subjt:  THNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLGD

AT1G57765.1 unknown protein3.1e-0537.97Show/hide
Query:  SNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLGD
        S+ + + A +H S        I ++       L  G V G     YK WQKK+R++  AR LKLFE+ D++E ELGL D
Subjt:  SNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLGD

AT1G57765.2 unknown protein2.8e-0634.07Show/hide
Query:  NATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLGD
        ++ + C+     S+ + + A +H S        I ++       L  G V G     YK WQKK+R++  AR LKLFE+ D++E ELGL D
Subjt:  NATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLGD

AT3G53490.1 unknown protein5.3e-6656.81Show/hide
Query:  FLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGFYEEDSIKSDDVFEE
        FL+   F + + G   +  VTLDSV I+ TH+W ++KPTV+FQC+G NKT LPDV++ +V YSFNGEESWQPLTE +  KCKRCG YE+D +K  D F+E
Subjt:  FLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGFYEEDSIKSDDVFEE

Query:  WEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQ--GMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFLKLFEDG
        WE CPSDFTA  G Y RF EKEFNATF+C  C+     S+  + T   EQ  GMH   +++I  L+  V+ +G++VGYKYW+KK+R+Q+QARFLKLFEDG
Subjt:  WEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQ--GMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFLKLFEDG

Query:  DDIEDELGLGDVI
        DDIEDELGL + +
Subjt:  DDIEDELGLGDVI

AT5G02720.1 unknown protein9.8e-2841.73Show/hide
Query:  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGI
        +T    +KCKRCG YE+ S+ SD  F+ WE CP+DF+A +  Y+ F EKE NATF+C  C  + +  ++S+P      G+     I+   L +T++++G 
Subjt:  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGI

Query:  VVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLGDVI
        V  +K+ Q+ ++++DQARF++LFE+ D+ EDELGL  VI
Subjt:  VVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLGDVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATTCGGTGTACTCTGAATTTCAGAGTTTCGTCGATCTTTCTTATTGTTTCGGCATTCCTGAACTGCGTTGCAGGGGGCGCAGAATCGGCGGTTGTTACGCTTGA
TTCCGTAATGATATACAAGACACACGAGTGGTTGGCCTCCAAACCCACTGTTTATTTTCAATGTCAAGGGGGAAACAAAACGAAATTGCCTGATGTACAGAAAGAGCATG
TTTTATACAGCTTCAATGGTGAAGAATCTTGGCAGCCATTGACTGAATTTGAAAGCAAAAAGTGTAAGCGATGTGGGTTCTACGAGGAGGACAGCATAAAATCTGATGAT
GTATTCGAGGAGTGGGAGTTTTGTCCATCTGATTTTACAGCTCCCGCTGGGAAATATGTACGATTCAACGAAAAAGAGTTCAATGCCACTTTTATGTGCTTGCAGTGCAC
AGCTTATTCCAATGTTTCCAGTTCAAGTGCGCCTACGCATAACAGCGAACAGGGAATGCATGTTGCTGCAATCATAGTTATAAGTGCTCTGGTTTCAACTGTGCTAATTA
TCGGTATCGTGGTTGGTTACAAGTATTGGCAAAAGAAGAGAAGGGAGCAAGATCAGGCTAGATTTCTGAAGCTGTTTGAAGATGGGGATGACATTGAAGATGAGTTGGGC
CTTGGCGATGTAATATGA
mRNA sequenceShow/hide mRNA sequence
GGCGGTTCCATACTACGTTCGTCGATCAATGGCAGATCGCCTCCGATTGCCGCAGCCTTACAGATTACGCTACGCAACCACCACCATCCGATGATGTGAGCTTCACATTA
CGTGGCAGAAATCCGGCGCAGTAACAGGTGCACGTTGGTGTATGCGTGTCACAGACGCAAATTCCCCAGTCCCTCCAACGTTACCCACAAATCCATGAACCGGTCAGCCA
AAGCTCCCAACTCCTAGTTATTTTCGCTCCCACGCAAAGCACCCCGAAACCGCCATAGCCTTGAAGCTTCCGTTTAATCGAATCGGAGTTCCAGCTCCGCCGTCTACCAC
CGCTCTCTCTCTCCGATCCCGGCGTTTTGACCTCCCTCCGATTTTTCTTATCTGGAATTTCTGGAATTGATTTAGGCGAGCTGAGGGCGGAGTTTCCAAATTTTAATTAA
TTCGCGATATGCCGATTCGGTGTACTCTGAATTTCAGAGTTTCGTCGATCTTTCTTATTGTTTCGGCATTCCTGAACTGCGTTGCAGGGGGCGCAGAATCGGCGGTTGTT
ACGCTTGATTCCGTAATGATATACAAGACACACGAGTGGTTGGCCTCCAAACCCACTGTTTATTTTCAATGTCAAGGGGGAAACAAAACGAAATTGCCTGATGTACAGAA
AGAGCATGTTTTATACAGCTTCAATGGTGAAGAATCTTGGCAGCCATTGACTGAATTTGAAAGCAAAAAGTGTAAGCGATGTGGGTTCTACGAGGAGGACAGCATAAAAT
CTGATGATGTATTCGAGGAGTGGGAGTTTTGTCCATCTGATTTTACAGCTCCCGCTGGGAAATATGTACGATTCAACGAAAAAGAGTTCAATGCCACTTTTATGTGCTTG
CAGTGCACAGCTTATTCCAATGTTTCCAGTTCAAGTGCGCCTACGCATAACAGCGAACAGGGAATGCATGTTGCTGCAATCATAGTTATAAGTGCTCTGGTTTCAACTGT
GCTAATTATCGGTATCGTGGTTGGTTACAAGTATTGGCAAAAGAAGAGAAGGGAGCAAGATCAGGCTAGATTTCTGAAGCTGTTTGAAGATGGGGATGACATTGAAGATG
AGTTGGGCCTTGGCGATGTAATATGAGGACAGTTTGTGTAATTCAAGAGAGAGAAAATTAATGTACAGTGATACATGTTGCACAATCTTTTTCCTTTTTCTGTCATTATT
AGAATAGATATAGTTTTAAATTTTGGCTTCATTCATTGATCACAGGGAGAACTTCCTCTTAAATGCTAAGAGATAGAGAGATAGAAGACAGACCATTGAGATGTCTGAAT
GATTTTTTTTGTTTATTTCTTTGGTGTATTAAACTGGGGGAAATTTATTATTAGTTCTGGTCCAAACATTTGAAGAGAAAACAAATAGCTCCGAGTACATTTCATCCAAC
CAATTGCCGTCTGACAAAAAAGTAAAGGAGTGATTAGTCTAACTTCATATATGAAAAAAAATTTCTAGTACTCAATGGATACTTGTAATTTCGTGTTTAAACAATACTGG
TTTAGTACTAGAAGGGAATTCTTTTCGATTAGTACCCTAAGATGGAAACAATGAGTGCCATAGCCTCGGAATTGCAGTCATGTTCAGGTTTCAGCTCGTCCATTCCAGTG
ATAGCTGAACTCATCTTCTCCAAGTCGAAACCAAGACTGAAAGCAATCCCAAGCAGATTCATAGCCCAAATTTCTAGAATCTCCATTATAATCCTCATTGTGGAGGAGAA
TGTCATCTGTATTCTCCATTGTGCAGCCATTTTGATAGCCATATGGGTCTTGGTCTTGGAAATTGAAGAAGAGATCTCTGCAAAATTCAGTGTAAGAAGTGGAATAACCA
AAACAACTGTGGTAAGACTGCTCACCCTCAGAACTGTCTCCTATTGGGTCTCCATCAGACTGGTGTCTGATTGGGAAACCGATCTCTGTTGTCCAAAATGGAGGAGAATA
CTGCTCTGGAGGGTAGTCGTAGCTGTTTCCATTTGGCATGATAAGGCCAATGGAGTCAGAACCACTGTAACTCATAGCAGATATTCAAGCTCCAACTCTCTTGTTTACAA
TATATTTGTTTATAAGGGAATCACAGGCCACAAGGTATCCTAGATTTTGAAACTGGGCCTTGGGGGAATGGAAGTGTTGATATGACCTGTTATATAAACTTGGAAAGGAG
AAAGAAAACACCATATTTGTAGTGGATTCCATTCTAATCTAAAAACATACTCTTTTTGAAATCACAGGACAGGTACAACTAACTTTTCAAGACAACGAAAATAGGTACTT
ATTGCGGTGAACTCGGGACCAATTCCTACATTTTAATGTGGAGGCATTGTTCATTTCCACCCAATAAAAGCATTCACCCAATCAAGAGAATGAAATCTCTTTATAAAGCT
TGCAAAAGGCAATATTCGAAGCAGCTTTTTTTGGGGGATGAAAAGCAGCAAAAAGGGAAAGGCATTTCCATATTAGTTGCCAAAGCATCTTATCTCATTTCTTTCACCGA
GATTGAACAAGAAGAAAGTAGCTGCCTCCAACTGCTGGCTAAGAATCCCTTTTCTTCTTTGCTTTCTGGCAGAGACATGCCTTGCACAGACCGTCTCCATAGTGGCCAAC
ATGTGCTATACACCATTTCTCTTATCTTCTGGAATACATGTTATTGAAGGCTCCTTCAGACCCTAAACACGACAATGCACTAAATATATTAGTGCATCATTCCATGGCTT
CAATCTCAACAATATTTAATCTTTGATGGAGAATCAGAAATTGGGAACATAAGTGTTGGCACCGCGATTTGCTTGGTTTGGGGTTCTATAAATAACCCGGGTTCAGGGTC
TGAGGCTAAACAAGAGAAGAGTTCTCTCTCTTTGCATCTATAATTCTTGCTCCAATGTTTGTATAAATGAACAGGCTGTTTAGAAGTTGTATGGAATGAATCGTAAAAGC
TATTGAAATTACTATCTACAAATATGTGATTTTAGAAAGCATAGCAACTTTATTTTCTTAAAAAAAAATTCATGAAAACACATAAAAGAACAAGAAAAACTCTCTTGGGA
AAAGTTATCTACTTCTACAGATTTTTTATTTACAAAAAAATGTTTTCAGAACAGTTTCCAAACAACCCTATGCATAGCTAGAAGGAAGGATTTGATCACAAGAAGACTGC
AGAGAATCATATATGTTTCTAGTCAACATGTCAGCATGGTACAACACAACAAATACTTGATCATCCATAAGAAATATATATGACCTTTGTATTTTATACAGACTCAGCTC
TGAACTGCCTTGTTGAAGTTTTCCAATTCCTTGGATGGGAAACCATAGACTGCTTTTATAGTGTCCCTACAGCTACTGCAGGAAAGATGAACAATCAGCCCAACCACAAG
AATTTGATATTCCTAAAAAACATCAAGGTGGATATGTTACTGGAATTCCTCCCATCCTATTCATCTGCCAAACTCTCTCTGGGTTTTCTCTCGTTCACAAGCTTTAACTA
GTTGGTTCATTTCTATACTCTATGTTTGTTCTCAGCCCTAGCCCAGTTCTTCTGTTTCTTATTTTCAGTGTTCGGTAACATTTCTGTTTATGGTTTATCATTTTCTCCTA
TCATTCGTTCTAATTGTCCAATTT
Protein sequenceShow/hide protein sequence
MPIRCTLNFRVSSIFLIVSAFLNCVAGGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGFYEEDSIKSDD
VFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQCTAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFLKLFEDGDDIEDELG
LGDVI