CuGenDBv2

MS011016 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS011016
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Transmembrane protein
Genome location: scaffold35:3597889..3599644
RNA-Seq Expression: MS011016
Synteny: MS011016
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
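
The genome location string above can be split into a scaffold name and two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold35:3597889..3599644")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1756 bp on scaffold35, which is longer than the 741-nucleotide CDS listed under Sequences.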


Homology
GenBank top hits (e value, % identity, alignment)
KAG7026464.1 hypothetical protein SDJN02_10464, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.6e-86; % identity: 68.29)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        M+NFLKSFKIQTKYGT+A A ASSVIISGIGL+LIY YTQRK+EK  +RVF RSMSIGALHGG++AMKR+LQY K+RA ++ Q   LEKLE  +    PD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        F  +Q+++AK+EM GQEDKAIEILKKA KEAKE SL ++EYEYQ+LLVE LIYKG+I EA +A CLN +E SDVRR LYK II++LLNNP++A+EEWE+F
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EMR +F LPPD+KDS FYKLV  FE FK+VVDLL +DI+++ K K
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

XP_022146501.1 uncharacterized protein LOC111015701 [Momordica charantia] (e value: 2.3e-128; % identity: 99.6)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKE+WEEF
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEKN
        KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEKN
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEKN

XP_022926668.1 uncharacterized protein LOC111433729 [Cucurbita moschata] (e value: 2.5e-87; % identity: 69.11)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        M+NFLKSFKIQTKYGT+A A ASSVIISGIGL+LIY YTQRK+EK  +RVF RSMSIGALHGG++AMKR+LQY K+RA ++ Q   LEKLE M     PD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        F  +Q ++AK+EM GQEDKAIEILKKA KEAKE SL ++EYEYQ+LLVE LIYKG+I EA +A CLN +E SDVRR LYK II++LLNNP+KA+EEWE+F
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EMR +F LPPD+KDS FYKLV  FE FK+VVDLL +DI+++ K K
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

XP_023003990.1 uncharacterized protein LOC111497439 [Cucurbita maxima] (e value: 1.9e-87; % identity: 69.11)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        M+NFLKSFKIQTKYGT+A A ASSVIISGIGL+LIY YTQRK+EK  +RVF RSMSIGALHGG++AMKR+LQYQKMRA ++ Q   LEKLE       PD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        F  +Q+++ K+EMRGQEDKAIEILKKA KEAKE SL ++EYEYQ+LLVE LIYKG+I EA +A CLN +E SDVRR LYK II++LLNNP+KA+EEWE+F
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EMR  F LPPD++DS FYKLV  FE FK+VVDLL +DI+++ K K
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

XP_023518384.1 uncharacterized protein LOC111781887 [Cucurbita pepo subsp. pepo] (e value: 3.6e-86; % identity: 68.29)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        M+NFLKSFKIQTKYGT+A A ASSVIISGIGL+LIY YTQRK+EK  +RVF RSMSIGALHGG++AMKR+LQY KMRA ++ Q   LE LE M     PD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        F  +Q+++AK+EM GQEDKAIEILKKA KEA E SL ++EYEYQ+LLVE LIYKG+I EA +A CLN +E SDVRR LYK II++LLNNP++A+EEWE+F
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EMR +F LPPD+KDS FYKLV  FE FK+VVDLL +DI+++ K K
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLZ3 Uncharacterized protein (e value: 1.0e-70; % identity: 57.94)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIK-----
        + NF + F +QTKYG  A A AS+ I+SG+GLVL+Y  T+  ++KN +RVF RS+SIGALHGGK+AMKRLLQ+QKMRA  +N+D+ ++KL+  IK     
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIK-----

Query:  -DCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAK
            P+F K+Q+IV KLEM GQEDKAIE LK AA+EAK+ SL  YE+EYQ+LLVE+ IYKG++ +AE   CL  + TSDVRR LYKAII+VL N  ++A 
Subjt:  -DCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAK

Query:  EEWEEFKEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EWEEF+EMRS FLLPPDVKDS FY L+ +F+ FK+VV +L +DI ++ + K
Subjt:  EEWEEFKEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

A0A2H5N6A7 Uncharacterized protein (e value: 3.4e-45; % identity: 45.65)
Query:  AAAASSVIISGIGLVLIYVYTQRK-REKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQE
        A  A+ V+I      + + YT R  R+  +  V  RSMS+G LHGGKLA++RL+ Y   RA E +      +L+ ++++  PDF KLQ  VAKLEM G+E
Subjt:  AAAASSVIISGIGLVLIYVYTQRK-REKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQE

Query:  DKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEFKEMRSRFLLPPDVKDSQ
         +A+ IL+KA ++A+  +  H  YE Q+L  EMLIYKG+  +A    CL++E+ SD RR LYKAII V+L  PK+A   WE+F EM+S FL PPD +D+Q
Subjt:  DKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEFKEMRSRFLLPPDVKDSQ

Query:  FYKLVTEFEMFKQVVDLLGKDIEERNKEKN
         Y+++ +F+ F  VV+LL +DI+E +K KN
Subjt:  FYKLVTEFEMFKQVVDLLGKDIEERNKEKN

A0A6J1CZJ5 uncharacterized protein LOC111015701 (e value: 1.1e-128; % identity: 99.6)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKE+WEEF
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEKN
        KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEKN
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEKN

A0A6J1EIU2 uncharacterized protein LOC111433729 (e value: 1.2e-87; % identity: 69.11)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        M+NFLKSFKIQTKYGT+A A ASSVIISGIGL+LIY YTQRK+EK  +RVF RSMSIGALHGG++AMKR+LQY K+RA ++ Q   LEKLE M     PD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        F  +Q ++AK+EM GQEDKAIEILKKA KEAKE SL ++EYEYQ+LLVE LIYKG+I EA +A CLN +E SDVRR LYK II++LLNNP+KA+EEWE+F
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EMR +F LPPD+KDS FYKLV  FE FK+VVDLL +DI+++ K K
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

A0A6J1KTB7 uncharacterized protein LOC111497439 (e value: 9.4e-88; % identity: 69.11)
Query:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD
        M+NFLKSFKIQTKYGT+A A ASSVIISGIGL+LIY YTQRK+EK  +RVF RSMSIGALHGG++AMKR+LQYQKMRA ++ Q   LEKLE       PD
Subjt:  MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPD

Query:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF
        F  +Q+++ K+EMRGQEDKAIEILKKA KEAKE SL ++EYEYQ+LLVE LIYKG+I EA +A CLN +E SDVRR LYK II++LLNNP+KA+EEWE+F
Subjt:  FAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEF

Query:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK
        +EMR  F LPPD++DS FYKLV  FE FK+VVDLL +DI+++ K K
Subjt:  KEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKEK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G34530.1 unknown protein (e value: 2.6e-29; % identity: 37.31)
Query:  DRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQIL
        D R   +S+S+GA+ GGKLA++RLL     R    +      + E ++    PDF  LQ  + K+EM G+E K  E+LKKA ++A++    H  YE ++L
Subjt:  DRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQIL

Query:  LVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNP-KKAKEEWEEFKEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKE
        LVEMLIY GN+ EA +  CL  E  +D RR LY+ II  L  +P K+ +E +  F+E++     P   ++ +  ++   F+ FK+V++ L  +IE+ NK 
Subjt:  LVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNP-KKAKEEWEEFKEMRSRFLLPPDVKDSQFYKLVTEFEMFKQVVDLLGKDIEERNKE

Query:  K
        K
Subjt:  K

AT2G34530.2 unknown protein (e value: 1.4e-22; % identity: 42.22)
Query:  DRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQIL
        D R   +S+S+GA+ GGKLA++RLL     R    +      + E ++    PDF  LQ  + K+EM G+E K  E+LKKA ++A++    H  YE ++L
Subjt:  DRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQIL

Query:  LVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKA
        LVEMLIY GN+ EA +  CL  E  +D RR LY+A
Subjt:  LVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKA

AT2G34540.2 unknown protein (e value: 4.7e-07; % identity: 31.03)
Query:  KLAMKRLLQYQKMRATEK----NQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGE
        K A++ L +   M A+ K     +   L KL  +      D  K++++    E  G+ ++A+++L+ A    +        +  Q+ LVE+LI      E
Subjt:  KLAMKRLLQYQKMRATEK----NQDQVLEKLEKMIKDCAPDFAKLQSIVAKLEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGE

Query:  AERASCLNQE--ETSDVRRSLYKAIIQVLLNNPKKAKEEWEEFKE
        A   SCLN E  + SDVR  LYKAII  +L+   +AK+ W+EF++
Subjt:  AERASCLNQE--ETSDVRRSLYKAIIQVLLNNPKKAKEEWEEFKE
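
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a per-segment identity percentage could be derived from such a line, the hypothetical helper below counts matching letters over the query segment length. It assumes the midline is column-aligned with the query and that any trailing spaces were stripped; it is not the method used to produce the percentages reported on this page, which cover whole alignments.

```python
def midline_identity(query, midline):
    """Percent of query columns whose midline character is the same letter.

    ASSUMPTION: midline is column-aligned with query; a midline that lost
    trailing spaces is padded back to the query length.
    """
    if not query:
        raise ValueError("empty query segment")
    padded = midline.ljust(len(query))
    identities = sum(
        1 for q, m in zip(query, padded) if m.isalpha() and m == q
    )
    return 100.0 * identities / len(query)


# Toy segment: three identical residues and one conservative substitution.
print(midline_identity("MRNF", "M+NF"))
```

Because the page shows each alignment in wrapped segments, a figure computed this way for one segment need not match the overall percentage listed next to the hit.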


Sequences
CDS sequence
ATGAGAAACTTCCTCAAAAGTTTTAAGATTCAAACCAAATATGGAACATCAGCAGCAGCCGCAGCATCGAGTGTCATCATCTCGGGCATCGGTCTCGTGTTGATTTATGT
CTATACTCAGAGGAAAAGGGAGAAAAATGATCGACGTGTTTTCATGAGATCAATGTCCATTGGAGCTCTGCATGGTGGCAAACTAGCCATGAAAAGATTGCTTCAATACC
AAAAGATGCGAGCAACCGAGAAAAATCAAGATCAAGTTCTGGAGAAGTTAGAGAAAATGATAAAAGATTGCGCTCCTGATTTCGCGAAGCTGCAGAGCATTGTGGCAAAG
CTGGAAATGAGAGGACAAGAAGACAAAGCTATTGAAATATTAAAAAAAGCAGCAAAGGAAGCCAAGGAGAATTCACTTTTGCACTATGAATATGAATATCAGATTCTTCT
TGTGGAAATGCTCATTTACAAGGGAAATATTGGGGAGGCTGAAAGGGCTTCATGCCTGAATCAAGAAGAAACTTCAGATGTTCGACGCTCGTTATATAAGGCTATAATTC
AAGTGCTGCTGAATAATCCCAAAAAGGCAAAAGAAGAATGGGAAGAGTTCAAGGAAATGAGGAGCCGATTCCTGTTGCCACCCGACGTTAAAGACTCTCAATTTTACAAG
CTCGTTACGGAATTCGAGATGTTTAAGCAAGTCGTCGACCTCCTCGGAAAAGACATTGAGGAGAGGAACAAAGAAAAAAAC
mRNA sequence
ATGAGAAACTTCCTCAAAAGTTTTAAGATTCAAACCAAATATGGAACATCAGCAGCAGCCGCAGCATCGAGTGTCATCATCTCGGGCATCGGTCTCGTGTTGATTTATGT
CTATACTCAGAGGAAAAGGGAGAAAAATGATCGACGTGTTTTCATGAGATCAATGTCCATTGGAGCTCTGCATGGTGGCAAACTAGCCATGAAAAGATTGCTTCAATACC
AAAAGATGCGAGCAACCGAGAAAAATCAAGATCAAGTTCTGGAGAAGTTAGAGAAAATGATAAAAGATTGCGCTCCTGATTTCGCGAAGCTGCAGAGCATTGTGGCAAAG
CTGGAAATGAGAGGACAAGAAGACAAAGCTATTGAAATATTAAAAAAAGCAGCAAAGGAAGCCAAGGAGAATTCACTTTTGCACTATGAATATGAATATCAGATTCTTCT
TGTGGAAATGCTCATTTACAAGGGAAATATTGGGGAGGCTGAAAGGGCTTCATGCCTGAATCAAGAAGAAACTTCAGATGTTCGACGCTCGTTATATAAGGCTATAATTC
AAGTGCTGCTGAATAATCCCAAAAAGGCAAAAGAAGAATGGGAAGAGTTCAAGGAAATGAGGAGCCGATTCCTGTTGCCACCCGACGTTAAAGACTCTCAATTTTACAAG
CTCGTTACGGAATTCGAGATGTTTAAGCAAGTCGTCGACCTCCTCGGAAAAGACATTGAGGAGAGGAACAAAGAAAAAAAC
Protein sequence
MRNFLKSFKIQTKYGTSAAAAASSVIISGIGLVLIYVYTQRKREKNDRRVFMRSMSIGALHGGKLAMKRLLQYQKMRATEKNQDQVLEKLEKMIKDCAPDFAKLQSIVAK
LEMRGQEDKAIEILKKAAKEAKENSLLHYEYEYQILLVEMLIYKGNIGEAERASCLNQEETSDVRRSLYKAIIQVLLNNPKKAKEEWEEFKEMRSRFLLPPDVKDSQFYK
LVTEFEMFKQVVDLLGKDIEERNKEKN
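
The CDS and protein sequences above are related by translation. The sketch below is a minimal illustration using the standard genetic code, written out as the conventional 64-letter string in T, C, A, G order; the `translate` function is a hypothetical helper, and the input is only the first 30 nucleotides of the CDS listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS sequence listed above.
print(translate("ATGAGAAACTTCCTCAAAAGTTTTAAGATT"))  # prints MRNFLKSFKI
```

The printed peptide matches the first ten residues of the protein sequence. The listed CDS is 741 nucleotides and the protein is 247 residues, so the CDS as shown carries no terminal stop codon.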