; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g15880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g15880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:10570866..10571967
RNA-Seq ExpressionMoc03g15880
SyntenyMoc03g15880
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.1e-12875.72Show/hide
Query:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV
        P    +DKYCRFHR+H HNT+  WELKR+IEDLIQD YFKKF+GKPR++S +KKEERK SRTP RR DR AVINTIFGGP GGQSG+KRKELAR ARREV
Subjt:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV

Query:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD
        CIIREQ+PTC ITF  ADLE +HLPHNDA VIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFSRESV PEGC+DL VT+G D
Subjt:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD

Query:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF
         TQVTQMAEFVVIDGRS YNAIF RPIIHSFR +PSTLHQVL+YSTPNG+GM+RGEQ  S+EC+ASALKGS+VCALE   +R      + +LP   +R+F
Subjt:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF

Query:  SLPKEELELVPLL
        + P EELELVPLL
Subjt:  SLPKEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]5.2e-13676.4Show/hide
Query:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV
        P    +DKYCRFHR+HGHNT+  WELK +IEDLIQDGYFKKF+GKPR++S +KKEERKRSRTPPRR DR AVINTIFGGP GGQSG+KRK+LAR ARREV
Subjt:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV

Query:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD
        CIIREQ+PTC ITF  ADL  +HLPHNDA VIAPLIDHV+VRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS ESV PEGC+DL VT+GQD
Subjt:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD

Query:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF
         T+VTQMAEFVV+DGRS YNAIF RPIIHSFR +PSTLHQVL+YSTPNG+G +RGEQ  S+EC+AS LKG++VCALE   +R      E DLP    R+F
Subjt:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF

Query:  SLPKEELELVPLLSPEKQVSIG
        + P EELELVPLLS EKQV +G
Subjt:  SLPKEELELVPLLSPEKQVSIG

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]1.8e-14483.07Show/hide
Query:  NTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD
        N  +CWELKR+IE+LIQDGYFKKF+GKPRSNS++KKEERKRSRTPPRRDDR AVINTIFGGP GGQ GNKR +LAR  RREVCIIREQKPTC ITFGDAD
Subjt:  NTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD

Query:  LEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVTQMAEFVVIDGRST
        LEG+HLPHNDA VIAPLIDH+LVRRVL+DGGASANI SLPTYLALGWTRSQLKKSPTPLVGFS ESVSPEGC+DL VTIGQDATQVTQMAEFVVID +S 
Subjt:  LEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVTQMAEFVVIDGRST

Query:  YNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQFSLPKEELELVPLLSPEKQ
        YNAIF RPIIHSF  V STLHQVL+YST NG+G +RGEQKTS++C+AS LKG AVC LEEQ NR KLQGSE DLPK+ KRQFS P EELELVPLLSPEK 
Subjt:  YNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQFSLPKEELELVPLLSPEKQ

Query:  VSIGTKLGATDRE
        V+IGTKL ATDR+
Subjt:  VSIGTKLGATDRE

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]2.3e-13174.38Show/hide
Query:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV
        P    +DKYCRFHR+HGHNT+ CWELKR+IEDLIQDGYFKKF+GKP ++S +KKEERKRSRTPPRR DR AVINTIFGGP GGQSG+KRKELAR ARREV
Subjt:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV

Query:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD
        CIIREQ PTC ITF  ADLE +HLPHNDA +IA LIDHV+VRRVLV+GGASANILSLPTYLALGWTRSQL++SPTPLVGFS ESV PEGC+DL VT+GQ+
Subjt:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD

Query:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF
         T++TQMAEFVV+DGRS YNAIF RPIIHSFR +PSTLHQVL+Y TPNG+G +RGEQ  S+EC+A+ALKG +VCALE    R      E +LP   +++F
Subjt:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF

Query:  SLPKEELELVPLLSPEKQVS
        + P EELELVPLLSPEKQ++
Subjt:  SLPKEELELVPLLSPEKQVS

XP_022157474.1 uncharacterized protein LOC111024166 [Momordica charantia]8.1e-12979.93Show/hide
Query:  QDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIRE
        +DKYCRFHRDHGHNT+SCWELKR+IEDLIQD YFKKF+GKPRSN  +KKEERKRSRTPPR DDR  VINTIFGGP GGQSGNKRKELAREARREVCIIRE
Subjt:  QDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIRE

Query:  QKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVT
        QKPTCSITFGDADLEG+HLPHNDA VIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGC+DL +TIGQD+TQVT
Subjt:  QKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVT

Query:  QMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGS-----------AVCALEEQANRRKLQGS
        QMAEFVVIDGRS YNAIF RPIIHSFR VPSTLHQVL+YSTPNG+GM+R  +K         LKG            A C   +Q  +RK++G+
Subjt:  QMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGS-----------AVCALEEQANRRKLQGS

TrEMBL top hitse value%identityAlignment
A0A6J1D9E1 uncharacterized protein LOC1110188231.5e-12875.72Show/hide
Query:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV
        P    +DKYCRFHR+H HNT+  WELKR+IEDLIQD YFKKF+GKPR++S +KKEERK SRTP RR DR AVINTIFGGP GGQSG+KRKELAR ARREV
Subjt:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV

Query:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD
        CIIREQ+PTC ITF  ADLE +HLPHNDA VIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFSRESV PEGC+DL VT+G D
Subjt:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD

Query:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF
         TQVTQMAEFVVIDGRS YNAIF RPIIHSFR +PSTLHQVL+YSTPNG+GM+RGEQ  S+EC+ASALKGS+VCALE   +R      + +LP   +R+F
Subjt:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF

Query:  SLPKEELELVPLL
        + P EELELVPLL
Subjt:  SLPKEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198992.5e-13676.4Show/hide
Query:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV
        P    +DKYCRFHR+HGHNT+  WELK +IEDLIQDGYFKKF+GKPR++S +KKEERKRSRTPPRR DR AVINTIFGGP GGQSG+KRK+LAR ARREV
Subjt:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV

Query:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD
        CIIREQ+PTC ITF  ADL  +HLPHNDA VIAPLIDHV+VRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS ESV PEGC+DL VT+GQD
Subjt:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD

Query:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF
         T+VTQMAEFVV+DGRS YNAIF RPIIHSFR +PSTLHQVL+YSTPNG+G +RGEQ  S+EC+AS LKG++VCALE   +R      E DLP    R+F
Subjt:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF

Query:  SLPKEELELVPLLSPEKQVSIG
        + P EELELVPLLS EKQV +G
Subjt:  SLPKEELELVPLLSPEKQVSIG

A0A6J1DPX9 uncharacterized protein LOC1110220068.7e-14583.07Show/hide
Query:  NTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD
        N  +CWELKR+IE+LIQDGYFKKF+GKPRSNS++KKEERKRSRTPPRRDDR AVINTIFGGP GGQ GNKR +LAR  RREVCIIREQKPTC ITFGDAD
Subjt:  NTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD

Query:  LEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVTQMAEFVVIDGRST
        LEG+HLPHNDA VIAPLIDH+LVRRVL+DGGASANI SLPTYLALGWTRSQLKKSPTPLVGFS ESVSPEGC+DL VTIGQDATQVTQMAEFVVID +S 
Subjt:  LEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVTQMAEFVVIDGRST

Query:  YNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQFSLPKEELELVPLLSPEKQ
        YNAIF RPIIHSF  V STLHQVL+YST NG+G +RGEQKTS++C+AS LKG AVC LEEQ NR KLQGSE DLPK+ KRQFS P EELELVPLLSPEK 
Subjt:  YNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQFSLPKEELELVPLLSPEKQ

Query:  VSIGTKLGATDRE
        V+IGTKL ATDR+
Subjt:  VSIGTKLGATDRE

A0A6J1DT04 uncharacterized protein LOC1110228801.1e-13174.38Show/hide
Query:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV
        P    +DKYCRFHR+HGHNT+ CWELKR+IEDLIQDGYFKKF+GKP ++S +KKEERKRSRTPPRR DR AVINTIFGGP GGQSG+KRKELAR ARREV
Subjt:  PGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREV

Query:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD
        CIIREQ PTC ITF  ADLE +HLPHNDA +IA LIDHV+VRRVLV+GGASANILSLPTYLALGWTRSQL++SPTPLVGFS ESV PEGC+DL VT+GQ+
Subjt:  CIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQD

Query:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF
         T++TQMAEFVV+DGRS YNAIF RPIIHSFR +PSTLHQVL+Y TPNG+G +RGEQ  S+EC+A+ALKG +VCALE    R      E +LP   +++F
Subjt:  ATQVTQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQF

Query:  SLPKEELELVPLLSPEKQVS
        + P EELELVPLLSPEKQ++
Subjt:  SLPKEELELVPLLSPEKQVS

A0A6J1DWK7 uncharacterized protein LOC1110241663.9e-12979.93Show/hide
Query:  QDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIRE
        +DKYCRFHRDHGHNT+SCWELKR+IEDLIQD YFKKF+GKPRSN  +KKEERKRSRTPPR DDR  VINTIFGGP GGQSGNKRKELAREARREVCIIRE
Subjt:  QDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARREVCIIRE

Query:  QKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVT
        QKPTCSITFGDADLEG+HLPHNDA VIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGC+DL +TIGQD+TQVT
Subjt:  QKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQVT

Query:  QMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGS-----------AVCALEEQANRRKLQGS
        QMAEFVVIDGRS YNAIF RPIIHSFR VPSTLHQVL+YSTPNG+GM+R  +K         LKG            A C   +Q  +RK++G+
Subjt:  QMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGS-----------AVCALEEQANRRKLQGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCTCCAAGGAGACCCGGAGAAGCGCAACAAGATAAGTACTGCCGTTTTCATCGCGATCACGGCCATAATACGACAAGTTGCTGGGAATTGAAGCGCCGG
ATTGAAGACCTCATTCAAGATGGCTACTTTAAAAAGTTCATGGGCAAACCAAGGTCTAACTCGATCAAAAAAAAGGAAGAGAGGAAGCGTTCAAGAACGCCGCCT
CGTCGGGATGATCGACATGCAGTCATCAACACTATTTTCGGAGGTCCGAGAGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAGGCCAGGCGCGAG
GTATGCATCATCAGGGAGCAGAAGCCCACTTGCTCCATCACTTTCGGCGATGCCGATCTGGAGGGGATCCATTTGCCCCATAATGACGCGTTTGTGATCGCCCCT
CTCATTGATCACGTCCTGGTCCGAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCTCTCCCAACATATCTAGCATTGGGATGGACCAGGTCACAA
TTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTAGAGAATCGGTCTCCCCAGAAGGGTGCCTTGACCTGCTGGTAACGATCGGGCAAGATGCTACCCAAGTA
ACGCAGATGGCCGAGTTCGTGGTAATCGACGGCAGATCGACCTACAACGCCATTTTTTGGAGACCCATTATCCACTCATTTCGGGTTGTCCCCTCAACATTGCAT
CAAGTCTTGAGGTACTCAACCCCGAATGGACTGGGCATGATCCGAGGTGAGCAAAAAACCTCGCAGGAGTGTTTTGCATCCGCGCTTAAAGGGTCGGCGGTATGC
GCCCTGGAAGAGCAAGCCAATCGTCGCAAGCTGCAAGGGTCCGAGACAGACCTGCCCAAGGAAGGCAAAAGGCAGTTCTCCCTGCCAAAAGAAGAGCTCGAGCTT
GTTCCTTTACTTAGCCCTGAAAAACAAGTAAGCATAGGAACCAAGTTGGGGGCCACTGACAGGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAACGCTCCAAGGAGACCCGGAGAAGCGCAACAAGATAAGTACTGCCGTTTTCATCGCGATCACGGCCATAATACGACAAGTTGCTGGGAATTGAAGCGCCGG
ATTGAAGACCTCATTCAAGATGGCTACTTTAAAAAGTTCATGGGCAAACCAAGGTCTAACTCGATCAAAAAAAAGGAAGAGAGGAAGCGTTCAAGAACGCCGCCT
CGTCGGGATGATCGACATGCAGTCATCAACACTATTTTCGGAGGTCCGAGAGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAGGCCAGGCGCGAG
GTATGCATCATCAGGGAGCAGAAGCCCACTTGCTCCATCACTTTCGGCGATGCCGATCTGGAGGGGATCCATTTGCCCCATAATGACGCGTTTGTGATCGCCCCT
CTCATTGATCACGTCCTGGTCCGAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCTCTCCCAACATATCTAGCATTGGGATGGACCAGGTCACAA
TTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTAGAGAATCGGTCTCCCCAGAAGGGTGCCTTGACCTGCTGGTAACGATCGGGCAAGATGCTACCCAAGTA
ACGCAGATGGCCGAGTTCGTGGTAATCGACGGCAGATCGACCTACAACGCCATTTTTTGGAGACCCATTATCCACTCATTTCGGGTTGTCCCCTCAACATTGCAT
CAAGTCTTGAGGTACTCAACCCCGAATGGACTGGGCATGATCCGAGGTGAGCAAAAAACCTCGCAGGAGTGTTTTGCATCCGCGCTTAAAGGGTCGGCGGTATGC
GCCCTGGAAGAGCAAGCCAATCGTCGCAAGCTGCAAGGGTCCGAGACAGACCTGCCCAAGGAAGGCAAAAGGCAGTTCTCCCTGCCAAAAGAAGAGCTCGAGCTT
GTTCCTTTACTTAGCCCTGAAAAACAAGTAAGCATAGGAACCAAGTTGGGGGCCACTGACAGGGAATAA
Protein sequenceShow/hide protein sequence
MNAPRRPGEAQQDKYCRFHRDHGHNTTSCWELKRRIEDLIQDGYFKKFMGKPRSNSIKKKEERKRSRTPPRRDDRHAVINTIFGGPRGGQSGNKRKELAREARRE
VCIIREQKPTCSITFGDADLEGIHLPHNDAFVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVSPEGCLDLLVTIGQDATQV
TQMAEFVVIDGRSTYNAIFWRPIIHSFRVVPSTLHQVLRYSTPNGLGMIRGEQKTSQECFASALKGSAVCALEEQANRRKLQGSETDLPKEGKRQFSLPKEELEL
VPLLSPEKQVSIGTKLGATDRE