; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:8138659..8142792
RNA-Seq ExpressionMoc09g09680
SyntenyMoc09g09680
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152029.1 uncharacterized protein LOC111019838 [Momordica charantia]3.1e-12479.32Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRGDPEKRNKDKY RFHRDHGHNT+SCWELK QIEDLIQDGYFKKFVGKPRSNSVEK EERKRSRTP RRED+PAVINTI GGP+GGQS 
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        NKR ELAR AR +VCIIREQRPT  ITF   DLE VHLPHNDALVIAPLIDHV+VRRVL+DGGASANILSL  YL LGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTN
           CIDLPVT G D TQVT+MAEFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTP GVG VRGEQ  SRECY+SALKGSSVC LE+Q +
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.3e-12276.69Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRG PE+R+KDKY RFHR+HGHNT+  WELK QIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR D+PAVINTI GGP+GGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        +KRK+LAR AR EVCIIREQRPTC ITF   DL  VHLPHNDALVIAPLIDHV+VRRVL+DGGASANILSLP YLALGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNR
           CIDLPVT+G D T+VTQMAEFVV+DG+SAYNAIFGRPIIHSFR +PSTLHQVLKYSTP+GVGTVRGEQ  SRECY+S LKG+SVC LE  T+R
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNR

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]9.3e-12177.15Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRGDPEK NKD              +CWELK QIE+LIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR+D+PAVINTI GGP+GGQ G
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        NKR +LAR  R EVCIIREQ+PTC ITFGD DLEGVHLPHNDALVIAPLIDH+LVRRVLIDGGASANI SLP YLALGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQ
           CIDL VTIG DATQVTQMAEFVVID KSAYNAIFGRPIIHSF  V STLHQVLKYST +GVGTVRGEQKTSR+CY+S LKG +VCTLEEQTNRGKLQ
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQ

Query:  GS
        GS
Subjt:  GS

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]8.7e-11975.26Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRG PE+R+KDKY RFHR+HGHNT+ CWELK QIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPRR D+PAVINTI GGP+GGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        +KRKELAR AR EVCIIREQ PTC ITF   DLE VHLPHNDAL+IA LIDHV+VRRVL++GGASANILSLP YLALGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLE
           CIDLPVT+G + T++TQMAEFVV+DG+S YNAIFGRPIIHSFR +PSTLHQVLKY TP+GVGTVRGEQ  SRECY++ALKG SVC LE
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]6.0e-12885.36Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRGD EKRNK+KY RFHRDHGHNTTSCWELK QIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRED+PAVINTI GGPNGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTRCIDLPVTIGHDATQVTQMA
        NKRKELAR+AR EVCIIRE +PTCSITFGD DLEGVHLPHNDALVIA LIDH LVRRVLIDGG                  CIDLPVTIG DATQVTQMA
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTRCIDLPVTIGHDATQVTQMA

Query:  EFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQGS
        EFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTP+ VG VRGEQKTSRECY+SALKGS+VC LEEQTNRGKLQ S
Subjt:  EFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQGS

TrEMBL top hitse value%identityAlignment
A0A6J1DD03 uncharacterized protein LOC1110198996.3e-12376.69Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRG PE+R+KDKY RFHR+HGHNT+  WELK QIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR D+PAVINTI GGP+GGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        +KRK+LAR AR EVCIIREQRPTC ITF   DL  VHLPHNDALVIAPLIDHV+VRRVL+DGGASANILSLP YLALGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNR
           CIDLPVT+G D T+VTQMAEFVV+DG+SAYNAIFGRPIIHSFR +PSTLHQVLKYSTP+GVGTVRGEQ  SRECY+S LKG+SVC LE  T+R
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNR

A0A6J1DET8 uncharacterized protein LOC1110198381.5e-12479.32Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRGDPEKRNKDKY RFHRDHGHNT+SCWELK QIEDLIQDGYFKKFVGKPRSNSVEK EERKRSRTP RRED+PAVINTI GGP+GGQS 
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        NKR ELAR AR +VCIIREQRPT  ITF   DLE VHLPHNDALVIAPLIDHV+VRRVL+DGGASANILSL  YL LGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTN
           CIDLPVT G D TQVT+MAEFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTP GVG VRGEQ  SRECY+SALKGSSVC LE+Q +
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTN

A0A6J1DPX9 uncharacterized protein LOC1110220064.5e-12177.15Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRGDPEK NKD              +CWELK QIE+LIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR+D+PAVINTI GGP+GGQ G
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        NKR +LAR  R EVCIIREQ+PTC ITFGD DLEGVHLPHNDALVIAPLIDH+LVRRVLIDGGASANI SLP YLALGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQ
           CIDL VTIG DATQVTQMAEFVVID KSAYNAIFGRPIIHSF  V STLHQVLKYST +GVGTVRGEQKTSR+CY+S LKG +VCTLEEQTNRGKLQ
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQ

Query:  GS
        GS
Subjt:  GS

A0A6J1DT04 uncharacterized protein LOC1110228804.2e-11975.26Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRG PE+R+KDKY RFHR+HGHNT+ CWELK QIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPRR D+PAVINTI GGP+GGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------
        +KRKELAR AR EVCIIREQ PTC ITF   DLE VHLPHNDAL+IA LIDHV+VRRVL++GGASANILSLP YLALGWTR                   
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTR-------------------

Query:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLE
           CIDLPVT+G + T++TQMAEFVV+DG+S YNAIFGRPIIHSFR +PSTLHQVLKY TP+GVGTVRGEQ  SRECY++ALKG SVC LE
Subjt:  ---CIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLE

A0A6J1DZB9 uncharacterized protein LOC1110249042.9e-12885.36Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG
        MEKLLKRPEKLRGD EKRNK+KY RFHRDHGHNTTSCWELK QIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRED+PAVINTI GGPNGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSG

Query:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTRCIDLPVTIGHDATQVTQMA
        NKRKELAR+AR EVCIIRE +PTCSITFGD DLEGVHLPHNDALVIA LIDH LVRRVLIDGG                  CIDLPVTIG DATQVTQMA
Subjt:  NKRKELARKARHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTRCIDLPVTIGHDATQVTQMA

Query:  EFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQGS
        EFVVIDG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTP+ VG VRGEQKTSRECY+SALKGS+VC LEEQTNRGKLQ S
Subjt:  EFVVIDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGCTCCTCAAGAGACCTGAGAAGCTCCGAGGGGACCCAGAAAAGCGCAACAAAGATAAATACTACCGTTTTCATCGTGATCACGGCCATAACACAACAAGCTG
CTGGGAGCTGAAGCACCAGATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAGTTCGTAGGAAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTT
CAAGAACGCCGCCTCGAAGAGAGGACCAACCTGCGGTCATCAACACTATTTCCGGGGGCCCAAATGGGGGCCAGTCCGGAAATAAAAGGAAGGAGCTAGCTCGCAAGGCC
AGGCACGAGGTATGTATCATCAGGGAGCAGAGACCTACTTGCTCCATCACCTTTGGCGATACCGATCTGGAGGGGGTACACTTGCCCCATAACGATGCACTGGTGATCGC
CCCTCTGATCGATCATGTCCTGGTCAGGAGAGTGTTGATAGATGGAGGCGCGTCTGCCAACATCTTGTCCCTCCCAATATATCTTGCCTTGGGTTGGACCAGGTGTATCG
ATCTGCCAGTCACGATTGGACATGATGCTACCCAAGTAACGCAGATGGCTGAGTTCGTGGTGATCGACGGCAAGTCGGCCTACAACGCCATCTTCGGGAGACCCATCATC
CACTCATTCCGGGTCGTCCCCTCCACACTGCATCAGGTCCTGAAGTACTCAACCCCTGATGGAGTGGGCACGGTCCGAGGTGAGCAAAAGACCTCACGAGAATGCTATTC
ATCCGCGCTTAAAGGGTCGTCTGTATGCACCCTGGAAGAGCAGACCAATCGTGGCAAGCTGCAAGGATCGTACCTCGCTCATTTCGGGACTTACGAGGTAAGTCAAGTTC
CAAGGTCTGAGAACTCTAATGCGGACGCCTTGGCCAAATTGGCATCAGCATATGAGACCGACCTGACTAGATCAGTCCCGGTCGAGATCTTGGACACTCCTTCAATCTTG
GAGCCAGATGTAATGGAGGTTGATACTCCATCACCCACTTGGATGGACCCGATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGG
AAAAATGCTTGCGCATCCATGGAACGCGGAGCAATTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGCTCCTCAAGAGACCTGAGAAGCTCCGAGGGGACCCAGAAAAGCGCAACAAAGATAAATACTACCGTTTTCATCGTGATCACGGCCATAACACAACAAGCTG
CTGGGAGCTGAAGCACCAGATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAGTTCGTAGGAAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTT
CAAGAACGCCGCCTCGAAGAGAGGACCAACCTGCGGTCATCAACACTATTTCCGGGGGCCCAAATGGGGGCCAGTCCGGAAATAAAAGGAAGGAGCTAGCTCGCAAGGCC
AGGCACGAGGTATGTATCATCAGGGAGCAGAGACCTACTTGCTCCATCACCTTTGGCGATACCGATCTGGAGGGGGTACACTTGCCCCATAACGATGCACTGGTGATCGC
CCCTCTGATCGATCATGTCCTGGTCAGGAGAGTGTTGATAGATGGAGGCGCGTCTGCCAACATCTTGTCCCTCCCAATATATCTTGCCTTGGGTTGGACCAGGTGTATCG
ATCTGCCAGTCACGATTGGACATGATGCTACCCAAGTAACGCAGATGGCTGAGTTCGTGGTGATCGACGGCAAGTCGGCCTACAACGCCATCTTCGGGAGACCCATCATC
CACTCATTCCGGGTCGTCCCCTCCACACTGCATCAGGTCCTGAAGTACTCAACCCCTGATGGAGTGGGCACGGTCCGAGGTGAGCAAAAGACCTCACGAGAATGCTATTC
ATCCGCGCTTAAAGGGTCGTCTGTATGCACCCTGGAAGAGCAGACCAATCGTGGCAAGCTGCAAGGATCGTACCTCGCTCATTTCGGGACTTACGAGGTAAGTCAAGTTC
CAAGGTCTGAGAACTCTAATGCGGACGCCTTGGCCAAATTGGCATCAGCATATGAGACCGACCTGACTAGATCAGTCCCGGTCGAGATCTTGGACACTCCTTCAATCTTG
GAGCCAGATGTAATGGAGGTTGATACTCCATCACCCACTTGGATGGACCCGATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGG
AAAAATGCTTGCGCATCCATGGAACGCGGAGCAATTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MEKLLKRPEKLRGDPEKRNKDKYYRFHRDHGHNTTSCWELKHQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDQPAVINTISGGPNGGQSGNKRKELARKA
RHEVCIIREQRPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPIYLALGWTRCIDLPVTIGHDATQVTQMAEFVVIDGKSAYNAIFGRPII
HSFRVVPSTLHQVLKYSTPDGVGTVRGEQKTSRECYSSALKGSSVCTLEEQTNRGKLQGSYLAHFGTYEVSQVPRSENSNADALAKLASAYETDLTRSVPVEILDTPSIL
EPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMGKMLAHPWNAEQLKRYYP