CuGenDBv2

Moc04g20660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g20660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:15005031..15007861
RNA-Seq Expression: Moc04g20660
Synteny: Moc04g20660
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
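
The genome location string can be split into its chromosome and coordinates programmatically. The following is a minimal sketch; the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr4:15005031..15007861")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr4 15005031 15007861 2831
```

Under that assumption the locus covers 2,831 bases on chr4.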


Homology
GenBank top hits | e value | % identity
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | 2.0e-114 | 71.25
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRG PE+RNKD YCRF R H HNTS+ WELKRQIEDLIQD  FKKFVGKPR++S EKKEERK SR   RR DRPAVI TIF GPSGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ+PTC I+F +ADLE VHLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDLPVT+G D +QV+Q  EFVVIDGRSAYNAIF                               GEQ  SRECYASALKGSSVCALE   S D   
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLL
        + +A+LP   +REF+AP EELELVPLL
Subjt:  KPEADLPKSGKREFSAPIEELELVPLL

XP_022152029.1 uncharacterized protein LOC111019838 [Momordica charantia] | 4.1e-115 | 75.76
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRGDPEKRNKD YCRF R HGHNTS+CWELKRQIEDLIQDG FKKFVGKPRSNSVEK EERKRSR   RR+DRPAVI TIF GPSGGQS 
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        NKR ELAR ARR+VCIIREQ+PT  I+F +ADLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSL TYL LGWTRSQLKKS TPLVGFSGES++
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWD
         EGCIDLPVT GQD +QV++  EFVVIDGRSAYNAIF                               GEQ  SRECYASALKGSSVCALE+QAS D
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWD

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] | 1.1e-123 | 72.32
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRG PE+R+KD YCRF R HGHNTS+ WELK QIEDLIQDG FKKFVGKPR++S EKKEERKRSR  PRR DRPAVI TIF GPSGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        +KRK+LAR ARREVCIIREQ+PTC I+F  ADL  VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDLPVT+GQD ++V+Q  EFVV+DGRSAYNAIF                               GEQ  SRECYAS LKG+SVCALE   S D   
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIG
        + EADLP    REF+AP EELELVPLLS E+QV +G
Subjt:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIG

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia] | 2.0e-130 | 72.32
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRGDPEK NKD              NCWELKRQIE+LIQDG FKKFVGKPRSNSVEKKEERKRSR  PRRDDRPAVI TIF GPSGGQ G
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        NKR +LAR  RREVCIIREQKPTC I+FG+ADLEGVHLPHNDALVIAPLIDH+LVRRVL+DGGASANI SLPTYLALGWTRSQLKKSPTPLVGFSGESVS
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDL VTIGQD +QV+Q  EFVVID +SAYNAIF                               GEQ+TSR+CYAS LKG +VC LEEQ +  +L 
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIGTKLGATAREELINFLRSH
          EADLPK  KR+FS P EELELVPLLSPE+ V+IGTKL AT R+ELINFLRS+
Subjt:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIGTKLGATAREELINFLRSH

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia] | 4.7e-119 | 70.06
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRG PE+R+KD YCRF R HGHNTS+CWELKRQIEDLIQDG FKKFVGKP ++S EKKEERKRSR  PRR DRPAVI TIF GPSGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ PTC I+F  ADLE VHLPHNDAL+IA LIDHV+VRRVLV+GGASANILSLPTYLALGWTRSQL++SPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDLPVT+GQ+ ++++Q  EFVV+DGRS YNAIF                               GEQ  SRECYA+ALKG SVCALE     D   
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVS
        + EA+LP   ++EF+AP EELELVPLLSPE+Q++
Subjt:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVS

TrEMBL top hits | e value | % identity
A0A6J1D9E1 uncharacterized protein LOC111018823 | 9.9e-115 | 71.25
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRG PE+RNKD YCRF R H HNTS+ WELKRQIEDLIQD  FKKFVGKPR++S EKKEERK SR   RR DRPAVI TIF GPSGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ+PTC I+F +ADLE VHLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDLPVT+G D +QV+Q  EFVVIDGRSAYNAIF                               GEQ  SRECYASALKGSSVCALE   S D   
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLL
        + +A+LP   +REF+AP EELELVPLL
Subjt:  KPEADLPKSGKREFSAPIEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899 | 5.2e-124 | 72.32
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRG PE+R+KD YCRF R HGHNTS+ WELK QIEDLIQDG FKKFVGKPR++S EKKEERKRSR  PRR DRPAVI TIF GPSGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        +KRK+LAR ARREVCIIREQ+PTC I+F  ADL  VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDLPVT+GQD ++V+Q  EFVV+DGRSAYNAIF                               GEQ  SRECYAS LKG+SVCALE   S D   
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIG
        + EADLP    REF+AP EELELVPLLS E+QV +G
Subjt:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIG

A0A6J1DET8 uncharacterized protein LOC111019838 | 2.0e-115 | 75.76
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRGDPEKRNKD YCRF R HGHNTS+CWELKRQIEDLIQDG FKKFVGKPRSNSVEK EERKRSR   RR+DRPAVI TIF GPSGGQS 
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        NKR ELAR ARR+VCIIREQ+PT  I+F +ADLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSL TYL LGWTRSQLKKS TPLVGFSGES++
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWD
         EGCIDLPVT GQD +QV++  EFVVIDGRSAYNAIF                               GEQ  SRECYASALKGSSVCALE+QAS D
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWD

A0A6J1DPX9 uncharacterized protein LOC111022006 | 9.9e-131 | 72.32
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRGDPEK NKD              NCWELKRQIE+LIQDG FKKFVGKPRSNSVEKKEERKRSR  PRRDDRPAVI TIF GPSGGQ G
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        NKR +LAR  RREVCIIREQKPTC I+FG+ADLEGVHLPHNDALVIAPLIDH+LVRRVL+DGGASANI SLPTYLALGWTRSQLKKSPTPLVGFSGESVS
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDL VTIGQD +QV+Q  EFVVID +SAYNAIF                               GEQ+TSR+CYAS LKG +VC LEEQ +  +L 
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIGTKLGATAREELINFLRSH
          EADLPK  KR+FS P EELELVPLLSPE+ V+IGTKL AT R+ELINFLRS+
Subjt:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVSIGTKLGATAREELINFLRSH

A0A6J1DT04 uncharacterized protein LOC111022880 | 2.3e-119 | 70.06
Query:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG
        MEKLLKRPEKLRG PE+R+KD YCRF R HGHNTS+CWELKRQIEDLIQDG FKKFVGKP ++S EKKEERKRSR  PRR DRPAVI TIF GPSGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ PTC I+F  ADLE VHLPHNDAL+IA LIDHV+VRRVLV+GGASANILSLPTYLALGWTRSQL++SPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVS

Query:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP
         EGCIDLPVT+GQ+ ++++Q  EFVV+DGRS YNAIF                               GEQ  SRECYA+ALKG SVCALE     D   
Subjt:  LEGCIDLPVTIGQDDSQVSQTVEFVVIDGRSAYNAIF-------------------------------GEQRTSRECYASALKGSSVCALEEQASWDELP

Query:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVS
        + EA+LP   ++EF+AP EELELVPLLSPE+Q++
Subjt:  KPEADLPKSGKREFSAPIEELELVPLLSPERQVS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGAAAAGCTCCTTAAACGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAATAAAGATAATTATTGTCGTTTTCTTCGTTATCACGGCCATAATACATCGAATTG
CTGGGAATTGAAGCGCCAAATTGAAGACCTCATTCAAGATGGCTGCTTCAAAAAATTTGTTGGAAAACCGAGGTCCAACTCGGTAGAGAAGAAAGAAGAGAGGAAGCGTT
CAAGGATGGCACCTCGACGAGATGACCGACCTGCAGTCATCAAGACTATTTTCAGGGGCCCTAGTGGGGGCCAGTCTGGAAATAAAAGGAAAGAGCTGGCTCGCGAGGCC
AGGCGCGAGGTATGCATCATCAGGGAGCAGAAACCAACTTGCTCCATTAGCTTCGGCAATGCCGATCTAGAGGGGGTACATTTGCCTCATAATGATGCACTTGTGATCGC
CCCCCTTATCGATCATGTCCTGGTCAGAAGAGTGCTAGTAGATGGAGGCGCGTCTGCTAACATCTTGTCCCTTCCGACATACCTAGCCTTGGGGTGGACCAGATCACAGC
TGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCAGTCTCTCTAGAAGGGTGTATCGACTTGCCGGTTACAATTGGGCAGGATGATTCACAGGTATCCCAA
ACGGTCGAGTTCGTTGTAATCGATGGAAGGTCGGCCTACAACGCCATCTTCGGAGAGCAAAGAACTTCAAGGGAGTGCTACGCCTCCGCGCTCAAAGGATCGTCAGTATG
CGCCCTAGAGGAACAAGCTAGTTGGGACGAGTTGCCAAAGCCCGAGGCCGACCTACCGAAATCAGGTAAAAGAGAATTCTCAGCACCAATAGAGGAGCTCGAGCTTGTTC
CTTTGCTTAGTCCTGAGAGACAAGTAAGTATAGGAACCAAGCTAGGGGCCACTGCTAGGGAGGAGCTGATCAACTTCCTCAGATCGCACGTTGCCCAGTTCAAGACTTAC
GAGGTGAATCAAGTGCCAAGGTCAGAAAATTCCAATGCAGACCCCTTAGCAAATTGGCATCAGCATTTTAGACCGACCTAG
mRNA sequence
ATGGAAAAGCTCCTTAAACGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAATAAAGATAATTATTGTCGTTTTCTTCGTTATCACGGCCATAATACATCGAATTG
CTGGGAATTGAAGCGCCAAATTGAAGACCTCATTCAAGATGGCTGCTTCAAAAAATTTGTTGGAAAACCGAGGTCCAACTCGGTAGAGAAGAAAGAAGAGAGGAAGCGTT
CAAGGATGGCACCTCGACGAGATGACCGACCTGCAGTCATCAAGACTATTTTCAGGGGCCCTAGTGGGGGCCAGTCTGGAAATAAAAGGAAAGAGCTGGCTCGCGAGGCC
AGGCGCGAGGTATGCATCATCAGGGAGCAGAAACCAACTTGCTCCATTAGCTTCGGCAATGCCGATCTAGAGGGGGTACATTTGCCTCATAATGATGCACTTGTGATCGC
CCCCCTTATCGATCATGTCCTGGTCAGAAGAGTGCTAGTAGATGGAGGCGCGTCTGCTAACATCTTGTCCCTTCCGACATACCTAGCCTTGGGGTGGACCAGATCACAGC
TGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCAGTCTCTCTAGAAGGGTGTATCGACTTGCCGGTTACAATTGGGCAGGATGATTCACAGGTATCCCAA
ACGGTCGAGTTCGTTGTAATCGATGGAAGGTCGGCCTACAACGCCATCTTCGGAGAGCAAAGAACTTCAAGGGAGTGCTACGCCTCCGCGCTCAAAGGATCGTCAGTATG
CGCCCTAGAGGAACAAGCTAGTTGGGACGAGTTGCCAAAGCCCGAGGCCGACCTACCGAAATCAGGTAAAAGAGAATTCTCAGCACCAATAGAGGAGCTCGAGCTTGTTC
CTTTGCTTAGTCCTGAGAGACAAGTAAGTATAGGAACCAAGCTAGGGGCCACTGCTAGGGAGGAGCTGATCAACTTCCTCAGATCGCACGTTGCCCAGTTCAAGACTTAC
GAGGTGAATCAAGTGCCAAGGTCAGAAAATTCCAATGCAGACCCCTTAGCAAATTGGCATCAGCATTTTAGACCGACCTAG
Protein sequence
MEKLLKRPEKLRGDPEKRNKDNYCRFLRYHGHNTSNCWELKRQIEDLIQDGCFKKFVGKPRSNSVEKKEERKRSRMAPRRDDRPAVIKTIFRGPSGGQSGNKRKELAREA
RREVCIIREQKPTCSISFGNADLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSLEGCIDLPVTIGQDDSQVSQ
TVEFVVIDGRSAYNAIFGEQRTSRECYASALKGSSVCALEEQASWDELPKPEADLPKSGKREFSAPIEELELVPLLSPERQVSIGTKLGATAREELINFLRSHVAQFKTY
EVNQVPRSENSNADPLANWHQHFRPT
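
As a quick consistency check between the CDS and protein listings, the final line of the CDS can be translated and compared with the final line of the protein sequence. This is a minimal sketch with two assumptions: the standard nuclear genetic code applies, and the last CDS line begins on a codon boundary. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Final line of the CDS listing and final line of the protein listing.
last_cds_line = (
    "GAGGTGAATCAAGTGCCAAGGTCAGAAAATTCCAATGCAGACCCC"
    "TTAGCAAATTGGCATCAGCATTTTAGACCGACCTAG"
)
last_protein_line = "EVNQVPRSENSNADPLANWHQHFRPT"

peptide = translate(last_cds_line)
# The CDS ends with a TAG stop codon, which the protein listing omits.
print(peptide == last_protein_line + "*")  # True
```

The same `translate` function could be applied to the full CDS after joining its lines into one string.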