; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g06150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g06150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptioncell wall / vacuolar inhibitor of fructosidase 2
Genome locationchr9:4779277..4782436
RNA-Seq ExpressionMoc09g06150
SyntenyMoc09g06150
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0004857 - enzyme inhibitor activity (molecular function)
InterPro domainsIPR006501 - Pectinesterase inhibitor domain
IPR035513 - Invertase/pectin methylesterase inhibitor domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046709.1 putative transmembrane protein [Cucumis melo var. makuwa]2.0e-8372.34Show/hide
Query:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA
        ++ KGYEIEYMALRE DL FD E+GG+I +E GS EP+S K + KN+WSR TE  L KDER +  N NFAN+V ++IADE++ LLIDK LEGED  EV A
Subjt:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA

Query:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG
         +EK+N RGKHKNKKKALKPPRPPKGPSLDAADRM+VKE+AVLAMKKRAR ERMKALKK KAEKTSS +SCIPA+IITFLFFLVIIIQGIS RSSS+LQG
Subjt:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG

Query:  SPGPGPSVGGSSGFISIQYINSLPPNQSDIPDHSS
        S  P P+VGGSSGFIS+QYI S PP++SD+ +  S
Subjt:  SPGPGPSVGGSSGFISIQYINSLPPNQSDIPDHSS

TYK18245.1 putative transmembrane protein [Cucumis melo var. makuwa]2.0e-8373.48Show/hide
Query:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA
        ++ KGYEIEYMALRE DL FD E+GG+I +E GS EP+S K + KN+WSR TE  L KDER +  N NFAN+V ++IADE++ LLIDK LEGED  EV A
Subjt:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA

Query:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG
         +EK+N RGKHKNKKKALKPPRPPKGPSLDAADRM+VKE+AVLAMKKRAR ERMKALKK KAEKTSS +SCIPA+IITFLFFLVIIIQGIS RSSS+LQG
Subjt:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG

Query:  SPGPGPSVGGSSGFISIQYINSLPPNQSDI
        S  P P+VGGSSGFIS+QYI S PP++SD+
Subjt:  SPGPGPSVGGSSGFISIQYINSLPPNQSDI

XP_022153049.1 uncharacterized protein LOC111020642 [Momordica charantia]1.2e-115100Show/hide
Query:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
        MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
Subjt:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK

Query:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
        HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
Subjt:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG

Query:  SSGFISIQYINSLPPNQSDIPDHSSV
        SSGFISIQYINSLPPNQSDIPDHSSV
Subjt:  SSGFISIQYINSLPPNQSDIPDHSSV

XP_022991807.1 uncharacterized protein LOC111488345 [Cucurbita maxima]2.3e-7974.78Show/hide
Query:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
        MALRE DL FD ESGGRIG+EVGS EP+SIK + KN+W+R TE  L KDE  V  N NFANSV +V+AD N+ELLIDK LEGED  E  A VEK N RGK
Subjt:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK

Query:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
        HKNKKKALKPPRPPKGPSLDAADR LVKEIAV+AMKKRARVERMKAL+K KAEKTSS +SCIPAMIITFLFFLVII+QGISSRSS MLQGS  P P+V G
Subjt:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG

Query:  SSGFISIQYINSLPPNQSDIPDHSSV
        SSGFIS+QYI S PPN+S++ +   V
Subjt:  SSGFISIQYINSLPPNQSDIPDHSSV

XP_038898210.1 uncharacterized protein LOC120085950 [Benincasa hispida]2.1e-8076Show/hide
Query:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
        MALRE DL  D ESGG+I +EVGS EP+SIK + KN+WSR T+  L KDER V  N NFANSV  +IADENIE+LIDK LEGED  EV   +EKNN RGK
Subjt:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK

Query:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
        HKNKKKA KPPRPPKGPSLDAADRM+VKEIAVLAMKKRAR ERMKALKK KAEKTSS +SCIPAMIITFLFFLVIIIQGISSRSSS+LQGS  P P+VGG
Subjt:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG

Query:  SSGFISIQYINSLPPNQSDIPDHSS
        SSGFIS+QYI S PPN+S+I +  S
Subjt:  SSGFISIQYINSLPPNQSDIPDHSS

TrEMBL top hitse value%identityAlignment
A0A5A7TZQ1 Putative transmembrane protein9.8e-8472.34Show/hide
Query:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA
        ++ KGYEIEYMALRE DL FD E+GG+I +E GS EP+S K + KN+WSR TE  L KDER +  N NFAN+V ++IADE++ LLIDK LEGED  EV A
Subjt:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA

Query:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG
         +EK+N RGKHKNKKKALKPPRPPKGPSLDAADRM+VKE+AVLAMKKRAR ERMKALKK KAEKTSS +SCIPA+IITFLFFLVIIIQGIS RSSS+LQG
Subjt:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG

Query:  SPGPGPSVGGSSGFISIQYINSLPPNQSDIPDHSS
        S  P P+VGGSSGFIS+QYI S PP++SD+ +  S
Subjt:  SPGPGPSVGGSSGFISIQYINSLPPNQSDIPDHSS

A0A5D3D3X4 Putative transmembrane protein9.8e-8473.48Show/hide
Query:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA
        ++ KGYEIEYMALRE DL FD E+GG+I +E GS EP+S K + KN+WSR TE  L KDER +  N NFAN+V ++IADE++ LLIDK LEGED  EV A
Subjt:  ISQKGYEIEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVA

Query:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG
         +EK+N RGKHKNKKKALKPPRPPKGPSLDAADRM+VKE+AVLAMKKRAR ERMKALKK KAEKTSS +SCIPA+IITFLFFLVIIIQGIS RSSS+LQG
Subjt:  LVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQG

Query:  SPGPGPSVGGSSGFISIQYINSLPPNQSDI
        S  P P+VGGSSGFIS+QYI S PP++SD+
Subjt:  SPGPGPSVGGSSGFISIQYINSLPPNQSDI

A0A6J1DHV4 uncharacterized protein LOC1110206425.7e-116100Show/hide
Query:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
        MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
Subjt:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK

Query:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
        HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
Subjt:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG

Query:  SSGFISIQYINSLPPNQSDIPDHSSV
        SSGFISIQYINSLPPNQSDIPDHSSV
Subjt:  SSGFISIQYINSLPPNQSDIPDHSSV

A0A6J1GR01 uncharacterized protein LOC1114566474.3e-7974.34Show/hide
Query:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
        MALRE DL FD ESGGRIG EVGS EP+SIK + KN+W+R TE  L KDE  V  N NFANSV +V+AD N+ELLIDK LEGED  E  A VEK N RGK
Subjt:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK

Query:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
        HKNKKKALKPPRPPKGP+LDAADR LVKEIAV+AMKKRARVERMKAL+K KAEKTSS +SCIPAMIITFLFFLVII+QGISSRSS MLQGS  P P+V G
Subjt:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG

Query:  SSGFISIQYINSLPPNQSDIPDHSSV
        SSGFIS+QYI S PPN+S++ +   V
Subjt:  SSGFISIQYINSLPPNQSDIPDHSSV

A0A6J1JVV1 uncharacterized protein LOC1114883451.1e-7974.78Show/hide
Query:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK
        MALRE DL FD ESGGRIG+EVGS EP+SIK + KN+W+R TE  L KDE  V  N NFANSV +V+AD N+ELLIDK LEGED  E  A VEK N RGK
Subjt:  MALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGK

Query:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG
        HKNKKKALKPPRPPKGPSLDAADR LVKEIAV+AMKKRARVERMKAL+K KAEKTSS +SCIPAMIITFLFFLVII+QGISSRSS MLQGS  P P+V G
Subjt:  HKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGG

Query:  SSGFISIQYINSLPPNQSDIPDHSSV
        SSGFIS+QYI S PPN+S++ +   V
Subjt:  SSGFISIQYINSLPPNQSDIPDHSSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein2.3e-2438.33Show/hide
Query:  IEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNV
        ++ M   E+DL+ D E+    G+   ++E  S  +    VWS                  NF   V E IAD+    LI      E   + + L EK   
Subjt:  IEYMALREEDLKFDPESGGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNV

Query:  RGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERM-KALKKVKAEKTSSLSSCIP--AMIITFLFFLVIIIQGISSRSSSMLQGSPGP
         GK K  +KA KPPRPPKGPSL   DR ++++I  LAM+KRAR+ERM K+LK++KA KTS  S CI   +MIIT +FF  ++ QG S+ SSSM      P
Subjt:  RGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERM-KALKKVKAEKTSSLSSCIP--AMIITFLFFLVIIIQGISSRSSSMLQGSPGP

Query:  GPSVGGSSGFISIQYINSLPPNQSDIP
         P+V  ++  IS+Q+ N   P +   P
Subjt:  GPSVGGSSGFISIQYINSLPPNQSDIP

AT3G17120.1 unknown protein1.7e-1947.62Show/hide
Query:  KHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSC---IPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGP
        K K KK A KPPRPP+GPSLDAAD+ L++EIA LAM KRAR+ERM+ALKK +A K +S +S    + A + T +FF V++ QG+S R++    GS G   
Subjt:  KHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSC---IPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGP

Query:  SV---GGSSGFISIQYINSLPPNQSD
         V     + GF+S+QY  +   ++ D
Subjt:  SV---GGSSGFISIQYINSLPPNQSD

AT3G17120.2 unknown protein1.7e-1947.62Show/hide
Query:  KHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSC---IPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGP
        K K KK A KPPRPP+GPSLDAAD+ L++EIA LAM KRAR+ERM+ALKK +A K +S +S    + A + T +FF V++ QG+S R++    GS G   
Subjt:  KHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSC---IPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGP

Query:  SV---GGSSGFISIQYINSLPPNQSD
         V     + GF+S+QY  +   ++ D
Subjt:  SV---GGSSGFISIQYINSLPPNQSD

AT4G01960.1 unknown protein4.0e-2138.27Show/hide
Query:  NGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKT
        +G  +    E  AD+ ++ L+ +G   E   + + L +      K K  +K  KPPRPPKGP L A D+ L++EI  LAM+KRAR+ERMK L+++KA K+
Subjt:  NGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADRMLVKEIAVLAMKKRARVERMKALKKVKAEKT

Query:  SSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGGSSGFISIQYINSLPPNQ
        SS  S I AMI+T +FF+ +I QG  + S++ L     P P+   ++  +S+Q+ N   P +
Subjt:  SSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGGSSGFISIQYINSLPPNQ

AT5G64620.1 cell wall / vacuolar inhibitor of fructosidase 24.5e-0438.46Show/hide
Query:  SEDQTANLITETCSKTSHVDLCISSLGVYPRSPTADVRRLAEIILKETLANA
        ++  T  +I  TC  T++   C+S+L   PRSPTAD + LA I++   + NA
Subjt:  SEDQTANLITETCSKTSHVDLCISSLGVYPRSPTADVRRLAEIILKETLANA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCTTCTTCATTTGCCCGCAGATGATCAGAAGAACAGTGGCGAAATCGATGTGGATCATGTCGGTTTGAGCAGAAAAACGCTTGAATTTGGGTATGTTTGTACTGA
ATCCTTGCTATTGGGTTCTGGGTTCCTCGGTTCGAGGGATTCTAGTGGGAGTGTTTTTGTCTGTTTTGTTTTTGCCGTTGCTTTTAAGGCCATTGTTTCAGCGATTTGGT
GGACAGACATTTGCCCCGGGAACAGACCTTTATCTTTGATTAGTCAGAAAGGCTATGAGATTGAGTATATGGCTTTAAGAGAAGAGGACCTCAAATTCGATCCTGAAAGC
GGGGGGAGGATTGGCCAAGAGGTTGGGAGCAAAGAACCAAATTCAATAAAGATAGAGGAGAAGAATGTTTGGAGTAGGTTTACAGAGGCTGGACTGCCAAAGGATGAACG
AGGTGTCACCTTAAACGGTAATTTTGCGAATTCTGTTGGCGAGGTCATTGCTGATGAGAACATAGAATTGTTAATAGATAAGGGTTTGGAAGGAGAAGATGGTCGTGAAG
TTGTTGCGCTTGTGGAGAAGAATAATGTGAGAGGGAAGCATAAGAATAAGAAAAAGGCTCTAAAGCCACCGCGACCACCTAAAGGTCCTTCACTTGACGCTGCTGATAGA
ATGCTGGTGAAGGAAATTGCAGTGCTTGCTATGAAAAAGCGTGCAAGAGTCGAGCGAATGAAAGCATTAAAGAAGGTGAAAGCAGAGAAAACGTCATCTCTGAGCAGTTG
CATACCTGCCATGATTATTACATTCCTCTTCTTCCTTGTCATAATCATTCAAGGAATAAGCTCCAGAAGTAGTTCGATGTTGCAGGGGTCTCCCGGACCCGGACCTTCTG
TCGGTGGTAGTAGCGGTTTCATTTCCATTCAGTACATAAATAGCCTTCCCCCAAATCAAAGCGATATACCTGATCACTCCTCTGTTAAGATGTTCGATCTGTTCTCCGAG
GATCAAACAGCCAATCTCATTACAGAAACATGTTCGAAGACATCCCATGTTGACCTTTGCATATCGAGCCTTGGGGTCTACCCTCGAAGCCCGACGGCCGATGTTAGGCG
TCTTGCCGAGATCATCCTCAAAGAGACATTAGCGAACGCCCAAGAAATTCTGGCACAGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGCTTCTTCATTTGCCCGCAGATGATCAGAAGAACAGTGGCGAAATCGATGTGGATCATGTCGGTTTGAGCAGAAAAACGCTTGAATTTGGGTATGTTTGTACTGA
ATCCTTGCTATTGGGTTCTGGGTTCCTCGGTTCGAGGGATTCTAGTGGGAGTGTTTTTGTCTGTTTTGTTTTTGCCGTTGCTTTTAAGGCCATTGTTTCAGCGATTTGGT
GGACAGACATTTGCCCCGGGAACAGACCTTTATCTTTGATTAGTCAGAAAGGCTATGAGATTGAGTATATGGCTTTAAGAGAAGAGGACCTCAAATTCGATCCTGAAAGC
GGGGGGAGGATTGGCCAAGAGGTTGGGAGCAAAGAACCAAATTCAATAAAGATAGAGGAGAAGAATGTTTGGAGTAGGTTTACAGAGGCTGGACTGCCAAAGGATGAACG
AGGTGTCACCTTAAACGGTAATTTTGCGAATTCTGTTGGCGAGGTCATTGCTGATGAGAACATAGAATTGTTAATAGATAAGGGTTTGGAAGGAGAAGATGGTCGTGAAG
TTGTTGCGCTTGTGGAGAAGAATAATGTGAGAGGGAAGCATAAGAATAAGAAAAAGGCTCTAAAGCCACCGCGACCACCTAAAGGTCCTTCACTTGACGCTGCTGATAGA
ATGCTGGTGAAGGAAATTGCAGTGCTTGCTATGAAAAAGCGTGCAAGAGTCGAGCGAATGAAAGCATTAAAGAAGGTGAAAGCAGAGAAAACGTCATCTCTGAGCAGTTG
CATACCTGCCATGATTATTACATTCCTCTTCTTCCTTGTCATAATCATTCAAGGAATAAGCTCCAGAAGTAGTTCGATGTTGCAGGGGTCTCCCGGACCCGGACCTTCTG
TCGGTGGTAGTAGCGGTTTCATTTCCATTCAGTACATAAATAGCCTTCCCCCAAATCAAAGCGATATACCTGATCACTCCTCTGTTAAGATGTTCGATCTGTTCTCCGAG
GATCAAACAGCCAATCTCATTACAGAAACATGTTCGAAGACATCCCATGTTGACCTTTGCATATCGAGCCTTGGGGTCTACCCTCGAAGCCCGACGGCCGATGTTAGGCG
TCTTGCCGAGATCATCCTCAAAGAGACATTAGCGAACGCCCAAGAAATTCTGGCACAGAGTTGA
Protein sequenceShow/hide protein sequence
MLLLHLPADDQKNSGEIDVDHVGLSRKTLEFGYVCTESLLLGSGFLGSRDSSGSVFVCFVFAVAFKAIVSAIWWTDICPGNRPLSLISQKGYEIEYMALREEDLKFDPES
GGRIGQEVGSKEPNSIKIEEKNVWSRFTEAGLPKDERGVTLNGNFANSVGEVIADENIELLIDKGLEGEDGREVVALVEKNNVRGKHKNKKKALKPPRPPKGPSLDAADR
MLVKEIAVLAMKKRARVERMKALKKVKAEKTSSLSSCIPAMIITFLFFLVIIIQGISSRSSSMLQGSPGPGPSVGGSSGFISIQYINSLPPNQSDIPDHSSVKMFDLFSE
DQTANLITETCSKTSHVDLCISSLGVYPRSPTADVRRLAEIILKETLANAQEILAQS