; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g30530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g30530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:22966771..22968138
RNA-Seq ExpressionMoc06g30530
SyntenyMoc06g30530
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152029.1 uncharacterized protein LOC111019838 [Momordica charantia]4.2e-7356Show/hide
Query:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN
        H +      +C+ L+ +IE+LI+ G+ KK+VGK  S S +          +++ +S T  RRE    RP VINTIFGGPSGG S NKR  L R    +V 
Subjt:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN

Query:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD
            Q PT  I F S DLE VHLPHNDALV++P+ID+V VRRVLVDGGAS NILSL TYL LGW +SQLK+  TPLVGFSGES   EGC DLPV  G+  
Subjt:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD

Query:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV
        ++V K+ EFVV+DG  AYNAI GRP IH  +AVPST HQ++KY T +GVGI++GEQ ASRECY +A+KG+  CA+
Subjt:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.8e-7456.73Show/hide
Query:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN
        H          + L+ +IE+LI+ G+ KK+VGK  + S++ K +  RS      ++P    R TD  RP VINTIFGGPSGG SG+KRK L R    EV 
Subjt:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN

Query:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD
            Q PT  I F   DL  VHLPHNDALV++P+ID+V VRRVLVDGGAS NILSL TYLALGW +SQLK+ PTPLVGFSGES V EGC DLPV +G+  
Subjt:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD

Query:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV
        ++V ++ EFVVVDG  AYNAI GRP IH  +A+PST HQ++KY T NGVG ++GEQ ASRECY + +KGT  CA+
Subjt:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]5.0e-7450.42Show/hide
Query:  TPSEAIQRARQYMHAEDVLRAK--HVQHQIVISRHAQ-ARAQSSQSKNGYS----SRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKD
        T +E +Q A++ +  +++LR K    + QI   R +Q  R   S+SK+  S    SR    R  +  +RSR    C+ L+ +IE+LI+  + KK+VGK  
Subjt:  TPSEAIQRARQYMHAEDVLRAK--HVQHQIVISRHAQ-ARAQSSQSKNGYS----SRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKD

Query:  SCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVNTSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPII
        S S + K        +++ +S T  RRE    RP VINTIFGGPSGG   NKRK L  E   +V+    Q PT  I F  TDLEGVHLPHNDALV++P+I
Subjt:  SCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVNTSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPII

Query:  DNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPS
        D+V VRRVLVDGGAS NILSL TYLAL   +SQLK+ PTPLVGFS ES   EGC DLPV IG+  ++V ++ EFVV+DG LAYNAI  RP IH  QAVPS
Subjt:  DNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPS

Query:  TYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVEMTSGDDV
          HQ++KY T NGVG ++GEQK SRECY +A+K +  CA+       E TS DD+
Subjt:  TYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVEMTSGDDV

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]9.5e-7354.91Show/hide
Query:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN
        H         C+ L+ +IE+LI+ G+ KK+VGK  + S++ K +  RS      ++P    R TD  RP VINTIFGGPSGG SG+KRK L R    EV 
Subjt:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN

Query:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD
            Q PT  I F   DLE VHLPHNDAL+++ +ID+V VRRVLV+GGAS NILSL TYLALGW +SQL++ PTPLVGFSGES + EGC DLPV +G+  
Subjt:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD

Query:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV
        +++ ++ EFVVVDG   YNAI GRP IH  +A+PST HQ++KY T NGVG ++GEQ ASRECY  A+KG   CA+
Subjt:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV

XP_022156175.1 uncharacterized protein LOC111023128 [Momordica charantia]1.0e-11973.77Show/hide
Query:  SQSKNGYSSRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGG
        S++K+    R+       H +       C++LRD+IENLIR GHLKKYVGKKDSCSSQGKRKF+R++ DDQDKSP+ K+ E   KRPVVINTIFGGPSGG
Subjt:  SQSKNGYSSRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGG

Query:  HSGNKRKALIRETSHEVNTSYIQPTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGES
         SGNKRKALIRETSHEVNTSY++ TV I FS+ DLEGVHLPHNDALV+SPIIDN+QV+ VL+DG ASTNILSLSTYLALGWEKSQLK+CPTPLVGFSGE 
Subjt:  HSGNKRKALIRETSHEVNTSYIQPTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGES

Query:  FVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVE
          AEGCTDLPV IGE D+KVRK+ EFV+VDGA AYNAILGRPYIHELQ VPSTYHQ+MKY T  GV IIKGEQKASRECY TA+KGTRT AV+ G +S E
Subjt:  FVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVE

Query:  MTSGD
        +T GD
Subjt:  MTSGD

TrEMBL top hitse value%identityAlignment
A0A6J1DD03 uncharacterized protein LOC1110198991.9e-7456.73Show/hide
Query:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN
        H          + L+ +IE+LI+ G+ KK+VGK  + S++ K +  RS      ++P    R TD  RP VINTIFGGPSGG SG+KRK L R    EV 
Subjt:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN

Query:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD
            Q PT  I F   DL  VHLPHNDALV++P+ID+V VRRVLVDGGAS NILSL TYLALGW +SQLK+ PTPLVGFSGES V EGC DLPV +G+  
Subjt:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD

Query:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV
        ++V ++ EFVVVDG  AYNAI GRP IH  +A+PST HQ++KY T NGVG ++GEQ ASRECY + +KGT  CA+
Subjt:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV

A0A6J1DET8 uncharacterized protein LOC1110198382.1e-7356Show/hide
Query:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN
        H +      +C+ L+ +IE+LI+ G+ KK+VGK  S S +          +++ +S T  RRE    RP VINTIFGGPSGG S NKR  L R    +V 
Subjt:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN

Query:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD
            Q PT  I F S DLE VHLPHNDALV++P+ID+V VRRVLVDGGAS NILSL TYL LGW +SQLK+  TPLVGFSGES   EGC DLPV  G+  
Subjt:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD

Query:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV
        ++V K+ EFVV+DG  AYNAI GRP IH  +AVPST HQ++KY T +GVGI++GEQ ASRECY +A+KG+  CA+
Subjt:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV

A0A6J1DPC9 uncharacterized protein LOC1110222802.4e-7450.42Show/hide
Query:  TPSEAIQRARQYMHAEDVLRAK--HVQHQIVISRHAQ-ARAQSSQSKNGYS----SRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKD
        T +E +Q A++ +  +++LR K    + QI   R +Q  R   S+SK+  S    SR    R  +  +RSR    C+ L+ +IE+LI+  + KK+VGK  
Subjt:  TPSEAIQRARQYMHAEDVLRAK--HVQHQIVISRHAQ-ARAQSSQSKNGYS----SRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKD

Query:  SCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVNTSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPII
        S S + K        +++ +S T  RRE    RP VINTIFGGPSGG   NKRK L  E   +V+    Q PT  I F  TDLEGVHLPHNDALV++P+I
Subjt:  SCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVNTSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPII

Query:  DNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPS
        D+V VRRVLVDGGAS NILSL TYLAL   +SQLK+ PTPLVGFS ES   EGC DLPV IG+  ++V ++ EFVV+DG LAYNAI  RP IH  QAVPS
Subjt:  DNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPS

Query:  TYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVEMTSGDDV
          HQ++KY T NGVG ++GEQK SRECY +A+K +  CA+       E TS DD+
Subjt:  TYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVEMTSGDDV

A0A6J1DPJ9 uncharacterized protein LOC1110231285.0e-12073.77Show/hide
Query:  SQSKNGYSSRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGG
        S++K+    R+       H +       C++LRD+IENLIR GHLKKYVGKKDSCSSQGKRKF+R++ DDQDKSP+ K+ E   KRPVVINTIFGGPSGG
Subjt:  SQSKNGYSSRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGG

Query:  HSGNKRKALIRETSHEVNTSYIQPTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGES
         SGNKRKALIRETSHEVNTSY++ TV I FS+ DLEGVHLPHNDALV+SPIIDN+QV+ VL+DG ASTNILSLSTYLALGWEKSQLK+CPTPLVGFSGE 
Subjt:  HSGNKRKALIRETSHEVNTSYIQPTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGES

Query:  FVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVE
          AEGCTDLPV IGE D+KVRK+ EFV+VDGA AYNAILGRPYIHELQ VPSTYHQ+MKY T  GV IIKGEQKASRECY TA+KGTRT AV+ G +S E
Subjt:  FVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVE

Query:  MTSGD
        +T GD
Subjt:  MTSGD

A0A6J1DT04 uncharacterized protein LOC1110228804.6e-7354.91Show/hide
Query:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN
        H         C+ L+ +IE+LI+ G+ KK+VGK  + S++ K +  RS      ++P    R TD  RP VINTIFGGPSGG SG+KRK L R    EV 
Subjt:  HDNRSRQERTCYNLRDRIENLIRRGHLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVN

Query:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD
            Q PT  I F   DLE VHLPHNDAL+++ +ID+V VRRVLV+GGAS NILSL TYLALGW +SQL++ PTPLVGFSGES + EGC DLPV +G+  
Subjt:  TSYIQ-PTVKIYFSSTDLEGVHLPHNDALVVSPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGD

Query:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV
        +++ ++ EFVVVDG   YNAI GRP IH  +A+PST HQ++KY T NGVG ++GEQ ASRECY  A+KG   CA+
Subjt:  SKVRKITEFVVVDGALAYNAILGRPYIHELQAVPSTYHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATTCACGTCGGCTATAAATACTCCAGACCTAGTTCATTCCTTCGCCATTCAGCCAACTCAGACTCCTAGTGAAGCAATTCAGCGAGCACGTCAATATATG
CATGCAGAGGATGTGCTCCGAGCAAAACATGTCCAGCATCAAATAGTAATATCGAGGCACGCACAAGCTCGGGCGCAGAGCTCACAGTCGAAAAATGGCTATTCA
AGCCGTGACCACGAACCAAGAAAGTCAGCTCACGATAACCGATCGCGACAAGAGCGGACGTGCTATAACTTGAGAGACCGGATTGAGAATCTAATCAGGCGAGGT
CATCTGAAAAAATACGTAGGAAAGAAGGACTCATGCTCTTCTCAAGGAAAAAGGAAGTTCAACCGATCAGATCGTGATGATCAGGACAAGTCTCCTACGTCTAAG
AGGAGAGAAACGGACGGAAAGAGGCCAGTGGTGATTAACACCATTTTCGGTGGCCCAAGCGGAGGACATTCAGGCAACAAGAGGAAAGCCCTCATCAGAGAAACC
AGTCATGAAGTCAACACTTCATATATCCAACCAACAGTGAAAATTTATTTTTCTTCAACCGACTTGGAAGGAGTTCATTTACCACACAATGATGCATTAGTGGTT
TCACCAATAATTGATAATGTACAGGTCCGGCGAGTTCTTGTTGATGGAGGGGCGTCTACCAATATACTTTCACTCTCAACCTATCTAGCCCTAGGTTGGGAAAAG
TCACAATTGAAGCAATGTCCCACACCCCTAGTCGGTTTTTCAGGGGAATCATTCGTGGCAGAGGGTTGCACCGACCTACCAGTAATCATTGGTGAAGGCGATAGC
AAAGTTCGAAAAATAACTGAGTTCGTCGTTGTGGATGGCGCATTGGCATATAATGCAATTCTAGGGCGACCATATATCCACGAGCTGCAAGCTGTCCCGTCCACT
TATCACCAAATGATGAAATACCTAACCACTAATGGTGTTGGTATCATTAAAGGCGAGCAGAAAGCTTCACGAGAATGTTATACCACAGCAATAAAAGGTACCCGA
ACTTGTGCAGTGGTCTTTGGGAATTCATCTGTCGAGATGACCTCAGGTGATGATGTGGCCCACTTTGGGAACATCAAAGCAAGAACTCGAGCAAATTCAGCTGGG
GTTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACATTCACGTCGGCTATAAATACTCCAGACCTAGTTCATTCCTTCGCCATTCAGCCAACTCAGACTCCTAGTGAAGCAATTCAGCGAGCACGTCAATATATG
CATGCAGAGGATGTGCTCCGAGCAAAACATGTCCAGCATCAAATAGTAATATCGAGGCACGCACAAGCTCGGGCGCAGAGCTCACAGTCGAAAAATGGCTATTCA
AGCCGTGACCACGAACCAAGAAAGTCAGCTCACGATAACCGATCGCGACAAGAGCGGACGTGCTATAACTTGAGAGACCGGATTGAGAATCTAATCAGGCGAGGT
CATCTGAAAAAATACGTAGGAAAGAAGGACTCATGCTCTTCTCAAGGAAAAAGGAAGTTCAACCGATCAGATCGTGATGATCAGGACAAGTCTCCTACGTCTAAG
AGGAGAGAAACGGACGGAAAGAGGCCAGTGGTGATTAACACCATTTTCGGTGGCCCAAGCGGAGGACATTCAGGCAACAAGAGGAAAGCCCTCATCAGAGAAACC
AGTCATGAAGTCAACACTTCATATATCCAACCAACAGTGAAAATTTATTTTTCTTCAACCGACTTGGAAGGAGTTCATTTACCACACAATGATGCATTAGTGGTT
TCACCAATAATTGATAATGTACAGGTCCGGCGAGTTCTTGTTGATGGAGGGGCGTCTACCAATATACTTTCACTCTCAACCTATCTAGCCCTAGGTTGGGAAAAG
TCACAATTGAAGCAATGTCCCACACCCCTAGTCGGTTTTTCAGGGGAATCATTCGTGGCAGAGGGTTGCACCGACCTACCAGTAATCATTGGTGAAGGCGATAGC
AAAGTTCGAAAAATAACTGAGTTCGTCGTTGTGGATGGCGCATTGGCATATAATGCAATTCTAGGGCGACCATATATCCACGAGCTGCAAGCTGTCCCGTCCACT
TATCACCAAATGATGAAATACCTAACCACTAATGGTGTTGGTATCATTAAAGGCGAGCAGAAAGCTTCACGAGAATGTTATACCACAGCAATAAAAGGTACCCGA
ACTTGTGCAGTGGTCTTTGGGAATTCATCTGTCGAGATGACCTCAGGTGATGATGTGGCCCACTTTGGGAACATCAAAGCAAGAACTCGAGCAAATTCAGCTGGG
GTTGGATGA
Protein sequenceShow/hide protein sequence
MTFTSAINTPDLVHSFAIQPTQTPSEAIQRARQYMHAEDVLRAKHVQHQIVISRHAQARAQSSQSKNGYSSRDHEPRKSAHDNRSRQERTCYNLRDRIENLIRRG
HLKKYVGKKDSCSSQGKRKFNRSDRDDQDKSPTSKRRETDGKRPVVINTIFGGPSGGHSGNKRKALIRETSHEVNTSYIQPTVKIYFSSTDLEGVHLPHNDALVV
SPIIDNVQVRRVLVDGGASTNILSLSTYLALGWEKSQLKQCPTPLVGFSGESFVAEGCTDLPVIIGEGDSKVRKITEFVVVDGALAYNAILGRPYIHELQAVPST
YHQMMKYLTTNGVGIIKGEQKASRECYTTAIKGTRTCAVVFGNSSVEMTSGDDVAHFGNIKARTRANSAGVG