CuGenDBv2

Moc01g04410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g04410
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Plant self-incompatibility protein S1 family
Genome location: chr1:2901416..2916082
RNA-Seq Expression: Moc01g04410
Synteny: Moc01g04410
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0060320 - rejection of self pollen (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR010264 - Plant self-incompatibility S1
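
The genome location above uses a chromosome:start..end notation. As a minimal sketch, the following Python snippet splits such a string into its parts and computes the length of the locus. The function name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr1:2901416..2916082")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)
```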


Homology
GenBank top hits (e value, % identity, alignment)
KAE8650821.1 hypothetical protein Csa_017644 [Cucumis sativus] (e value: 5.9e-30; % identity: 60)
Query:  MVYFNVAEINGAIINESTSS-----SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEV
        +V F ++ + G  +N+  SS     S  + P SNW VTI N+Q  N+TL AHCKSKD+DLG HIIN+ G+Y W FKENF QTTLFWCNF S  GHASFEV
Subjt:  MVYFNVAEINGAIINESTSS-----SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEV

Query:  FWPEKETWLSDRCKKLFICK
        FWPEKE WL  RCK   ICK
Subjt:  FWPEKETWLSDRCKKLFICK

XP_008458671.1 PREDICTED: uncharacterized protein LOC103498000 [Cucumis melo] (e value: 1.0e-29; % identity: 70.79)
Query:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        SKY+LPL++W+VTI+N Q  NA+L  HCKSKD+DLGVH+I N G +Y W FKEN+LQTT FWCNF+S++GHASFEVFWPE  TWLSDRC
Subjt:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

XP_011656368.1 S-protein homolog 1 [Cucumis sativus] (e value: 1.9e-28; % identity: 68.54)
Query:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        SKY+LPL++W VTI+N Q  NA+L  HCKSKD+DLGVH+I N G  Y W FKEN+LQTT +WC+F+SK+GHASFEVFWPE+ TW SDRC
Subjt:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

XP_022137291.1 S-protein homolog 1-like [Momordica charantia] (e value: 3.0e-26; % identity: 61.05)
Query:  INESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        I E    S  + P S+W+VTI N  K +A L  HCKSKD+DLG H+I   G+YEW+FKENF QTTLFWCNF+S  GHAS EVFWPEK  WL+ RC
Subjt:  INESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

XP_038907112.1 S-protein homolog 74-like [Benincasa hispida] (e value: 1.8e-31; % identity: 74.16)
Query:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        SKY+LPL+NW+VTI+N QK NA L  HCKSKD+DLGVH+I N G  Y W FKENFLQTT FWCNF+S++GHASFEVFWPE  TWLSDRC
Subjt:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KA40 S-protein homolog (e value: 9.2e-29; % identity: 68.54)
Query:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        SKY+LPL++W VTI+N Q  NA+L  HCKSKD+DLGVH+I N G  Y W FKEN+LQTT +WC+F+SK+GHASFEVFWPE+ TW SDRC
Subjt:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

A0A1S3C8Z6 S-protein homolog (e value: 4.9e-30; % identity: 70.79)
Query:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        SKY+LPL++W+VTI+N Q  NA+L  HCKSKD+DLGVH+I N G +Y W FKEN+LQTT FWCNF+S++GHASFEVFWPE  TWLSDRC
Subjt:  SKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHII-NVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

A0A6J1C659 S-protein homolog (e value: 1.5e-26; % identity: 61.05)
Query:  INESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC
        I E    S  + P S+W+VTI N  K +A L  HCKSKD+DLG H+I   G+YEW+FKENF QTTLFWCNF+S  GHAS EVFWPEK  WL+ RC
Subjt:  INESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRC

A0A6J1GSD6 S-protein homolog (e value: 3.1e-24; % identity: 59)
Query:  IINESTSSSKYEL--PLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCKK
        IIN   + S   L    SNW+V IVN  K ++TL  HCKSKD+DLG H+I  G KY W+F EN LQTTL+WCNF SK G AS +VFWPEK  WLSDRC +
Subjt:  IINESTSSSKYEL--PLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCKK

A0A6J1GSZ3 S-protein homolog (e value: 1.8e-24; % identity: 60)
Query:  IINESTSSSKYEL--PLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCKK
        IIN   + S   L    SNW+V IVN  K ++TL AHCKSKD+DLG H+I  G KY W+F EN LQTTL+WCNF SK G AS +VFWPEK  WLSDRC +
Subjt:  IINESTSSSKYEL--PLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCKK

SwissProt top hits (e value, % identity, alignment)
F2Q9V4 S-protein homolog 6 (e value: 3.9e-08; % identity: 34.12)
Query:  PLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNF-RSKVGHASFEVFWPEKETWLSDRCK
        P+      +V    N+  L  HCKS+D+D G HI+  GG Y W F  NF+ +TL++C F + +V    F+++   ++   S RC+
Subjt:  PLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNF-RSKVGHASFEVFWPEKETWLSDRCK

F4JLS0 S-protein homolog 1 (e value: 5.2e-13; % identity: 42.47)
Query:  LSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPE
        +S W+VT+VN      TLF HCKSK++DLG   +    ++ W F EN L +T FWC      GH +  VFW +
Subjt:  LSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPE

O23020 S-protein homolog 5 (e value: 7.8e-09; % identity: 30.49)
Query:  WKVTIVNVQK--NNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCK
        W+ T+V +        L  HCKSK +DLG+H++    +Y ++F+ N  ++TLF+C+F+      SF+++  +++  + D C+
Subjt:  WKVTIVNVQK--NNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCK

P0DN92 S-protein homolog 24 (e value: 1.9e-07; % identity: 34.18)
Query:  VTIVNVQKNNATLFA-HCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNF-RSKVGHASFEVFWPEKETWLSDRC
        +T V +Q +N  L   HCKS+D+DLG HI+  G  + W+F  NF  +TL++C F + ++    FE++   ++ +    C
Subjt:  VTIVNVQKNNATLFA-HCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNF-RSKVGHASFEVFWPEKETWLSDRC

Q2HQ46 S-protein homolog 74 (e value: 1.8e-13; % identity: 33.33)
Query:  EHVLSMMVYFNVAEINGAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFE
        + +L++  Y  +   +  +  ++T+       +S W+VT+ N      TLF HCKSK+NDLG   +    ++ W F EN L +TLFWC      GH + +
Subjt:  EHVLSMMVYFNVAEINGAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFE

Query:  VFWPE
        VFW +
Subjt:  VFWPE

Arabidopsis top hits (e value, % identity, alignment)
AT1G04645.1 Plant self-incompatibility protein S1 family (e value: 5.6e-10; % identity: 30.49)
Query:  WKVTIVNVQK--NNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCK
        W+ T+V +        L  HCKSK +DLG+H++    +Y ++F+ N  ++TLF+C+F+      SF+++  +++  + D C+
Subjt:  WKVTIVNVQK--NNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCK

AT1G26795.1 Plant self-incompatibility protein S1 family (e value: 1.4e-08; % identity: 26.67)
Query:  VLSMMVYFNVAEINGAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVF
        +  + ++F + + + ++ N S+      LP +   V I+N     ATL  HC +K  DLGV  +N   ++++ F+ N  +TT + C+F      A+F++F
Subjt:  VLSMMVYFNVAEINGAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVF

Query:  WPEKE
          +++
Subjt:  WPEKE

AT4G16295.1 S-protein homologue 1 (e value: 3.7e-14; % identity: 42.47)
Query:  LSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPE
        +S W+VT+VN      TLF HCKSK++DLG   +    ++ W F EN L +T FWC      GH +  VFW +
Subjt:  LSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPE

AT4G29035.1 Plant self-incompatibility protein S1 family (e value: 1.3e-14; % identity: 33.33)
Query:  EHVLSMMVYFNVAEINGAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFE
        + +L++  Y  +   +  +  ++T+       +S W+VT+ N      TLF HCKSK+NDLG   +    ++ W F EN L +TLFWC      GH + +
Subjt:  EHVLSMMVYFNVAEINGAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFE

Query:  VFWPE
        VFW +
Subjt:  VFWPE

AT5G04350.1 Plant self-incompatibility protein S1 family (e value: 4.7e-09; % identity: 38.36)
Query:  KVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWC------NFRSKVGHASFEVFW
        KV + N  +++  L  HC+SKD+DLG HI+ +G  YE+ F +N  QTT F C      NF+  +   ++E  W
Subjt:  KVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWC------NFRSKVGHASFEVFW


Sequences
CDS sequence
ATGTTTGCACGGGTCAATAATGAGGCTTCATGCTCGGCCTGGTGGTGCCCTGAGAGCATCCTCCCTTCAGAATGTGCTTGCATGAAGCCAATTCCAGTTGCTGAAGTCTA
TATATTGGCCAGCATATTTGATGTATTGGCCAAGAAACATGAGACCATGATCACTGCACGAGACATCATGGGTTCATTATATGAGATGTTTGGGCAACAGTCCTCACAAC
TCAAACATGGAGCTCTCAAGTTCATCTTCAACGCACGGATGAAGGAAGGAAGTTCTGTCTGGGAACATGTTCTTAGTATGATGGTCTATTTCAACGTGGCAGAGATTAAT
GGTGCTATCATAAACGAGTCCACATCATCGTCCAAGTACGAACTTCCGTTGTCCAACTGGAAAGTGACGATTGTGAACGTTCAGAAAAATAACGCAACTCTGTTTGCTCA
TTGCAAGTCCAAAGACAACGATTTAGGTGTGCACATCATCAACGTCGGGGGAAAATACGAGTGGGAATTCAAGGAGAATTTTTTGCAGACGACGTTATTTTGGTGCAACT
TTCGTAGCAAAGTAGGGCATGCTTCATTTGAGGTTTTCTGGCCAGAGAAAGAAACATGGCTTTCTGATCGATGCAAGAAGCTTTTCATATGCAAACTTCTTCTAGAGGGA
GATCTAATGGAAGAAGAGGTGGATGTGGTGGTGGAGGAAATGGAAGGTCCAATGACTTCAAAAGTTTTGAGTCAGAGAGGAGAAATAATCATACTTCTTCGAATGGAGGA
AGGGGAAGAAGTTAAAGCAGAGGAAGAAGTAGAGGTGAATGTCGTGGAGATTTTTCTCACATACAATGCTTCAATTGTGGACGCTACAGGAGATGCTCAAGACTTGGAGC
TTGAACAGACTCAACCACCAACCTCATCTTCATCTTCACTTTCCACAAGTGACGAATCAACTCCACCAAGGAAGAGGATAAATATTCAAGAGATCTGTAATGCTTCAAGA
GCAATACTTGAAGATGATGTTGAGTGTGTTGCCCTAAACTCCGCCTCGGAAAAGTTCCACCGAGGTTCGACCTCTGGAGCCAAGTCTGTGCCATCCTCTTCTGGAAGTAA
GACTTTCAAGAAGAAGAAGGCCACTGGTAATGAGCCTAGACCTGACCCCACTGCTGCCACTGCCAAGAAAGGCAAGACCAAGGTTGCATAG
mRNA sequence
ATGTTTGCACGGGTCAATAATGAGGCTTCATGCTCGGCCTGGTGGTGCCCTGAGAGCATCCTCCCTTCAGAATGTGCTTGCATGAAGCCAATTCCAGTTGCTGAAGTCTA
TATATTGGCCAGCATATTTGATGTATTGGCCAAGAAACATGAGACCATGATCACTGCACGAGACATCATGGGTTCATTATATGAGATGTTTGGGCAACAGTCCTCACAAC
TCAAACATGGAGCTCTCAAGTTCATCTTCAACGCACGGATGAAGGAAGGAAGTTCTGTCTGGGAACATGTTCTTAGTATGATGGTCTATTTCAACGTGGCAGAGATTAAT
GGTGCTATCATAAACGAGTCCACATCATCGTCCAAGTACGAACTTCCGTTGTCCAACTGGAAAGTGACGATTGTGAACGTTCAGAAAAATAACGCAACTCTGTTTGCTCA
TTGCAAGTCCAAAGACAACGATTTAGGTGTGCACATCATCAACGTCGGGGGAAAATACGAGTGGGAATTCAAGGAGAATTTTTTGCAGACGACGTTATTTTGGTGCAACT
TTCGTAGCAAAGTAGGGCATGCTTCATTTGAGGTTTTCTGGCCAGAGAAAGAAACATGGCTTTCTGATCGATGCAAGAAGCTTTTCATATGCAAACTTCTTCTAGAGGGA
GATCTAATGGAAGAAGAGGTGGATGTGGTGGTGGAGGAAATGGAAGGTCCAATGACTTCAAAAGTTTTGAGTCAGAGAGGAGAAATAATCATACTTCTTCGAATGGAGGA
AGGGGAAGAAGTTAAAGCAGAGGAAGAAGTAGAGGTGAATGTCGTGGAGATTTTTCTCACATACAATGCTTCAATTGTGGACGCTACAGGAGATGCTCAAGACTTGGAGC
TTGAACAGACTCAACCACCAACCTCATCTTCATCTTCACTTTCCACAAGTGACGAATCAACTCCACCAAGGAAGAGGATAAATATTCAAGAGATCTGTAATGCTTCAAGA
GCAATACTTGAAGATGATGTTGAGTGTGTTGCCCTAAACTCCGCCTCGGAAAAGTTCCACCGAGGTTCGACCTCTGGAGCCAAGTCTGTGCCATCCTCTTCTGGAAGTAA
GACTTTCAAGAAGAAGAAGGCCACTGGTAATGAGCCTAGACCTGACCCCACTGCTGCCACTGCCAAGAAAGGCAAGACCAAGGTTGCATAG
Protein sequence
MFARVNNEASCSAWWCPESILPSECACMKPIPVAEVYILASIFDVLAKKHETMITARDIMGSLYEMFGQQSSQLKHGALKFIFNARMKEGSSVWEHVLSMMVYFNVAEIN
GAIINESTSSSKYELPLSNWKVTIVNVQKNNATLFAHCKSKDNDLGVHIINVGGKYEWEFKENFLQTTLFWCNFRSKVGHASFEVFWPEKETWLSDRCKKLFICKLLLEG
DLMEEEVDVVVEEMEGPMTSKVLSQRGEIIILLRMEEGEEVKAEEEVEVNVVEIFLTYNASIVDATGDAQDLELEQTQPPTSSSSSLSTSDESTPPRKRINIQEICNASR
AILEDDVECVALNSASEKFHRGSTSGAKSVPSSSGSKTFKKKKATGNEPRPDPTAATAKKGKTKVA
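
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch of that relationship, the Python snippet below translates a nucleotide string using the standard genetic code. The function name translate and the table construction are illustrative, and the standard (nuclear) code is assumed here rather than stated by the record. The two test strings are the first 24 and the last 12 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTTGCACGGGTCAATAATGAG"))  # MFARVNNE (start of the protein)
print(translate("AAGGTTGCATAG"))              # KVA (end of the protein)
```

Passing the full CDS (with or without line breaks) to translate should reproduce the protein sequence listed above, provided the standard code applies.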