CuGenDBv2

MC04g1719 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1719
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1997)
Genome location: MC04:24686020..24687584
RNA-Seq Expression: MC04g1719
Synteny: MC04g1719
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
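
The genome location above uses a `chromosome:start..end` notation. A minimal sketch of reading it and computing the span length follows; it assumes the coordinates are 1-based and inclusive, which this record does not state, and the function names `parse_location` and `span_length` are hypothetical helpers rather than part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC04:24686020..24687584")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span on MC04 works out to 1,565 bp.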


Homology
GenBank top hits | e value | % identity
XP_004141578.1 uncharacterized protein LOC101212716 isoform X2 [Cucumis sativus] | 4.47e-91 | 63.52
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL
        KKQK+ ++ K  A+    Q+   H NLLS S   FSD+PL ESPGKASFD+YLEDKPRL+KATFPGK QQLNQ          EEWRIETP+IQLLFLK+
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL

Query:  LPVVDIKIISKTSG-EDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFET
         P +D+KIISKT+G E YP HVPHYI KLL  +MTNWEINGIH++YRPSSA VCS G IY  K GT SRLKFQL+++L+F+V   L F+P DV + I ET
Subjt:  LPVVDIKIISKTSG-EDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFET

Query:  VLKVMIEDVKHKTIDKLVEDYSKFKKEKSKRQI
        V+K M+ED+KHKT+ KLVEDYSKF+ EK K  I
Subjt:  VLKVMIEDVKHKTIDKLVEDYSKFKKEKSKRQI

XP_022134158.1 uncharacterized protein LOC111006493 isoform X1 [Momordica charantia] | 1.77e-124 | 81.25
Query:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPV
        KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPR+LKATFPGK QQLNQ          EEWRIETP+++LL LK+ PV
Subjt:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPV

Query:  VDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKV
        +D+KIISKTSG+DYPPHVPH+ITKLL LEMTNWEINGIHR+YRPSSA V SKGAIY++KRGTTSRLKFQ  MN TFVV Q LSFIPKD+F++IFETVLKV
Subjt:  VDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKV

Query:  MIEDVKHKTIDKLVEDYSKFKKEK
        M+ED+ +K IDKLVEDYSKF+KEK
Subjt:  MIEDVKHKTIDKLVEDYSKFKKEK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata] | 4.69e-91 | 65.5
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPRL+KATFPGK +QLNQ          EEWRIETP+I+ LFLK+
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL

Query:  LPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETV
         P +DIKIISKTSGE YP  VPH ITK+L+L+MTNWE+NGIHRDYRPSSA VCS+GAIY++K G  SRLKFQL +NL+F +   L+F+PKDVFQ+I E  
Subjt:  LPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETV

Query:  LKVMIEDVKHKTIDKLVEDYSKFKKEKSK
        LK M+EDVK K +D+LVEDY  F+KEK K
Subjt:  LKVMIEDVKHKTIDKLVEDYSKFKKEKSK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo] | 3.83e-90 | 65.07
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGK +QLNQ          EEWRIETP+I+ LFLK+
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL

Query:  LPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETV
         P +DIKIISKTSGE YP  VPH ITK+L+L+MTNWE+NGIHRDYRPSSA VCS+GAIY+ K G  SRLKFQL +NL+F +   L+F+PKDVFQ+I E  
Subjt:  LPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETV

Query:  LKVMIEDVKHKTIDKLVEDYSKFKKEKSK
        LK M+EDVK K +D+LVEDY  F+KEK K
Subjt:  LKVMIEDVKHKTIDKLVEDYSKFKKEKSK

XP_038891182.1 uncharacterized protein LOC120080556 [Benincasa hispida] | 3.11e-92 | 66.08
Query:  KKQKIIRSCKFCAVSKTQQ----QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLK
        KKQKI R  K  AVSKTQ+     H NLLS S+ FFSD+PL +SPGKASFD+YLEDKPRL+KATFPGK QQLNQ          EEWRIE P+I+LLFLK
Subjt:  KKQKIIRSCKFCAVSKTQQ----QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLK

Query:  LLPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFET
        + P VDIKI  KT+GE YP  VPHYITK+L LEMTNWEINGIH+DYRPS A VCS+GAIY++K GT S LKF+LL+NL+F+V   L+F+  DV Q+I +T
Subjt:  LLPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFET

Query:  VLKVMIEDVKHKTIDKLVEDYSKFKKE
         LK MIED+KHK+I KLVEDY++F+KE
Subjt:  VLKVMIEDVKHKTIDKLVEDYSKFKKE

TrEMBL top hits | e value | % identity
A0A0A0KSD5 Uncharacterized protein | 2.16e-91 | 63.52
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL
        KKQK+ ++ K  A+    Q+   H NLLS S   FSD+PL ESPGKASFD+YLEDKPRL+KATFPGK QQLNQ          EEWRIETP+IQLLFLK+
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL

Query:  LPVVDIKIISKTSG-EDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFET
         P +D+KIISKT+G E YP HVPHYI KLL  +MTNWEINGIH++YRPSSA VCS G IY  K GT SRLKFQL+++L+F+V   L F+P DV + I ET
Subjt:  LPVVDIKIISKTSG-EDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFET

Query:  VLKVMIEDVKHKTIDKLVEDYSKFKKEKSKRQI
        V+K M+ED+KHKT+ KLVEDYSKF+ EK K  I
Subjt:  VLKVMIEDVKHKTIDKLVEDYSKFKKEKSKRQI

A0A6J1BY06 uncharacterized protein LOC111006493 isoform X2 | 1.08e-82 | 75.15
Query:  LLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAI
        +LKATFPGK QQLNQ          EEWRIETP+++LL LK+ PV+D+KIISKTSG+DYPPHVPH+ITKLL LEMTNWEINGIHR+YRPSSA V SKGAI
Subjt:  LLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAI

Query:  YTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKFKKEK
        Y++KRGTTSRLKFQ  MN TFVV Q LSFIPKD+F++IFETVLKVM+ED+ +K IDKLVEDYSKF+KEK
Subjt:  YTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKFKKEK

A0A6J1C174 uncharacterized protein LOC111006493 isoform X1 | 8.58e-125 | 81.25
Query:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPV
        KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPR+LKATFPGK QQLNQ          EEWRIETP+++LL LK+ PV
Subjt:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPV

Query:  VDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKV
        +D+KIISKTSG+DYPPHVPH+ITKLL LEMTNWEINGIHR+YRPSSA V SKGAIY++KRGTTSRLKFQ  MN TFVV Q LSFIPKD+F++IFETVLKV
Subjt:  VDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKV

Query:  MIEDVKHKTIDKLVEDYSKFKKEK
        M+ED+ +K IDKLVEDYSKF+KEK
Subjt:  MIEDVKHKTIDKLVEDYSKFKKEK

A0A6J1HEU2 uncharacterized protein LOC111462397 | 2.27e-91 | 65.5
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPRL+KATFPGK +QLNQ          EEWRIETP+I+ LFLK+
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKL

Query:  LPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETV
         P +DIKIISKTSGE YP  VPH ITK+L+L+MTNWE+NGIHRDYRPSSA VCS+GAIY++K G  SRLKFQL +NL+F +   L+F+PKDVFQ+I E  
Subjt:  LPVVDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETV

Query:  LKVMIEDVKHKTIDKLVEDYSKFKKEKSK
        LK M+EDVK K +D+LVEDY  F+KEK K
Subjt:  LKVMIEDVKHKTIDKLVEDYSKFKKEKSK

A0A6J1JUJ3 uncharacterized protein LOC111487627 | 1.19e-84 | 66.34
Query:  LSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTSGEDYPPHVPHYITK
        LS S+  FSDIPL E  GKASFDQYLEDKPRL+KA FPGK +QLNQ          EEWRIETP+I+ LFLK+ P +DIKIISKTSGE YP  VPH IT+
Subjt:  LSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTSGEDYPPHVPHYITK

Query:  LLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKFKKEK
        +L+L+MTNWE+NGI RDY PSSA VCS+GAIY++K G  SRLKFQL +NL+F +   L+FIPKDVFQ+I ET LK M+EDVK K +D+LVEDY  F+KEK
Subjt:  LLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKFKKEK

Query:  SK
         K
Subjt:  SK

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT5G39520.1 Protein of unknown function (DUF1997) | 3.7e-38 | 42.57
Query:  FSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQ--QLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTSGEDYPPHVPHYITK
        +S    +DI L+ESP +A FD+YLEDK R+ +A FP K +  +LN+          EEWRI+   I+  FL   PVV ++I  K++G+DYP  VP +ITK
Subjt:  FSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQ--QLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTSGEDYPPHVPHYITK

Query:  LLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKFKKEK
        +LEL MT WE+ G+ R   P+   +  KGA+Y D+RG  +RLK +L   ++FV+   L+ +P+DV +N+   +L  +++++KH+ I+ LV DYSKFK E+
Subjt:  LLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKFKKEK

Query:  SK
         K
Subjt:  SK

AT5G39530.1 Protein of unknown function (DUF1997) | 2.7e-41 | 41.96
Query:  QKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGK--CQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPV
        + ++RS   C VS  +       ++S    +DIPLNESP +A FD+YLEDK R+ +A FP K    +LN+          EEWRI+   I  LFL + PV
Subjt:  QKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGK--CQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPV

Query:  VDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKV
        VD+++  K++G+DYPP VP  ITK+LEL M  W++ G+ R   P+   +  KGA+Y D+RG  +RL+ QL MN++FV+   L  +P+DV +N+   VL  
Subjt:  VDIKIISKTSGEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKV

Query:  MIEDVKHKTIDKLVEDYSKFKKEK
        ++E++KHK    L+ DYS+FK E+
Subjt:  MIEDVKHKTIDKLVEDYSKFKKEK
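
In BLAST-style pairwise alignments such as those listed above, the line between Query and Subjt carries a letter where the two residues are identical, a `+` where the substitution is conservative, and a blank for a mismatch or gap. The sketch below counts those symbols for one alignment segment. The helper name `midline_stats` is hypothetical, and the input is the final segment of the XP_022134158.1 alignment copied from this record. A per-segment count like this will generally differ from the reported % identity, which covers the whole alignment.

```python
def midline_stats(midline):
    """Count identical and conservative positions in a BLAST midline.

    Letters mark identical residues, '+' marks conservative
    substitutions, and blanks mark mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identical, conservative, len(midline)


# Final segment of the XP_022134158.1 alignment shown in this record.
query = "MIEDVKHKTIDKLVEDYSKFKKEK"
midline = "M+ED+ +K IDKLVEDYSKF+KEK"

identical, conservative, length = midline_stats(midline)
print(f"{identical}/{length} identical, {conservative} conservative")
```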


Sequences
CDS sequence
AAGAAGCAGAAAATAATTAGGAGCTGCAAATTCTGTGCAGTCTCCAAAACGCAGCAGCAGCATCAAAATTTGTTATCTTTTTCTATGGGATTTTTCAGTGATATACCTCT
TAACGAGTCTCCTGGTAAAGCTTCTTTTGATCAATACTTGGAAGATAAACCCAGATTGCTGAAGGCAACATTTCCAGGAAAATGTCAACAGCTCAACCAGGTTTTTTTTT
CATCATCTTTTAAAGTTGTTGAAGAGTGGAGAATTGAAACGCCACAAATCCAGTTGTTGTTTCTCAAGTTATTGCCAGTGGTTGATATAAAAATAATCTCCAAAACCAGT
GGTGAGGATTACCCACCTCATGTTCCTCATTATATCACAAAACTTCTCGAACTTGAAATGACAAATTGGGAGATCAATGGCATCCATAGGGACTATAGGCCATCTTCAGC
CAAAGTTTGTTCCAAAGGAGCTATTTACACTGACAAAAGAGGAACTACAAGTAGACTTAAGTTTCAACTCCTCATGAATCTAACATTTGTTGTCCTGCAGACTCTGAGTT
TCATTCCGAAGGACGTTTTTCAAAACATCTTCGAGACGGTTTTGAAAGTAATGATTGAGGATGTGAAGCATAAAACTATAGATAAATTGGTTGAAGATTACAGTAAGTTC
AAGAAGGAGAAAAGTAAGAGACAGATC
mRNA sequence
AAGAAGCAGAAAATAATTAGGAGCTGCAAATTCTGTGCAGTCTCCAAAACGCAGCAGCAGCATCAAAATTTGTTATCTTTTTCTATGGGATTTTTCAGTGATATACCTCT
TAACGAGTCTCCTGGTAAAGCTTCTTTTGATCAATACTTGGAAGATAAACCCAGATTGCTGAAGGCAACATTTCCAGGAAAATGTCAACAGCTCAACCAGGTTTTTTTTT
CATCATCTTTTAAAGTTGTTGAAGAGTGGAGAATTGAAACGCCACAAATCCAGTTGTTGTTTCTCAAGTTATTGCCAGTGGTTGATATAAAAATAATCTCCAAAACCAGT
GGTGAGGATTACCCACCTCATGTTCCTCATTATATCACAAAACTTCTCGAACTTGAAATGACAAATTGGGAGATCAATGGCATCCATAGGGACTATAGGCCATCTTCAGC
CAAAGTTTGTTCCAAAGGAGCTATTTACACTGACAAAAGAGGAACTACAAGTAGACTTAAGTTTCAACTCCTCATGAATCTAACATTTGTTGTCCTGCAGACTCTGAGTT
TCATTCCGAAGGACGTTTTTCAAAACATCTTCGAGACGGTTTTGAAAGTAATGATTGAGGATGTGAAGCATAAAACTATAGATAAATTGGTTGAAGATTACAGTAAGTTC
AAGAAGGAGAAAAGTAAGAGACAGATC
Protein sequence
KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRLLKATFPGKCQQLNQVFFSSSFKVVEEWRIETPQIQLLFLKLLPVVDIKIISKTS
GEDYPPHVPHYITKLLELEMTNWEINGIHRDYRPSSAKVCSKGAIYTDKRGTTSRLKFQLLMNLTFVVLQTLSFIPKDVFQNIFETVLKVMIEDVKHKTIDKLVEDYSKF
KKEKSKRQI
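
The protein sequence is expected to correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code and reading frame 1 (neither is stated in this record). The function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; 'X' marks unknown codons."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS listed in this record.
cds_start = "AAGAAGCAGAAAATAATTAGGAGCTGCAAA"
print(translate(cds_start))
```

The translated prefix can then be compared with the first ten residues of the protein sequence above (KKQKIIRSCK).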