CuGenDBv2

MC04g1714 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1714
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1997)
Genome location: MC04:24632767..24634520
RNA-Seq Expression: MC04g1714
Synteny: MC04g1714
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
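
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The helper names `Locus` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval parsed from 'seqid:start..end' (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a location string such as 'MC04:24632767..24634520'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("MC04:24632767..24634520")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene span is 1754 bp, which is longer than the CDS listed further down; that would be consistent with the gene containing introns or untranslated regions, though the page does not say so.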


Homology
GenBank top hits (e value, % identity, alignment)
KAG7033185.1 hypothetical protein SDJN02_07239, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.46e-82 | % identity: 69.68
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGI R+YRPSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE

XP_022134158.1 uncharacterized protein LOC111006493 isoform X1 [Momordica charantia] | e value: 4.55e-132 | % identity: 100
Query:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS
        KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS
Subjt:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS

Query:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
Subjt:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV

XP_022134159.1 uncharacterized protein LOC111006493 isoform X2 [Momordica charantia] | e value: 6.27e-90 | % identity: 100
Query:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
        MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
Subjt:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR

Query:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        LKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
Subjt:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETV

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata] | e value: 2.11e-83 | % identity: 70.21
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.21e-82 | % identity: 69.68
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYS+K G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSD5 Uncharacterized protein | e value: 3.48e-79 | % identity: 65.97
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        KKQK+ ++ K  A+    Q+   H NLLS S   FSD+PL ESPGKASFD+YLEDKPR++KATFPGK+QQLNQEEWRIETPK++LL LKI P IDMKIIS
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQD-YPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        KT+G + YP HVPH+I KLLH +MTNWEINGIH+ YRPSSANV S G IY +K GT SRLKFQ  ++ +F+VP AL F+P D+ R I ETV
Subjt:  KTSGQD-YPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV

A0A6J1BY06 uncharacterized protein LOC111006493 isoform X2 | e value: 3.04e-90 | % identity: 100
Query:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
        MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
Subjt:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR

Query:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        LKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
Subjt:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X1 | e value: 2.20e-132 | % identity: 100
Query:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS
        KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS
Subjt:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS

Query:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
Subjt:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV

A0A6J1HEU2 uncharacterized protein LOC111462397 | e value: 1.02e-83 | % identity: 70.21
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ+    QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFE

A0A6J1JUJ3 uncharacterized protein LOC111487627 | e value: 7.65e-77 | % identity: 71.6
Query:  LSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWE
        LS S+  FSDIPL E  GKASFDQYLEDKPR++KA FPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIISKTSG+ YP  VPH+IT++L L+MTNWE
Subjt:  LSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWE

Query:  INGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFET
        +NGI R+Y PSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+FIPKD+F+SI ET
Subjt:  INGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFET

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G39520.1 Protein of unknown function (DUF1997) | e value: 7.1e-32 | % identity: 42.33
Query:  FSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPG--KSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWE
        +S    +DI L+ESP +A FD+YLEDK R+ +A FP   K+ +LN+EEWRI+   ++   L   PV+ M+I  K++GQDYP  VP HITK+L L MT WE
Subjt:  FSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPG--KSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWE

Query:  INGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        + G+ R   P+   +  KGA+Y ++RG  +RLK +     +FV+P  L+ +P+D+ R++   +
Subjt:  INGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV

AT5G39530.1 Protein of unknown function (DUF1997) | e value: 3.9e-38 | % identity: 42.78
Query:  QKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGK--SQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS
        + ++RS   C VS  +       ++S    +DIPLNESP +A FD+YLEDK R+ +A FP K  S +LN+EEWRI+   +  L L +WPV+DM++  K++
Subjt:  QKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGK--SQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS

Query:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
        GQDYPP VP  ITK+L L M  W++ G+ R   P+  ++  KGA+Y ++RG  +RL+ Q  MN +FV+P  L  +P+D+ R++   V
Subjt:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
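
In the alignment blocks above, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, a percent identity for one displayed segment can be estimated by counting the letters in that middle line and dividing by the aligned length. The function name `percent_identity` is hypothetical, and the result need not match the % identity values listed above if the page's figures were computed over a different alignment span.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate percent identity for one aligned segment.

    `query` is the aligned query string (gaps included) and `midline` is
    the match line beneath it. Letters in the midline count as identities;
    '+' and spaces do not.
    """
    aligned_length = len(query)
    if aligned_length == 0:
        raise ValueError("empty alignment segment")
    # Trailing spaces in the midline are often lost in copied text,
    # so pad it back to the aligned length before counting.
    padded = midline.ljust(aligned_length)[:aligned_length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / aligned_length


# First eight columns of the first GenBank alignment above.
print(percent_identity("KKQKIIRS", "K QK+ R "))
```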


Sequences
CDS sequence
AAGAAGCAGAAAATAATTAGGAGCTGCAAATTCTGTGCAGTCTCCAAAACGCAGCAGCAGCATCAAAATTTGTTATCTTTTTCTATGGGATTTTTCAGTGATATACCTCT
TAACGAGTCTCCTGGTAAAGCTTCTTTTGATCAATACTTGGAAGATAAACCCAGAATGCTGAAGGCAACATTTCCAGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGA
GAATTGAAACGCCAAAAATGGAATTGCTGTCTTTGAAGATATGGCCAGTGATTGATATGAAAATAATCTCCAAAACCAGTGGCCAAGATTACCCACCTCATGTTCCTCAT
CATATCACAAAACTTCTCCACCTTGAAATGACAAATTGGGAGATCAATGGCATCCATAGGAACTATAGGCCATCTTCAGCCAATGTTACTTCCAAAGGAGCTATTTACAG
TGAAAAAAGAGGAACTACAAGTAGACTTAAGTTTCAATTCTTTATGAATTTCACCTTTGTTGTCCCGCAGGCTCTGAGTTTCATTCCGAAAGACATTTTTCGAAGCATCT
TTGAGACGGTA
mRNA sequence
AAGAAGCAGAAAATAATTAGGAGCTGCAAATTCTGTGCAGTCTCCAAAACGCAGCAGCAGCATCAAAATTTGTTATCTTTTTCTATGGGATTTTTCAGTGATATACCTCT
TAACGAGTCTCCTGGTAAAGCTTCTTTTGATCAATACTTGGAAGATAAACCCAGAATGCTGAAGGCAACATTTCCAGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGA
GAATTGAAACGCCAAAAATGGAATTGCTGTCTTTGAAGATATGGCCAGTGATTGATATGAAAATAATCTCCAAAACCAGTGGCCAAGATTACCCACCTCATGTTCCTCAT
CATATCACAAAACTTCTCCACCTTGAAATGACAAATTGGGAGATCAATGGCATCCATAGGAACTATAGGCCATCTTCAGCCAATGTTACTTCCAAAGGAGCTATTTACAG
TGAAAAAAGAGGAACTACAAGTAGACTTAAGTTTCAATTCTTTATGAATTTCACCTTTGTTGTCCCGCAGGCTCTGAGTTTCATTCCGAAAGACATTTTTCGAAGCATCT
TTGAGACGGTA
Protein sequence
KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPH
HITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETV
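
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used as the example input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0.

    Whitespace and line breaks are ignored, any trailing partial codon is
    dropped, and codons containing non-ACGT characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS listed above.
print(translate("AAGAAGCAGAAAATAATTAGGAGCTGCAAA"))  # KKQKIIRSCK
```

Passing the full CDS text, line breaks included, should yield a sequence that can be compared against the protein sequence listed above.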