CuGenDBv2

Moc04g36560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g36560
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF1997)
Genome location: chr4:27441790..27444526
RNA-Seq Expression: Moc04g36560
Synteny: Moc04g36560
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997


Homology
GenBank top hits (e value, % identity, alignment)
XP_022134158.1 uncharacterized protein LOC111006493 isoform X1 [Momordica charantia] (e value: 2.1e-134; % identity: 100)
Query:  MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQ
        MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQ
Subjt:  MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQ

Query:  EEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQ
        EEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQ
Subjt:  EEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQ

Query:  ALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        ALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
Subjt:  ALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK

XP_022134159.1 uncharacterized protein LOC111006493 isoform X2 [Momordica charantia] (e value: 4.4e-84; % identity: 100)
Query:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
        MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
Subjt:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR

Query:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        LKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
Subjt:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata] (e value: 3.2e-74; % identity: 69.27)
Query:  KKQKIIRSCKFCAVSKTQQ---QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ   + QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQ---QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTN
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E  LK M+ED+  
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTN

Query:  KAIDKLVEDYSKFRKEKK
        KA+D+LVEDY  FRKEKK
Subjt:  KAIDKLVEDYSKFRKEKK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.2e-73; % identity: 68.81)
Query:  KKQKIIRSCKFCAVSKTQQ---QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ   + QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQ---QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTN
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYS+K G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E  LK M+ED+  
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTN

Query:  KAIDKLVEDYSKFRKEKK
        KA+D+LVEDY  FRKEKK
Subjt:  KAIDKLVEDYSKFRKEKK

XP_038891182.1 uncharacterized protein LOC120080556 [Benincasa hispida] (e value: 3.2e-74; % identity: 61.75)
Query:  MMSLFSTQLLHFHVENGE--QRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQ----QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGK
        M+ L +  L +  VENG   QR ++R + +  KKQKI R  K  AVSKTQ+     H NLLS S+ FFSD+PL +SPGKASFD+YLEDKPR++KATFPGK
Subjt:  MMSLFSTQLLHFHVENGE--QRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQ----QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGK

Query:  SQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNF
         QQLNQEEWRIE PK+ELL LKIWP +D+KI  KT+G+ YP  VPH+ITK+L LEMTNWEINGIH++YRPS ANV S+GAIYSEK GT S LKF+  +N 
Subjt:  SQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNF

Query:  TFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        +F+VP  L+F+  D+ +SI +T LK M+EDL +K+I KLVEDY++FRKE K
Subjt:  TFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSD5 Uncharacterized protein (e value: 1.2e-71; % identity: 65.75)
Query:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        KKQK +++ K  A+    Q+   H NLLS S   FSD+PL ESPGKASFD+YLEDKPR++KATFPGK+QQLNQEEWRIETPK++LL LKI P IDMKIIS
Subjt:  KKQKIIRSCKFCAVSKTQQQ---HQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTS-GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLT
        KT+ G+ YP HVPH+I KLLH +MTNWEINGIH+ YRPSSANV S G IY +K GT SRLKFQ  ++ +F+VP AL F+P D+ R I ETV+K M+EDL 
Subjt:  KTS-GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLT

Query:  NKAIDKLVEDYSKFRKEKK
        +K + KLVEDYSKFR EK+
Subjt:  NKAIDKLVEDYSKFRKEKK

A0A6J1BY06 uncharacterized protein LOC111006493 isoform X2 (e value: 2.1e-84; % identity: 100)
Query:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
        MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR
Subjt:  MLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSR

Query:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        LKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
Subjt:  LKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK

A0A6J1C174 uncharacterized protein LOC111006493 isoform X1 (e value: 1.0e-134; % identity: 100)
Query:  MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQ
        MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQ
Subjt:  MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQ

Query:  EEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQ
        EEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQ
Subjt:  EEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQ

Query:  ALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        ALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
Subjt:  ALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK

A0A6J1HEU2 uncharacterized protein LOC111462397 (e value: 1.5e-74; % identity: 69.27)
Query:  KKQKIIRSCKFCAVSKTQQ---QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS
        K QK+ R  K  AVSK+QQ   + QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIIS
Subjt:  KKQKIIRSCKFCAVSKTQQ---QHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIIS

Query:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTN
        KTSG+ YP  VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E  LK M+ED+  
Subjt:  KTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTN

Query:  KAIDKLVEDYSKFRKEKK
        KA+D+LVEDY  FRKEKK
Subjt:  KAIDKLVEDYSKFRKEKK

A0A6J1JUJ3 uncharacterized protein LOC111487627 (e value: 1.9e-69; % identity: 66.98)
Query:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS
        K QK+ R  K  AVSK        LS S+  FSDIPL E  GKASFDQYLEDKPR++KA FPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIISKTS
Subjt:  KKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTS

Query:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAI
        G+ YP  VPH+IT++L L+MTNWE+NGI R+Y PSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+FIPKD+F+SI ET LK M+ED+  KA+
Subjt:  GQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAI

Query:  DKLVEDYSKFRKEKK
        D+LVEDY  FRKEKK
Subjt:  DKLVEDYSKFRKEKK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G39520.1 Protein of unknown function (DUF1997) (e value: 5.1e-38; % identity: 41.88)
Query:  FSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPG--KSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWE
        +S    +DI L+ESP +A FD+YLEDK R+ +A FP   K+ +LN+EEWRI+   ++   L   PV+ M+I  K++GQDYP  VP HITK+L L MT WE
Subjt:  FSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPG--KSQQLNQEEWRIETPKMELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWE

Query:  INGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        + G+ R   P+   +  KGA+Y ++RG  +RLK +     +FV+P  L+ +P+D+ R++   +L  +++++ ++ I+ LV DYSKF+ E+K
Subjt:  INGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK

AT5G39530.1 Protein of unknown function (DUF1997) (e value: 3.1e-43; % identity: 40.6)
Query:  VENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGK--SQQLNQEEWRIETPKME
        + +G  RS +R   V  + + ++RS   C VS  +       ++S    +DIPLNESP +A FD+YLEDK R+ +A FP K  S +LN+EEWRI+   + 
Subjt:  VENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGK--SQQLNQEEWRIETPKME

Query:  LLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFR
         L L +WPV+DM++  K++GQDYPP VP  ITK+L L M  W++ G+ R   P+  ++  KGA+Y ++RG  +RL+ Q  MN +FV+P  L  +P+D+ R
Subjt:  LLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFR

Query:  SIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
        ++   VL  ++E++ +K    L+ DYS+F+ E+K
Subjt:  SIFETVLKVMMEDLTNKAIDKLVEDYSKFRKEKK
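
The unlabeled line between each Query and Subjt row is the match line: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch (the function name and the returned fields are illustrative, not part of the database), the counts for one alignment block can be tallied as below. The percentage is computed over a single block only, so it is not expected to reproduce the % identity reported for a whole hit, whose denominator is not stated on this page.

```python
def match_line_stats(query: str, match: str) -> dict:
    """Tally identities and positives for one BLAST-style alignment block.

    `query` is the residue string from a Query row and `match` is the
    match line beneath it. Both names are hypothetical helpers.
    """
    # Trailing spaces on the match line are often lost, so pad to the query length.
    match = match.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in match if ch.isalpha())  # letters = identical residues
    positives = identities + match.count("+")            # "+" = conservative substitution
    length = len(query)                                  # block length, gaps included
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Last block of the Cucurbita moschata hit (XP_022961709.1) from the listing above.
stats = match_line_stats("KAIDKLVEDYSKFRKEKK", "KA+D+LVEDY  FRKEKK")
print(stats["identities"], stats["positives"], stats["length"])
```

To summarise a whole hit under the same assumptions, concatenate the Query rows and the match lines of all its blocks before calling the function.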


Sequences
CDS sequence
ATGATGTCTTTATTTTCCACTCAATTACTACATTTCCATGTTGAAAATGGAGAACAAAGGTCGAACAACAGATTCAATATGGTGAAGAAGAAGAAGCAGAAAATAATTAG
GAGCTGCAAATTCTGTGCAGTCTCCAAAACGCAGCAGCAGCATCAAAATTTGTTATCTTTTTCTATGGGATTTTTCAGTGATATACCTCTTAACGAGTCTCCTGGTAAAG
CTTCTTTTGATCAATACTTGGAAGATAAACCCAGAATGCTGAAGGCAACATTTCCAGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATTGAAACGCCAAAAATG
GAATTGCTGTCTTTGAAGATATGGCCAGTGATTGATATGAAAATAATCTCCAAAACCAGTGGCCAAGATTACCCACCTCATGTTCCTCATCATATCACAAAACTTCTCCA
CCTTGAAATGACAAATTGGGAGATCAATGGCATCCATAGGAACTATAGGCCATCTTCAGCCAATGTTACTTCCAAAGGAGCTATTTACAGTGAAAAAAGAGGAACTACAA
GTAGACTTAAGTTTCAATTCTTTATGAATTTCACCTTTGTTGTCCCGCAGGCTCTGAGTTTCATTCCGAAAGACATTTTTCGAAGCATCTTTGAGACGGTTTTGAAGGTA
ATGATGGAAGATCTTACGAATAAAGCTATAGATAAATTGGTTGAAGATTACAGTAAGTTCAGGAAGGAGAAGAAGTAA
mRNA sequence
ATGATGTCTTTATTTTCCACTCAATTACTACATTTCCATGTTGAAAATGGAGAACAAAGGTCGAACAACAGATTCAATATGGTGAAGAAGAAGAAGCAGAAAATAATTAG
GAGCTGCAAATTCTGTGCAGTCTCCAAAACGCAGCAGCAGCATCAAAATTTGTTATCTTTTTCTATGGGATTTTTCAGTGATATACCTCTTAACGAGTCTCCTGGTAAAG
CTTCTTTTGATCAATACTTGGAAGATAAACCCAGAATGCTGAAGGCAACATTTCCAGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATTGAAACGCCAAAAATG
GAATTGCTGTCTTTGAAGATATGGCCAGTGATTGATATGAAAATAATCTCCAAAACCAGTGGCCAAGATTACCCACCTCATGTTCCTCATCATATCACAAAACTTCTCCA
CCTTGAAATGACAAATTGGGAGATCAATGGCATCCATAGGAACTATAGGCCATCTTCAGCCAATGTTACTTCCAAAGGAGCTATTTACAGTGAAAAAAGAGGAACTACAA
GTAGACTTAAGTTTCAATTCTTTATGAATTTCACCTTTGTTGTCCCGCAGGCTCTGAGTTTCATTCCGAAAGACATTTTTCGAAGCATCTTTGAGACGGTTTTGAAGGTA
ATGATGGAAGATCTTACGAATAAAGCTATAGATAAATTGGTTGAAGATTACAGTAAGTTCAGGAAGGAGAAGAAGTAA
Protein sequence
MMSLFSTQLLHFHVENGEQRSNNRFNMVKKKKQKIIRSCKFCAVSKTQQQHQNLLSFSMGFFSDIPLNESPGKASFDQYLEDKPRMLKATFPGKSQQLNQEEWRIETPKM
ELLSLKIWPVIDMKIISKTSGQDYPPHVPHHITKLLHLEMTNWEINGIHRNYRPSSANVTSKGAIYSEKRGTTSRLKFQFFMNFTFVVPQALSFIPKDIFRSIFETVLKV
MMEDLTNKAIDKLVEDYSKFRKEKK
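
The protein sequence corresponds to the CDS read codon by codon from the first ATG to the stop codon. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. The example input is the final line of the CDS listing above, which starts on a codon boundary.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()          # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":                            # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS listing, ending in the TAA stop codon.
cds_tail = (
    "ATGATGGAAGATCTTACGAATAAAGCTATAGATAAATTGGTTGAAGATTACAGTAAGTTCAGGAAGGAGAAGAAGTAA"
)
print(translate(cds_tail))  # MMEDLTNKAIDKLVEDYSKFRKEKK
```

Passing the full CDS, with its line breaks left in, should give the full protein sequence listed above, since whitespace is removed before translation.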