CuGenDBv2

MC01g1473 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1473
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DUF4228 domain-containing protein
Genome location: MC01:19191290..19192524
RNA-Seq Expression: MC01g1473
Synteny: MC01g1473
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
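
The genome location above has the form chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the length of the annotated span can be computed as shown here. The helper names `parse_location` and `span_length` are hypothetical and are not part of CuGenDBv2.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC01:19191290..19192524")
print(chrom, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, the `+ 1` in `span_length` would need to be dropped.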


Homology
GenBank top hits | e value | % identity
KAA0052618.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa] | 4.02e-78 | 76.19
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA--RSRSLT
        MKN IRCCISCILPCG+LDVIRIVHC+GHV+EIAG+IRAS+VMKANP+HVLKKPSSP+    VVPKIVI+PPDAELQRGKIYFLMP+PP     RS+SLT
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA--RSRSLT

Query:  KKKKRTSELLPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL
        KKKK+   L P T VGS +SV   VVSD+YLSEILS    T  K+KRRGRVGVWRPHLESISEFPTDL
Subjt:  KKKKRTSELLPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL

KAG6581359.1 hypothetical protein SDJN03_21361, partial [Cucurbita argyrosperma subsp. sororia] | 7.47e-77 | 77.11
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        MKN IRCCISCI PCG+LDVIRIVHCNGHVEEIAG+IRAS++MKANP+HVLKKPSSPSD  GVVPKIVI+PPDAELQRGKIYFLMP+PP   ++RS  KK
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL
        KKR +   P T VGS +SV   VVSDRYLS+ILS   +TG K+KRRGRVGVWRPHLESISEFP DL
Subjt:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL

XP_023543275.1 uncharacterized protein LOC111803203 isoform X1 [Cucurbita pepo subsp. pepo] | 1.84e-77 | 76.51
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        MKN IRCCISCI PCG+LDVIRIVHCNGHVEEIAG+IRAS++MKANP+HVLKKP+SPSDQ GVVPKIVI+PPDAELQRGKIYFLMP+PP   ++RS  KK
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL
        K+ T    P T VGS +SV   VVSDRYLS+ILS   +TG K+KRRGRVGVWRPHLESISEFP DL
Subjt:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL

XP_031742479.1 uncharacterized protein LOC101214777 [Cucumis sativus] | 5.71e-78 | 74.85
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        MKN IRCC+SCILPCG+LDVIRIVHC+GHV+EIAG+IRAS+VMKANP+HVLKKPSSP+    VVPKIVI+PPDAELQRGKIYFLMP+PP   + RS +  
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSEL-LPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL
        KK+  EL LP T VGS +SV   VVSDRYLSEILS    T  K+KRRGRVGVWRPHLESISEFPTDL
Subjt:  KKRTSEL-LPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL

XP_038882770.1 uncharacterized protein LOC120073923 [Benincasa hispida] | 4.07e-80 | 77.98
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA--RSRSLT
        MKNTIRCCISCILPCG+LDVIRIVHCNGHVEEIAGTIRAS+VMKANP+HVLKKPSSP+    VVPKIVI+PPDA+LQRGKIYFLMP+PP     RS+SLT
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA--RSRSLT

Query:  KKKKRTSELLPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL
         KKK+    LP T VGS +SV   VVSDRYLSEILS    T  K+KRRGRVGVWRPHLESISEFPTDL
Subjt:  KKKKRTSELLPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL

TrEMBL top hits | e value | % identity
A0A0A0KI99 Uncharacterized protein | 2.76e-78 | 74.85
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        MKN IRCC+SCILPCG+LDVIRIVHC+GHV+EIAG+IRAS+VMKANP+HVLKKPSSP+    VVPKIVI+PPDAELQRGKIYFLMP+PP   + RS +  
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSEL-LPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL
        KK+  EL LP T VGS +SV   VVSDRYLSEILS    T  K+KRRGRVGVWRPHLESISEFPTDL
Subjt:  KKRTSEL-LPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL

A0A5D3CP55 DUF4228 domain-containing protein | 1.95e-78 | 76.19
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA--RSRSLT
        MKN IRCCISCILPCG+LDVIRIVHC+GHV+EIAG+IRAS+VMKANP+HVLKKPSSP+    VVPKIVI+PPDAELQRGKIYFLMP+PP     RS+SLT
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA--RSRSLT

Query:  KKKKRTSELLPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL
        KKKK+   L P T VGS +SV   VVSD+YLSEILS    T  K+KRRGRVGVWRPHLESISEFPTDL
Subjt:  KKKKRTSELLPETAVGSDLSV---VVSDRYLSEILS----TTGKEKRRGRVGVWRPHLESISEFPTDL

A0A6J1EJJ7 uncharacterized protein LOC111433184 | 1.42e-76 | 77.11
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        MKN IRCCISCI PCG+LDVIRIVHCNGHVEEIAG+IRAS++MKANP+HVLKKPSSPSD  GVVPKIVI+PPDAELQRGKIYFLMP+PP   ++RS  KK
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL
        KKR +   P T VGS +SV   VVSDRYLS+ILS   +TG K+KRRGRVGVWRPHLESISEFP DL
Subjt:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL

A0A6J1IUZ9 uncharacterized protein LOC111478638 | 3.62e-77 | 77.11
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        MKN IRCCISCI PCG+LDVIRIVHCNGHVEEIAG+IRAS++MKANP+HVLKKPSSPSD  GVVPKIVI+PPDAELQRGKIYFLMP+PP   ++RS  KK
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL
        KKR +   P T VGS +SV   VVSDRYLS+ILS   +TG K+KRRGRVGVWRPHLESISEFP DL
Subjt:  KKRTSELLPETAVGSDLSV---VVSDRYLSEILS---TTG-KEKRRGRVGVWRPHLESISEFPTDL

A0A6P3Z812 uncharacterized protein LOC107410851 | 1.09e-73 | 72.19
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA----RSRS
        MKNTIRCCISCILPCGALDVIRIVH NG VEEI+GTIRASE+MKA+P+HVLKKPSSP+   GVVPKIVIVPPDAELQRGKIYFLMP PPP      R R 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAA----RSRS

Query:  LTKKKKRTS-ELLPETAVGSDLSV---VVSDRYLSEILS---TTGKEKRRGRVGVWRPHLESISEFPTD
         TKKK+R+S E+       S +S+   ++SDRYLSEILS   +T K++RRGRVGVWRPHLESISE PTD
Subjt:  LTKKKKRTS-ELLPETAVGSDLSV---VVSDRYLSEILS---TTGKEKRRGRVGVWRPHLESISEFPTD

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G06980.1 unknown protein | 1.6e-32 | 46.47
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVP--PPAARSRSLT
        M N++RCC++C+LPCGALD+IRIVH NG+VEEI  +I A E+++ANP HVL KP S     GVV KI+I+ P++EL+RG IYFL+P    P   R R  T
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVP--PPAARSRSLT

Query:  KKKKRTSELLPETAVGSDLS-----VVVSDRYLSEILS--TTGKEKRRGR-------VGVWRPHLESISE
         ++K+  +     A G ++      V + ++YL E++S  +TGKE R  R       V  WRP L+SISE
Subjt:  KKKKRTSELLPETAVGSDLS-----VVVSDRYLSEILS--TTGKEKRRGR-------VGVWRPHLESISE

AT1G29195.1 unknown protein | 8.1e-45 | 53.65
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPS---DQLGVV--PKIVIVPPDAELQRGKIYFLMPVPP------
        MK TIRCCI+CILPCGALDVIRIVH NGHVEEI+GTI ASE+MKA+P+HVLKKPSSP+   D+  V+   KIVIVPP+AELQRGKIYFLMP         
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPS---DQLGVV--PKIVIVPPDAELQRGKIYFLMPVPP------

Query:  ------------PAARSRSLTKKKKRTSELLPETAVGSDLS--------VVVSDRYLSEILS---TTGKEKRRGRVGVWRPHLESISEFPTD
                       + RS  +++ R  +        +D+         ++ SDRYL+EILS    T K++R+GRVGVWRPHLESISE  T+
Subjt:  ------------PAARSRSLTKKKKRTSELLPETAVGSDLS--------VVVSDRYLSEILS---TTGKEKRRGRVGVWRPHLESISEFPTD

AT2G30230.1 unknown protein | 5.1e-31 | 42.22
Query:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK
        M N++RCC++C+LPCGALD+IRIVH NGHV+EI   + A E+++ANP HVL KP S     GVV KI+I+ P++EL+RG IYFL  +P  +   +  TKK
Subjt:  MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKK

Query:  KKRTSELLPETAVGSDLS--------------VVVSDRY-----LSEILSTTGKEKRRGR-------VGVWRPHLESISE
        +K           G+D++              + + ++Y     LSE +S+ GKE R  R       V  WRPHL+SI+E
Subjt:  KKRTSELLPETAVGSDLS--------------VVVSDRY-----LSEILSTTGKEKRRGR-------VGVWRPHLESISE


Sequences
CDS sequence
ATGAAGAATACCATTAGATGCTGCATCTCCTGCATACTTCCATGCGGTGCTCTGGACGTGATCAGGATCGTCCATTGCAACGGCCACGTGGAAGAGATCGCGGGCACCAT
CCGCGCCAGCGAGGTCATGAAGGCCAACCCAGAGCACGTGCTCAAGAAACCGTCATCGCCGTCCGACCAGCTGGGCGTGGTCCCCAAGATCGTGATCGTGCCGCCGGACG
CCGAGCTCCAGCGTGGCAAGATTTACTTTCTCATGCCGGTTCCTCCTCCAGCCGCCCGCTCCAGATCTCTCACCAAGAAGAAGAAGAGGACGTCGGAATTACTACCGGAA
ACCGCCGTGGGTTCCGATTTATCGGTGGTGGTTTCCGATCGGTATCTGAGTGAAATACTGTCGACGACTGGGAAGGAGAAGCGGCGAGGGCGGGTCGGGGTGTGGAGGCC
TCATCTAGAGAGCATTTCCGAGTTCCCAACCGATCTTTAA
mRNA sequence
ATGAAGAATACCATTAGATGCTGCATCTCCTGCATACTTCCATGCGGTGCTCTGGACGTGATCAGGATCGTCCATTGCAACGGCCACGTGGAAGAGATCGCGGGCACCAT
CCGCGCCAGCGAGGTCATGAAGGCCAACCCAGAGCACGTGCTCAAGAAACCGTCATCGCCGTCCGACCAGCTGGGCGTGGTCCCCAAGATCGTGATCGTGCCGCCGGACG
CCGAGCTCCAGCGTGGCAAGATTTACTTTCTCATGCCGGTTCCTCCTCCAGCCGCCCGCTCCAGATCTCTCACCAAGAAGAAGAAGAGGACGTCGGAATTACTACCGGAA
ACCGCCGTGGGTTCCGATTTATCGGTGGTGGTTTCCGATCGGTATCTGAGTGAAATACTGTCGACGACTGGGAAGGAGAAGCGGCGAGGGCGGGTCGGGGTGTGGAGGCC
TCATCTAGAGAGCATTTCCGAGTTCCCAACCGATCTTTAATGGAGTTTCCCCAAAAATGATGATTCATTTTCCTTTCTTCTAAAATCTGAATCTCCGTTTTCTTTTCCGA
TTGAATATATGTATAAACAAATTAGTTGAGGGGGAATACGGGAAATGATTGAATTTTGGCAATGTTTGCTCCGGCTGACACGGCCGGCCATGTATGGATTAGATCAAGAA
GTGATCAAGATCAGGTAATGGAGGACTGTAAATTTGGAAGGTTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG
AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG
AGAGAGAGAGAGAGAGAGAGAGAGGTCAAATTAAATTTACAATTCAGATTTGGTAATTATTGAAAATGTGGTTTTTTTGCTTTAGAAACTGAAACTGAGAGGGGGGATAA
GACTTGTATTTTAATCTCTGGGAGCTTTCTGTTTTGTGAGTAAAATTTTTGATAGTGAAGTGATTGTGATTGGTATCCTTGGAATAATCTAATTACTTTTATATTCTTTT
CAAAGGAAAAAAAAAATCTAATTACGTTTATACTCATTATTACATTTAATTCAGATGGGTTGTCATGTTTGAAATAAAATTTTTGTTTAGTTTCTTTTAATACCTGAAGA
AATACTATAATTTTTATCTGGCCAC
Protein sequence
MKNTIRCCISCILPCGALDVIRIVHCNGHVEEIAGTIRASEVMKANPEHVLKKPSSPSDQLGVVPKIVIVPPDAELQRGKIYFLMPVPPPAARSRSLTKKKKRTSELLPE
TAVGSDLSVVVSDRYLSEILSTTGKEKRRGRVGVWRPHLESISEFPTDL
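
The protein sequence above appears to be the frame-1 translation of the CDS under the standard genetic code. A minimal sketch for checking this is given below. It assumes NCBI translation table 1 (the page does not name the table), and the `translate` function is a hypothetical helper, not a CuGenDBv2 tool. Only the first and last few codons of the CDS are used here; the full CDS string can be passed in the same way.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), built from the
# conventional TCAG-ordered amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGAAGAATACCATTAGATGCTGC"))
print(translate("ACCGATCTTTAA"))
```

Because `translate` strips whitespace first, the CDS can be pasted with its line breaks intact and the result compared against the protein sequence.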