; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0546 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0546
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionBEST Arabidopsis thaliana protein match is: NHL domain-containing protein .
Genome locationMC06:4474957..4475346
RNA-Seq ExpressionMC06g0546
SyntenyMC06g0546
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587852.1 hypothetical protein SDJN03_16417, partial [Cucurbita argyrosperma subsp. sororia]3.13e-6169.86Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF+G    EDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE

XP_022135367.1 uncharacterized protein LOC111007336 [Momordica charantia]1.03e-92100Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
        MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
Subjt:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP

Query:  ESYALNFNGEDDGPPLGFSARFAVPMAARE
        ESYALNFNGEDDGPPLGFSARFAVPMAARE
Subjt:  ESYALNFNGEDDGPPLGFSARFAVPMAARE

XP_022927019.1 uncharacterized protein LOC111433973 [Cucurbita moschata]4.45e-6169.86Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDP-------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP-------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF+G    EDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE

XP_023003195.1 uncharacterized protein LOC111496880 [Cucurbita maxima]8.97e-6169.18Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE
        G+KQRNRFQYDPESYALNF+G    EDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE

XP_038876901.1 uncharacterized protein LOC120069255 [Benincasa hispida]2.02e-6269.59Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPP  PYSPLNE+ +D        P NGCGCFRLFGFGF+RN + E RNLL     R E SWM RKLKK+KEVSEMVAGPKWK F+RKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF+G  DG      PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARFAVPMAARE

TrEMBL top hitse value%identityAlignment
A0A1S3B8N4 uncharacterized protein LOC1034873831.05e-5967.11Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA+HQ RPP  PYSPLN++ +D           NGCGCF+LFGFG +RN + EG NLL     R E SWM +KLKKVKEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARFAVPMAARE
        KGKKQRNRFQYDPESYALNF+G  DG      PP+GFS+RFAVP+A+RE
Subjt:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARFAVPMAARE

A0A5A7U6Q0 Uncharacterized protein1.05e-5967.11Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA+HQ RPP  PYSPLN++ +D           NGCGCF+LFGFG +RN + EG NLL     R E SWM +KLKKVKEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARFAVPMAARE
        KGKKQRNRFQYDPESYALNF+G  DG      PP+GFS+RFAVP+A+RE
Subjt:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARFAVPMAARE

A0A6J1C2H6 uncharacterized protein LOC1110073364.97e-93100Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
        MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
Subjt:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP

Query:  ESYALNFNGEDDGPPLGFSARFAVPMAARE
        ESYALNFNGEDDGPPLGFSARFAVPMAARE
Subjt:  ESYALNFNGEDDGPPLGFSARFAVPMAARE

A0A6J1EGU0 uncharacterized protein LOC1114339732.15e-6169.86Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDP-------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP-------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF+G    EDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE

A0A6J1KR34 uncharacterized protein LOC1114968804.34e-6169.18Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE
        G+KQRNRFQYDPESYALNF+G    EDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFNG----EDDGPPLGFSARFAVPMAARE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)9.4e-0733.33Show/hide
Query:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFNG-------EDDGPPLGFSA
        L  +  W  R  ++++E SE+VAGP+WKT+IR+ G                G   G    NR      F+YD  SY+LNF+        +D+ P   +S 
Subjt:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFNG-------EDDGPPLGFSA

Query:  RFAVP
        RFA P
Subjt:  RFAVP

AT3G48020.1 unknown protein1.7e-0840.91Show/hide
Query:  EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQRNRFQYDPESYALNFNGEDD--------GPPLGFSARFA-VPMAA
        E  W  R   K++E SE+VAGP+WKTFIR+      +G+     ++F+YDP SY L+F  ED         G    FS R+A VP+A+
Subjt:  EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQRNRFQYDPESYALNFNGEDD--------GPPLGFSARFA-VPMAA

AT5G14890.1 NHL domain-containing protein2.9e-0839.13Show/hide
Query:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQRNRFQYDPESYALNFNG-------EDDGPPLGFSARFAVP
        L  +  W      K++E SE+VAGPKWKTFIR+ G      G + G   + +   F+YD  SY+LNF+        ED+ P   +S RFA P
Subjt:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQRNRFQYDPESYALNFNG-------EDDGPPLGFSARFAVP

AT5G25240.1 unknown protein1.8e-1338.94Show/hide
Query:  GCGCFRLFGFGFDRNADSEGRNLLRG----------EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFNGEDDGPPL
        GCG FR F F   R  D E R+  RG           G+W + KLK +KE+SE +AGPKWK FIR      K  ++   F YD ++Y+LNF+   DG   
Subjt:  GCGCFRLFGFGFDRNADSEGRNLLRG----------EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFNGEDDGPPL

Query:  GFSARFAVPMAAR
            RF  P+  +
Subjt:  GFSARFAVPMAAR

AT5G62865.1 unknown protein7.7e-0933.59Show/hide
Query:  EPEDPPPNGCGCFRLF------------GFGFDRNADSEGRNLLRG-EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---RNRFQYDPES
        +P+      C CF  F             +G  R  D    +   G E  W  R   K++E SE+VAGP+WKTFIR+     +  +      +FQYDP S
Subjt:  EPEDPPPNGCGCFRLF------------GFGFDRNADSEGRNLLRG-EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---RNRFQYDPES

Query:  YALNFNGEDD-------GPPLGFSARFA
        Y+LNF+ +D+       G    FS RFA
Subjt:  YALNFNGEDD-------GPPLGFSARFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTCAACGAGGAACCAGAGGATCCCCCGCCAAATGGGTGCGGCTGTTTCCGGCTATTCGGCTTCGG
ATTCGACCGGAATGCCGATTCCGAAGGTAGGAATCTTCTGCGGGGAGAGGGATCTTGGATGGCAAGGAAATTGAAGAAGGTGAAGGAGGTTTCGGAGATGGTGGCCGGAC
CCAAATGGAAGACGTTTATCAGAAAGATGGGCGGTTATTTGAAGGGGAAGAAGCAGAGGAATAGGTTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCAATGGAGAA
GATGATGGCCCGCCCCTTGGCTTCTCTGCTAGGTTTGCTGTGCCCATGGCGGCCAGAGAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTCAACGAGGAACCAGAGGATCCCCCGCCAAATGGGTGCGGCTGTTTCCGGCTATTCGGCTTCGG
ATTCGACCGGAATGCCGATTCCGAAGGTAGGAATCTTCTGCGGGGAGAGGGATCTTGGATGGCAAGGAAATTGAAGAAGGTGAAGGAGGTTTCGGAGATGGTGGCCGGAC
CCAAATGGAAGACGTTTATCAGAAAGATGGGCGGTTATTTGAAGGGGAAGAAGCAGAGGAATAGGTTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCAATGGAGAA
GATGATGGCCCGCCCCTTGGCTTCTCTGCTAGGTTTGCTGTGCCCATGGCGGCCAGAGAA
Protein sequenceShow/hide protein sequence
MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFNGE
DDGPPLGFSARFAVPMAARE