; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002374 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002374
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBEST Arabidopsis thaliana protein match is: NHL domain-containing protein .
Genome locationscaffold30:4462128..4462517
RNA-Seq ExpressionMS002374
SyntenyMS002374
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587852.1 hypothetical protein SDJN03_16417, partial [Cucurbita argyrosperma subsp. sororia]5.1e-4770.55Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF    DGEDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE

XP_022135367.1 uncharacterized protein LOC111007336 [Momordica charantia]1.2e-6999.23Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
        MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
Subjt:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP

Query:  ESYALNFDGEDDGPPLGFSARFAVPMAARE
        ESYALNF+GEDDGPPLGFSARFAVPMAARE
Subjt:  ESYALNFDGEDDGPPLGFSARFAVPMAARE

XP_022927019.1 uncharacterized protein LOC111433973 [Cucurbita moschata]6.7e-4770.55Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF    DGEDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE

XP_023003195.1 uncharacterized protein LOC111496880 [Cucurbita maxima]1.1e-4669.86Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE
        G+KQRNRFQYDPESYALNF    DGEDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE

XP_038876901.1 uncharacterized protein LOC120069255 [Benincasa hispida]6.0e-4870.27Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPP  PYSPLNE+ +D        P NGCGCFRLFGFGF+RN + E RNLL     R E SWM RKLKK+KEVSEMVAGPKWK F+RKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGEDDG------PPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNFDG  DG      PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNFDGEDDG------PPLGFSARFAVPMAARE

TrEMBL top hitse value%identityAlignment
A0A1S3B8N4 uncharacterized protein LOC1034873836.1e-4667.79Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA+HQ RPP  PYSPLN++ +D           NGCGCF+LFGFG +RN + EG NLL     R E SWM +KLKKVKEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGEDDG------PPLGFSARFAVPMAARE
        KGKKQRNRFQYDPESYALNFDG  DG      PP+GFS+RFAVP+A+RE
Subjt:  KGKKQRNRFQYDPESYALNFDGEDDG------PPLGFSARFAVPMAARE

A0A5A7U6Q0 Uncharacterized protein6.1e-4667.79Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA+HQ RPP  PYSPLN++ +D           NGCGCF+LFGFG +RN + EG NLL     R E SWM +KLKKVKEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGEDDG------PPLGFSARFAVPMAARE
        KGKKQRNRFQYDPESYALNFDG  DG      PP+GFS+RFAVP+A+RE
Subjt:  KGKKQRNRFQYDPESYALNFDGEDDG------PPLGFSARFAVPMAARE

A0A6J1C2H6 uncharacterized protein LOC1110073366.0e-7099.23Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
        MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
Subjt:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP

Query:  ESYALNFDGEDDGPPLGFSARFAVPMAARE
        ESYALNF+GEDDGPPLGFSARFAVPMAARE
Subjt:  ESYALNFDGEDDGPPLGFSARFAVPMAARE

A0A6J1EGU0 uncharacterized protein LOC1114339733.2e-4770.55Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE
        GKKQRNRFQYDPESYALNF    DGEDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE

A0A6J1KR34 uncharacterized protein LOC1114968805.5e-4769.86Show/hide
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE
        G+KQRNRFQYDPESYALNF    DGEDD PP+GFS+RFAVP+A+RE
Subjt:  GKKQRNRFQYDPESYALNF----DGEDDGPPLGFSARFAVPMAARE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)2.5e-0734.29Show/hide
Query:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFDG-------EDDGPPLGFSA
        L  +  W  R  ++++E SE+VAGP+WKT+IR+ G                G   G    NR      F+YD  SY+LNFD        +D+ P   +S 
Subjt:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFDG-------EDDGPPLGFSA

Query:  RFAVP
        RFA P
Subjt:  RFAVP

AT3G48020.1 unknown protein1.0e-0840.91Show/hide
Query:  EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQRNRFQYDPESYALNFDGEDD--------GPPLGFSARFA-VPMAA
        E  W  R   K++E SE+VAGP+WKTFIR+      +G+     ++F+YDP SY L+F+ ED         G    FS R+A VP+A+
Subjt:  EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQRNRFQYDPESYALNFDGEDD--------GPPLGFSARFA-VPMAA

AT5G14890.1 NHL domain-containing protein7.7e-0940.22Show/hide
Query:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQRNRFQYDPESYALNFDG-------EDDGPPLGFSARFAVP
        L  +  W      K++E SE+VAGPKWKTFIR+ G      G + G   + +   F+YD  SY+LNFD        ED+ P   +S RFA P
Subjt:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQRNRFQYDPESYALNFDG-------EDDGPPLGFSARFAVP

AT5G25240.1 unknown protein4.7e-1439.82Show/hide
Query:  GCGCFRLFGFGFDRNADSEGRNLLRG----------EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFDGEDDGPPL
        GCG FR F F   R  D E R+  RG           G+W + KLK +KE+SE +AGPKWK FIR      K  ++   F YD ++Y+LNFD   DG   
Subjt:  GCGCFRLFGFGFDRNADSEGRNLLRG----------EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFDGEDDGPPL

Query:  GFSARFAVPMAAR
            RF  P+  +
Subjt:  GFSARFAVPMAAR

AT5G62865.1 unknown protein2.0e-0934.38Show/hide
Query:  EPEDPPPNGCGCFRLF------------GFGFDRNADSEGRNLLRG-EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---RNRFQYDPES
        +P+      C CF  F             +G  R  D    +   G E  W  R   K++E SE+VAGP+WKTFIR+     +  +      +FQYDP S
Subjt:  EPEDPPPNGCGCFRLF------------GFGFDRNADSEGRNLLRG-EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---RNRFQYDPES

Query:  YALNFDGEDD-------GPPLGFSARFA
        Y+LNFD +D+       G    FS RFA
Subjt:  YALNFDGEDD-------GPPLGFSARFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTCAACGAGGAACCAGAGGATCCCCCGCCAAATGGGTGCGGCTGTTTCCGGCTATTCGGCTTCGG
ATTCGACCGGAACGCCGATTCCGAAGGTAGGAATCTTCTGCGGGGAGAGGGATCTTGGATGGCAAGGAAATTGAAGAAGGTGAAGGAGGTTTCGGAGATGGTGGCCGGAC
CCAAATGGAAGACGTTTATCAGAAAGATGGGCGGTTATTTGAAGGGGAAGAAGCAGAGGAATAGGTTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCGATGGAGAA
GATGATGGCCCGCCCCTTGGCTTCTCTGCTAGGTTTGCTGTGCCCATGGCGGCCAGAGAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTCAACGAGGAACCAGAGGATCCCCCGCCAAATGGGTGCGGCTGTTTCCGGCTATTCGGCTTCGG
ATTCGACCGGAACGCCGATTCCGAAGGTAGGAATCTTCTGCGGGGAGAGGGATCTTGGATGGCAAGGAAATTGAAGAAGGTGAAGGAGGTTTCGGAGATGGTGGCCGGAC
CCAAATGGAAGACGTTTATCAGAAAGATGGGCGGTTATTTGAAGGGGAAGAAGCAGAGGAATAGGTTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCGATGGAGAA
GATGATGGCCCGCCCCTTGGCTTCTCTGCTAGGTTTGCTGTGCCCATGGCGGCCAGAGAA
Protein sequenceShow/hide protein sequence
MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFDGE
DDGPPLGFSARFAVPMAARE