CuGenDBv2

Moc06g06200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g06200
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BEST Arabidopsis thaliana protein match is: NHL domain-containing protein.
Genome location: chr6:4471047..4472712
RNA-Seq Expression: Moc06g06200
Synteny: Moc06g06200
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits (e-value, % identity, alignment)
KAG6587852.1 hypothetical protein SDJN03_16417, partial [Cucurbita argyrosperma subsp. sororia] (e-value 8.4e-42, identity 68.09%)
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP
        GKKQRNRFQYDPESYALNF    +GEDD PP+GFS+R   P
Subjt:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP

XP_022135367.1 uncharacterized protein LOC111007336 [Momordica charantia] (e-value 7.1e-65, identity 97.6%)
Query:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
        MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
Subjt:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP

Query:  ESYALNFNGEDDGPPLGFSARDRFP
        ESYALNFNGEDDGPPLGFSAR   P
Subjt:  ESYALNFNGEDDGPPLGFSARDRFP

XP_022927019.1 uncharacterized protein LOC111433973 [Cucurbita moschata] (e-value 1.1e-41, identity 68.09%)
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP
        GKKQRNRFQYDPESYALNF    +GEDD PP+GFS+R   P
Subjt:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP

XP_023003195.1 uncharacterized protein LOC111496880 [Cucurbita maxima] (e-value 1.9e-41, identity 67.38%)
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP
        G+KQRNRFQYDPESYALNF    +GEDD PP+GFS+R   P
Subjt:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP

XP_038876901.1 uncharacterized protein LOC120069255 [Benincasa hispida] (e-value 9.9e-43, identity 67.83%)
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPP  PYSPLNE+ +D        P NGCGCFRLFGFGF+RN + E RNLL     R E SWM RKLKK+KEVSEMVAGPKWK F+RKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARDRFP
        GKKQRNRFQYDPESYALNF+G  DG      PP+GFS+R   P
Subjt:  GKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARDRFP

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3B8N4 uncharacterized protein LOC103487383 (e-value 1.0e-40, identity 65.28%)
Query:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA+HQ RPP  PYSPLN++ +D           NGCGCF+LFGFG +RN + EG NLL     R E SWM +KLKKVKEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARDRFP
        KGKKQRNRFQYDPESYALNF+G  DG      PP+GFS+R   P
Subjt:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARDRFP

A0A5A7U6Q0 Uncharacterized protein (e-value 1.0e-40, identity 65.28%)
Query:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA+HQ RPP  PYSPLN++ +D           NGCGCF+LFGFG +RN + EG NLL     R E SWM +KLKKVKEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQPRPPPMPYSPLNEEPEDP--------PPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARDRFP
        KGKKQRNRFQYDPESYALNF+G  DG      PP+GFS+R   P
Subjt:  KGKKQRNRFQYDPESYALNFNGEDDG------PPLGFSARDRFP

A0A6J1C2H6 uncharacterized protein LOC111007336 (e-value 3.4e-65, identity 97.6%)
Query:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
        MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP
Subjt:  MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDP

Query:  ESYALNFNGEDDGPPLGFSARDRFP
        ESYALNFNGEDDGPPLGFSAR   P
Subjt:  ESYALNFNGEDDGPPLGFSARDRFP

A0A6J1EGU0 uncharacterized protein LOC111433973 (e-value 5.3e-42, identity 68.09%)
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP
        GKKQRNRFQYDPESYALNF    +GEDD PP+GFS+R   P
Subjt:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP

A0A6J1KR34 uncharacterized protein LOC111496880 (e-value 9.1e-42, identity 67.38%)
Query:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK
        MA HQ RPPP PYSPLNEE E         P NGC CF+LFGFGF+RN + E  NLL     R E  WM +KLKK+KEVSEMVAGPKWK FIRKMGGYLK
Subjt:  MAAHQPRPPPMPYSPLNEEPED-------PPPNGCGCFRLFGFGFDRNADSEGRNLL-----RGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP
        G+KQRNRFQYDPESYALNF    +GEDD PP+GFS+R   P
Subjt:  GKKQRNRFQYDPESYALNF----NGEDDGPPLGFSARDRFP

SwissProt top hits
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e-value 1.0e-05, identity 32.11%)
Query:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFNG-------EDDGPPLGFSA
        L  +  W  R  ++++E SE+VAGP+WKT+IR+ G                G   G    NR      F+YD  SY+LNF+        +D+ P   +S 
Subjt:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFNG-------EDDGPPLGFSA

Query:  RDRFPAEPV
        R   P+ PV
Subjt:  RDRFPAEPV

AT3G48020.1 unknown protein (e-value 1.9e-07, identity 39.13%)
Query:  EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQRNRFQYDPESYALNFNGE--DDGPPLGF----SARDRFPAEPVPDWESP
        E  W  R   K++E SE+VAGP+WKTFIR+      +G+     ++F+YDP SY L+F  E  DD    G     S   R+ + PV   +SP
Subjt:  EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQRNRFQYDPESYALNFNGE--DDGPPLGF----SARDRFPAEPVPDWESP

AT5G14890.1 NHL domain-containing protein (e-value 3.2e-07, identity 37.5%)
Query:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQRNRFQYDPESYALNFNG-------EDDGPPLGFSARDRFPAEPV
        L  +  W      K++E SE+VAGPKWKTFIR+ G      G + G   + +   F+YD  SY+LNF+        ED+ P   +S R   P+ PV
Subjt:  LRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQRNRFQYDPESYALNFNG-------EDDGPPLGFSARDRFPAEPV

AT5G25240.1 unknown protein (e-value 3.9e-13, identity 40.71%)
Query:  GCGCFRLFGFGFDRNADSEGRNLLRG----------EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFNGEDDGPPL
        GCG FR F F   R  D E R+  RG           G+W + KLK +KE+SE +AGPKWK FIR      K  ++   F YD ++Y+LNF+   DG   
Subjt:  GCGCFRLFGFGFDRNADSEGRNLLRG----------EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFNGEDDGPPL

Query:  GFSARDRFPAEPV
          S+ +RF A  V
Subjt:  GFSARDRFPAEPV

AT5G62865.1 unknown protein (e-value 8.5e-08, identity 32.14%)
Query:  EPEDPPPNGCGCFRLF------------GFGFDRNADSEGRNLLRG-EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---RNRFQYDPES
        +P+      C CF  F             +G  R  D    +   G E  W  R   K++E SE+VAGP+WKTFIR+     +  +      +FQYDP S
Subjt:  EPEDPPPNGCGCFRLF------------GFGFDRNADSEGRNLLRG-EGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---RNRFQYDPES

Query:  YALNFNGEDD-------GPPLGFSARDRFPAEPVPDWESP
        Y+LNF+ +D+       G    FS   RF + PV   ++P
Subjt:  YALNFNGEDD-------GPPLGFSARDRFPAEPVPDWESP


Sequences

CDS sequence
ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTCAACGAGGAACCAGAGGATCCCCCGCCAAATGGGTGCGGCTGTTTCCGGCTATTCGGCTTCGG
ATTCGACCGGAATGCCGATTCCGAAGGTAGGAATCTTCTGCGGGGAGAGGGATCTTGGATGGCAAGGAAATTGAAGAAGGTGAAGGAGGTTTCGGAGATGGTGGCCGGAC
CCAAATGGAAGACGTTTATCAGAAAGATGGGCGGTTATTTGAAGGGGAAGAAGCAGAGGAATAGGTTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCAATGGAGAA
GATGATGGCCCGCCCCTTGGCTTCTCTGCTAGGGACAGGTTCCCAGCGGAACCCGTTCCCGATTGGGAATCCCCGCCCCGTCCCCGCCCCCAAAATTTCGGGGACGGGGG
TGGGGATGGGGATGGCCCCCGGCCCAGCCTCGCCCCGTTGCCATCCCTAGCTACGATGTTGCAACTCTGA
mRNA sequence
ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTCAACGAGGAACCAGAGGATCCCCCGCCAAATGGGTGCGGCTGTTTCCGGCTATTCGGCTTCGG
ATTCGACCGGAATGCCGATTCCGAAGGTAGGAATCTTCTGCGGGGAGAGGGATCTTGGATGGCAAGGAAATTGAAGAAGGTGAAGGAGGTTTCGGAGATGGTGGCCGGAC
CCAAATGGAAGACGTTTATCAGAAAGATGGGCGGTTATTTGAAGGGGAAGAAGCAGAGGAATAGGTTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCAATGGAGAA
GATGATGGCCCGCCCCTTGGCTTCTCTGCTAGGGACAGGTTCCCAGCGGAACCCGTTCCCGATTGGGAATCCCCGCCCCGTCCCCGCCCCCAAAATTTCGGGGACGGGGG
TGGGGATGGGGATGGCCCCCGGCCCAGCCTCGCCCCGTTGCCATCCCTAGCTACGATGTTGCAACTCTGA
Protein sequence
MAAHQPRPPPMPYSPLNEEPEDPPPNGCGCFRLFGFGFDRNADSEGRNLLRGEGSWMARKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQRNRFQYDPESYALNFNGE
DDGPPLGFSARDRFPAEPVPDWESPPRPRPQNFGDGGGDGDGPRPSLAPLPSLATMLQL
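
The protein sequence listed for this gene should correspond to the translation of its CDS under the standard genetic code. The following is a minimal sketch of that check, not part of the database record: the `translate` function and the `CODON_TABLE` name are illustrative, and the code assumes an in-frame sequence containing only A, C, G and T.

```python
from itertools import product

# Standard genetic code, built with the bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence wrapped over
    several lines can be passed in directly. Ambiguous bases such as N
    are not handled and raise a KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the Moc06g06200 CDS shown above.
cds_start = "ATGGCGGCGCATCAACCCAGGCCACCCCCCATGCCCTACTCGCCACTC"
print(translate(cds_start))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its lines, gives a quick consistency check for the record.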