CuGenDBv2

MC03g1053 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1053
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: NHL domain protein
Genome location: MC03:16904166..16904645
RNA-Seq Expression: MC03g1053
Synteny: MC03g1053
Gene Ontology terms: NA
InterPro domains: NA
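
The genome location uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not say so explicitly) and using the hypothetical helper names `parse_location` and `span_length`, the span can be parsed and its length computed like this:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC03:16904166..16904645")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 480 bp, which is consistent with the 160-residue protein sequence listed on this page (160 × 3 = 480).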


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6584232.1 hypothetical protein SDJN03_20164, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.76e-83; % identity: 75.78)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRY C CFPCF    S  DE S+WER K+K    KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA
        NR AAVK GKFQYDPISYALNFDEG  GDVDFEGDEY+GGF +FS RF++VP + KSS +A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata] (e-value: 2.62e-85; % identity: 77.02)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRY C CFPCF    S  DE S+WER K+K    KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA
        NR AAVKLGKFQYDPISYALNFDEG  GDVDFEGDEY+GGF NFS RF++VP + KSS +A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima] (e-value: 1.30e-85; % identity: 77.02)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRYSC CFPCF    S  DE S+WER K+K    KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA
        NR AAVKLGKFQYDPISYALNFDEG  GDV+FEGDEY+GGF NFS RF++VP + KSS +A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo] (e-value: 6.43e-86; % identity: 77.64)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRYSC CFPCF    S  DE S+WER K+K    KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA
        NR AAVKLGKFQYDPISYALNFDEG  GDVDFEGDEY+GGF NFS RF++VP + KSS +A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida] (e-value: 6.58e-84; % identity: 77.36)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGA +E E  GEIDGADALLAS+RYSC CFPCF    S  DE S+WERVK+K    KFDG+DHWW+GG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGG-FHNFSGRFASVPVAVKSS
        NR A VKLGKFQYDPISYALNFD+G  GDVDF+GDEY+GG F NFS RFA+VP  VKSS
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGG-FHNFSGRFASVPVAVKSS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LU30 Uncharacterized protein (e-value: 1.13e-77; % identity: 74.7)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDH-WWSGGLRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGA +E E    IDGADALL S+RYSC CFPCF    S  DE S+WERVK+K    KFD +DH WW+GG+RSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDH-WWSGGLRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYN--GGFHNFSGRFASVPVA-VKSSASAA
        RNR A VKLGKFQYDPISYALNFDEG  GDVDF+GDEYN  GGF NFS RFA++P A VKSS+SAA
Subjt:  RNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYN--GGFHNFSGRFASVPVA-VKSSASAA

A0A5A7ULQ3 Uncharacterized protein (e-value: 1.58e-75; % identity: 74.25)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGA +E E    IDGADALLAS+RYSC CFPCF  + S  DE S+WERVK+K    KFDG+DHWW+GG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPA-GDVDFEGD-EYNGG--FHNFSGRFASVPVA-VKSSASAA
        NR A VKLGKFQYDPISYALNFDEG   GDVDFEG+ EY GG  F NFS RFA+VP A +KSS+S A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPA-GDVDFEGD-EYNGG--FHNFSGRFASVPVA-VKSSASAA

A0A6J1ED51 uncharacterized protein LOC111431456 (e-value: 1.27e-85; % identity: 77.02)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRY C CFPCF    S  DE S+WER K+K    KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA
        NR AAVKLGKFQYDPISYALNFDEG  GDVDFEGDEY+GGF NFS RF++VP + KSS +A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA

A0A6J1GSC5 uncharacterized protein LOC111457052 (e-value: 3.69e-78; % identity: 72.22)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKG-----PKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRR
        MGDRDGA +ESE  GEI GADA L S RY+CLCFPCF    S+ DE S+WER K+          DG+DHWW+GG+RS+KKLREWSEIVAGPRWKTFIRR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKG-----PKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRR

Query:  FNRNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSA
        FNRNR AAVKLGKFQYDPISYALNFDEG  GDVDFE +E NGGF NFS RFA+VP  VKS+A
Subjt:  FNRNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSA

A0A6J1KJ01 uncharacterized protein LOC111495035 (e-value: 6.28e-86; % identity: 77.02)
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRYSC CFPCF    S  DE S+WER K+K    KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGP--KFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA
        NR AAVKLGKFQYDPISYALNFDEG  GDV+FEGDEY+GGF NFS RF++VP + KSS +A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASA

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e-value: 5.3e-20; % identity: 35.39)
Query:  SRESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR---
        S +S    E+D  D +   L ++R  C   PC   S PS      +W+R+ +   K + D+ WW   +R  +++REWSE+VAGPRWKT+IRRF R+    
Subjt:  SRESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR---

Query:  -------------------TAAVKLGKFQYDPISYALNFDEG-PAGDVDFEGDEYNGGFHNFSGRFA--SVPVAVKSS
                             +   GKF+YD +SY+LNFD+G   G  D   DE+   + ++S RFA  S+PV+ K S
Subjt:  -------------------TAAVKLGKFQYDPISYALNFDEG-PAGDVDFEGDEYNGGFHNFSGRFA--SVPVAVKSS

AT3G48020.1 unknown protein (e-value: 2.0e-19; % identity: 47.22)
Query:  SFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAV---KLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSG
        S+W+R+     +   +  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++    D D  G    GG  +FS 
Subjt:  SFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAV---KLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSG

Query:  RFASVPVA
        R+ASVPVA
Subjt:  RFASVPVA

AT5G14890.1 NHL domain-containing protein (e-value: 4.1e-20; % identity: 40.12)
Query:  ESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR--TAA
        ES    E+D  D +   + ++R  C   PC   S PS P+   +W+R+++   K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF RN      
Subjt:  ESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR--TAA

Query:  VKLG-------KFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFA--SVPVAVKSS
        +  G        F+YD  SY+LNFD+G      FE DE+   + ++S RFA  S+PV+ K S
Subjt:  VKLG-------KFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFA--SVPVAVKSS

AT5G25240.1 unknown protein (e-value: 2.8e-08; % identity: 47.37)
Query:  GLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQYDPISYALNFDEGPAG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G  G
Subjt:  GLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQYDPISYALNFDEGPAG

AT5G62865.1 unknown protein (e-value: 6.1e-24; % identity: 50)
Query:  CLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDH-----WWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLG---KFQYDPISYALNFD
        C CFP F RS  S     S W R+++        DH     WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KFQYDP+SY+LNFD
Subjt:  CLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDH-----WWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLG---KFQYDPISYALNFD

Query:  EGPAGDVDFEGDEY--NGGFHNFSGRFASVPV
        +      D E DEY   GG  +FS RFASVPV
Subjt:  EGPAGDVDFEGDEY--NGGFHNFSGRFASVPV
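
The % identity column can be related to the alignment rows above. The following is a rough sketch, not the site's own method: it assumes the BLAST-style convention that a letter in the middle row marks an identical residue pair, `+` marks a conservative substitution, and a space marks a mismatch or gap, and it takes the alignment length from the query row. The function name `percent_identity` is hypothetical; the two strings are copied from the AT5G25240.1 hit.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Identical columns divided by alignment length, as a percentage."""
    # Letters in the midline mark identical pairs; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The query row (gap characters included) gives the alignment length.
    return 100.0 * identical / len(query_row)


# Query row and middle row of the AT5G25240.1 alignment on this page.
query = "GLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQYDPISYALNFDEGPAG"
midline = "G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G  G"

print(round(percent_identity(query, midline), 2))
```

For this hit the middle row contains 27 letters over 57 alignment columns, which agrees with the listed 47.37.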


Sequences
CDS sequence
ATGGGAGACCGTGACGGAGCATCCAGAGAATCGGAACTCGCCGGAGAAATCGACGGCGCCGATGCTCTCTTGGCCTCTCGACGCTATTCCTGTCTATGCTTCCCTTGCTT
CCGATCCCACCCTTCGCTGCCCGACGAACCGTCTTTCTGGGAACGGGTAAAATCAAAAGGGCCCAAGTTCGACGGCGACGATCACTGGTGGAGCGGCGGCCTCAGATCCC
TCAAGAAGCTCCGTGAATGGTCCGAGATCGTTGCCGGCCCCAGATGGAAGACCTTCATTCGCCGCTTCAACCGGAACCGGACCGCCGCCGTCAAGCTCGGTAAATTCCAA
TACGACCCCATCAGTTACGCTTTGAATTTCGACGAGGGCCCTGCCGGCGATGTAGATTTCGAAGGCGACGAGTACAACGGCGGGTTTCATAACTTCTCCGGCCGGTTTGC
TTCCGTGCCGGTGGCGGTGAAATCGTCGGCATCTGCAGCG
mRNA sequence
ATGGGAGACCGTGACGGAGCATCCAGAGAATCGGAACTCGCCGGAGAAATCGACGGCGCCGATGCTCTCTTGGCCTCTCGACGCTATTCCTGTCTATGCTTCCCTTGCTT
CCGATCCCACCCTTCGCTGCCCGACGAACCGTCTTTCTGGGAACGGGTAAAATCAAAAGGGCCCAAGTTCGACGGCGACGATCACTGGTGGAGCGGCGGCCTCAGATCCC
TCAAGAAGCTCCGTGAATGGTCCGAGATCGTTGCCGGCCCCAGATGGAAGACCTTCATTCGCCGCTTCAACCGGAACCGGACCGCCGCCGTCAAGCTCGGTAAATTCCAA
TACGACCCCATCAGTTACGCTTTGAATTTCGACGAGGGCCCTGCCGGCGATGTAGATTTCGAAGGCGACGAGTACAACGGCGGGTTTCATAACTTCTCCGGCCGGTTTGC
TTCCGTGCCGGTGGCGGTGAAATCGTCGGCATCTGCAGCG
Protein sequence
MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQ
YDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSGRFASVPVAVKSSASAA
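
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard nuclear genetic code (the page does not name a translation table) and using a hypothetical `translate` helper, could look like this:

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and last 15 nucleotides of the CDS shown above.
print(translate("ATGGGAGACCGTGACGGAGCATCC"))
print(translate("TCGGCATCTGCAGCG"))
```

The two fragments translate to `MGDRDGAS` and `SASAA`, matching the start and end of the protein sequence above. The CDS as listed ends on an alanine codon (GCG) rather than a stop codon.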