; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020038 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020038
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNHL domain protein
Genome locationscaffold22:1341441..1341926
RNA-Seq ExpressionMS020038
SyntenyMS020038
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584232.1 hypothetical protein SDJN03_20164, partial [Cucurbita argyrosperma subsp. sororia]1.2e-6476.07Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRY C CFPCF    S  DE S+WER   K+K  KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA
        NR AAVK GKFQYDPISYALNFDEG  GDVDFEGDEY+GGF +FSDRF++VP + KSS +A A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata]4.7e-6677.3Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRY C CFPCF    S  DE S+WER   K+K  KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA
        NR AAVKLGKFQYDPISYALNFDEG  GDVDFEGDEY+GGF NFSDRF++VP + KSS +A A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima]2.7e-6677.3Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRYSC CFPCF    S  DE S+WER   K+K  KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA
        NR AAVKLGKFQYDPISYALNFDEG  GDV+FEGDEY+GGF NFSDRF++VP + KSS +A A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo]1.6e-6677.91Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRYSC CFPCF    S  DE S+WER   K+K  KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA
        NR AAVKLGKFQYDPISYALNFDEG  GDVDFEGDEY+GGF NFSDRF++VP + KSS +A A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida]8.8e-6576.69Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERV--KSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGA +E E  GEIDGADALLAS+RYSC CFPCF    S  DE S+WERV  K+K  KFDG+DHWW+GG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERV--KSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYN-GGFHNFSDRFASVPVAVKSSASAA
        NR A VKLGKFQYDPISYALNFD+G  GDVDF+GDEY+ GGF NFSDRFA+VP  VKSS + A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYN-GGFHNFSDRFASVPVAVKSSASAA

TrEMBL top hitse value%identityAlignment
A0A0A0LU30 Uncharacterized protein4.1e-6075.3Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERV--KSKGPKFDGDD-HWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGA +E E    IDGADALL S+RYSC CFPCF    S  DE S+WERV  K+K  KFD +D HWW+GG+RSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERV--KSKGPKFDGDD-HWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYN--GGFHNFSDRFASVPVA-VKSSASAA
        RNR A VKLGKFQYDPISYALNFDEG  GDVDF+GDEYN  GGF NFSDRFA++P A VKSS+SAA
Subjt:  RNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYN--GGFHNFSDRFASVPVA-VKSSASAA

A0A5A7ULQ3 Uncharacterized protein1.7e-5874.85Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERV--KSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGA +E E    IDGADALLAS+RYSC CFPCF  + S  DE S+WERV  K+K  KFDG+DHWW+GG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERV--KSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEG-PAGDVDFEGD-EY--NGGFHNFSDRFASVPVA-VKSSASAA
        NR A VKLGKFQYDPISYALNFDEG   GDVDFEG+ EY   GGF NFSDRFA+VP A +KSS+S A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEG-PAGDVDFEGD-EY--NGGFHNFSDRFASVPVA-VKSSASAA

A0A6J1ED51 uncharacterized protein LOC1114314562.3e-6677.3Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRY C CFPCF    S  DE S+WER   K+K  KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA
        NR AAVKLGKFQYDPISYALNFDEG  GDVDFEGDEY+GGF NFSDRF++VP + KSS +A A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA

A0A6J1GSC5 uncharacterized protein LOC1114570525.4e-6072.22Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKG-----PKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRR
        MGDRDGA +ESE  GEI GADA L S RY+CLCFPCF    S+ DE S+WER K+          DG+DHWW+GG+RS+KKLREWSEIVAGPRWKTFIRR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKG-----PKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRR

Query:  FNRNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSA
        FNRNR AAVKLGKFQYDPISYALNFDEG  GDVDFE +E NGGF NFS+RFA+VP  VKS+A
Subjt:  FNRNRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSA

A0A6J1KJ01 uncharacterized protein LOC1114950351.3e-6677.3Show/hide
Query:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG  +E E  GEIDGADALLASRRYSC CFPCF    S  DE S+WER   K+K  KFDG+DHWWSGG+RSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWER--VKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA
        NR AAVKLGKFQYDPISYALNFDEG  GDV+FEGDEY+GGF NFSDRF++VP + KSS +A A
Subjt:  NRTAAVKLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)5.4e-2035.39Show/hide
Query:  SRESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR---
        S +S    E+D  D +   L ++R  C   PC   S PS      +W+R+ +   K + D+ WW   +R  +++REWSE+VAGPRWKT+IRRF R+    
Subjt:  SRESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR---

Query:  -------------------TAAVKLGKFQYDPISYALNFDEG-PAGDVDFEGDEYNGGFHNFSDRFA--SVPVAVKSS
                             +   GKF+YD +SY+LNFD+G   G  D   DE+   + ++S RFA  S+PV+ K S
Subjt:  -------------------TAAVKLGKFQYDPISYALNFDEG-PAGDVDFEGDEYNGGFHNFSDRFA--SVPVAVKSS

AT3G48020.1 unknown protein2.1e-1947.22Show/hide
Query:  SFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAV---KLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSD
        S+W+R+     +   +  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++    D D  G    GG  +FS 
Subjt:  SFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAV---KLGKFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSD

Query:  RFASVPVA
        R+ASVPVA
Subjt:  RFASVPVA

AT5G14890.1 NHL domain-containing protein4.1e-2040.12Show/hide
Query:  ESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR--TAA
        ES    E+D  D +   + ++R  C   PC   S PS P+   +W+R+++   K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF RN      
Subjt:  ESELAGEIDGADAL---LASRRYSCLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNR--TAA

Query:  VKLG-------KFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFA--SVPVAVKSS
        +  G        F+YD  SY+LNFD+G      FE DE+   + ++S RFA  S+PV+ K S
Subjt:  VKLG-------KFQYDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFA--SVPVAVKSS

AT5G25240.1 unknown protein2.8e-0847.37Show/hide
Query:  GLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQYDPISYALNFDEGPAG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G  G
Subjt:  GLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQYDPISYALNFDEGPAG

AT5G62865.1 unknown protein3.6e-2450Show/hide
Query:  CLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDH-----WWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLG---KFQYDPISYALNFD
        C CFP F RS  S     S W R+++        DH     WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KFQYDP+SY+LNFD
Subjt:  CLCFPCF-RSHPSLPDEPSFWERVKSKGPKFDGDDH-----WWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLG---KFQYDPISYALNFD

Query:  EGPAGDVDFEGDEY--NGGFHNFSDRFASVPV
        +      D E DEY   GG  +FS RFASVPV
Subjt:  EGPAGDVDFEGDEY--NGGFHNFSDRFASVPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACCGTGACGGAGCATCCAGAGAATCGGAACTCGCCGGAGAAATCGACGGCGCCGATGCTCTCTTGGCCTCTCGACGCTATTCCTGTCTATGCTTCCCTTGCTT
CCGATCCCACCCTTCGCTGCCCGACGAACCGTCTTTCTGGGAACGGGTAAAATCAAAAGGGCCCAAGTTCGACGGCGACGATCACTGGTGGAGCGGCGGCCTCAGATCCC
TCAAGAAGCTCCGTGAATGGTCCGAGATCGTTGCCGGCCCCAGATGGAAGACCTTCATTCGCCGCTTCAACCGGAACCGGACCGCCGCCGTCAAGCTCGGTAAATTCCAA
TACGACCCCATCAGTTACGCTTTGAATTTCGACGAGGGCCCTGCCGGCGATGTAGATTTCGAAGGCGACGAGTACAACGGCGGGTTTCATAACTTCTCCGACCGGTTTGC
TTCCGTGCCGGTGGCGGTGAAATCGTCGGCATCTGCAGCGGCCGGT
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACCGTGACGGAGCATCCAGAGAATCGGAACTCGCCGGAGAAATCGACGGCGCCGATGCTCTCTTGGCCTCTCGACGCTATTCCTGTCTATGCTTCCCTTGCTT
CCGATCCCACCCTTCGCTGCCCGACGAACCGTCTTTCTGGGAACGGGTAAAATCAAAAGGGCCCAAGTTCGACGGCGACGATCACTGGTGGAGCGGCGGCCTCAGATCCC
TCAAGAAGCTCCGTGAATGGTCCGAGATCGTTGCCGGCCCCAGATGGAAGACCTTCATTCGCCGCTTCAACCGGAACCGGACCGCCGCCGTCAAGCTCGGTAAATTCCAA
TACGACCCCATCAGTTACGCTTTGAATTTCGACGAGGGCCCTGCCGGCGATGTAGATTTCGAAGGCGACGAGTACAACGGCGGGTTTCATAACTTCTCCGACCGGTTTGC
TTCCGTGCCGGTGGCGGTGAAATCGTCGGCATCTGCAGCGGCCGGT
Protein sequenceShow/hide protein sequence
MGDRDGASRESELAGEIDGADALLASRRYSCLCFPCFRSHPSLPDEPSFWERVKSKGPKFDGDDHWWSGGLRSLKKLREWSEIVAGPRWKTFIRRFNRNRTAAVKLGKFQ
YDPISYALNFDEGPAGDVDFEGDEYNGGFHNFSDRFASVPVAVKSSASAAAG