CuGenDBv2

Sed0019889 (gene) of Chayote v1 genome

Gene ID: Sed0019889
Organism: Sechium edule (Chayote v1)
Description: NHL domain-containing protein
Genome location: LG08:806954..807445
RNA-Seq Expression: Sed0019889
Synteny: Sed0019889
Gene Ontology terms: NA
InterPro domains: NA
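The genome location string can be split into a sequence name and coordinates. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into its parts and compute the span length.

    Assumption (not stated in the record): coordinates are 1-based and
    inclusive, so the span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("LG08:806954..807445")
print(seqid, start, end, span)  # LG08 806954 807445 492
```

Under that assumption the span is 492 nt, that is 164 codons, which is consistent with the 163-residue protein listed on this page plus one stop codon.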


Homology
GenBank top hits (e value, % identity, alignment)
KAG6587481.1 hypothetical protein SDJN03_16046, partial [Cucurbita argyrosperma subsp. sororia] (e value 3.9e-68, identity 84.11%)
Query:  EPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGKF
        E PPCEA+K NSLLPTTSCGC RLSCFGSRR TAG PSWWER+RAS+ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG    AR GKF
Subjt:  EPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGKF

Query:  QYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
        QYDPLSYAMNFDEGSGQIGEL+DD+DD+SGFRNFSARYASIP  LKT   T
Subjt:  QYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

KAG7021468.1 putative alpha,alpha-trehalose-phosphate synthase [UDP-forming] 10 [Cucurbita argyrosperma subsp. argyrosperma] (e value 3.9e-68, identity 83.55%)
Query:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK
        EEPP CEA+K NSLLPTTSCGC RLSCFGSRR  AG PSWWER+RAS+ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG    AR GK
Subjt:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK

Query:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
        FQYDPLSYAMNFDEGSGQIGEL+DD+DD+SGFRNFSARYASIP  LKT   T
Subjt:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

XP_022928159.1 uncharacterized protein LOC111435061 [Cucurbita moschata] (e value 3.9e-68, identity 83.55%)
Query:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK
        EEPP CEA+K NSLLPTTSCGC RLSCFGSRR  AG PSWWER+RAS+ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG    AR GK
Subjt:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK

Query:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
        FQYDPLSYAMNFDEGSGQIGEL+DD+DD+SGFRNFSARYASIP  LKT   T
Subjt:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

XP_023003975.1 uncharacterized protein LOC111497423 [Cucurbita maxima] (e value 3.9e-68, identity 83.55%)
Query:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK
        EEPP CEA+K NSLLPTTSC C RLSCFGSRR  AG PSWWER+RAS+ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG    AR GK
Subjt:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK

Query:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
        FQYDPLSYAMNFDEGSGQIGEL+DD+DD+SGFRNFSARYASIP  LKTG  T
Subjt:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

XP_038878924.1 uncharacterized protein LOC120071012 [Benincasa hispida] (e value 7.8e-69, identity 81.94%)
Query:  ANTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--GGG
        A TD+EE  PCEA+K NSLLPT  CGC+RLSCFGSRR     PSWWER+RASQ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  GGG
Subjt:  ANTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--GGG

Query:  GARVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTG
        G R GKFQYDPLSYAMNFDEGS QIGEL+DD+DD++G+RNFSARYASIPAPLKTG
Subjt:  GARVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BVG0 uncharacterized protein LOC103493921 (e value 1.0e-66, identity 76.88%)
Query:  NTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS------
        + D EE P CEA+K NSLLPT  CGC+RLSCFGSRR     PSWWER+RASQ H EGRWW RGV+V+LKLREWSEIVAGPRWKTFIRRFNRNRS      
Subjt:  NTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS------

Query:  --GGGGARVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTG
          GG G R GKFQYDPLSYAMNFDEGS QIGEL+DDMDD++G+RNFSARYASIPAPLKTG
Subjt:  --GGGGARVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTG

A0A5D3BK38 Putative Serine/arginine repetitive matrix protein 2 (e value 1.0e-66, identity 76.88%)
Query:  NTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS------
        + D EE P CEA+K NSLLPT  CGC+RLSCFGSRR     PSWWER+RASQ H EGRWW RGV+V+LKLREWSEIVAGPRWKTFIRRFNRNRS      
Subjt:  NTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS------

Query:  --GGGGARVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTG
          GG G R GKFQYDPLSYAMNFDEGS QIGEL+DDMDD++G+RNFSARYASIPAPLKTG
Subjt:  --GGGGARVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTG

A0A6J1E188 uncharacterized protein LOC111429865 (e value 1.0e-66, identity 78.34%)
Query:  ANTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGA
        A  D EEPPPCE +  NSLLPTT CGC+RLSC   RR  A  PSWWER+RASQ H EGRWW RGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GG GA
Subjt:  ANTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGA

Query:  -RVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
         R GKFQYDPLSYAMNFDEGS QIGEL+DD DD++G+RNFSARYASIPAPLKTG  T
Subjt:  -RVGKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

A0A6J1EK25 uncharacterized protein LOC111435061 (e value 1.9e-68, identity 83.55%)
Query:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK
        EEPP CEA+K NSLLPTTSCGC RLSCFGSRR  AG PSWWER+RAS+ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG    AR GK
Subjt:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK

Query:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
        FQYDPLSYAMNFDEGSGQIGEL+DD+DD+SGFRNFSARYASIP  LKT   T
Subjt:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

A0A6J1KTA4 uncharacterized protein LOC111497423 (e value 1.9e-68, identity 83.55%)
Query:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK
        EEPP CEA+K NSLLPTTSC C RLSCFGSRR  AG PSWWER+RAS+ H EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG    AR GK
Subjt:  EEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGG-GGARVGK

Query:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT
        FQYDPLSYAMNFDEGSGQIGEL+DD+DD+SGFRNFSARYASIP  LKTG  T
Subjt:  FQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVIT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e value 4.8e-24, identity 37.57%)
Query:  EEPPPCEAEKPN----SLLPTTSCGCYRLSCFGSRR-ETAGAPSWWERV-RASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNR-NRSGGG
        + PP  E +  +    +L     C C+ + C  S +  T G   WW+R+    +   + RWW+RG R   ++REWSE+VAGPRWKT+IRRF R N  GGG
Subjt:  EEPPPCEAEKPN----SLLPTTSCGCYRLSCFGSRR-ETAGAPSWWERV-RASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNR-NRSGGG

Query:  GARV-------------------GKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKT
        G RV                   GKF+YD LSY++NFD+G+ Q G  +D+      +R++S R+A+   P+ T
Subjt:  GARV-------------------GKFQYDPLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKT

AT3G48020.1 unknown protein (e value 1.4e-23, identity 45.99%)
Query:  EAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRN-RSGGGGARVGKFQYDPL
        E  + N LL  +SC      C      T    SWW+R+  +  H E RWW   VR  LK+REWSEIVAGPRWKTFIRRFNR+ R G       KF+YDP+
Subjt:  EAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRN-RSGGGGARVGKFQYDPL

Query:  SYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIP
        SY ++F++        +DD     G R+FS RYAS+P
Subjt:  SYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIP

AT5G14890.1 NHL domain-containing protein (e value 1.2e-22, identity 42.22%)
Query:  CYRLSCFGSRRETAGAPS-WWERVR-ASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGARVG-------KFQYDPLSYAMNFD
        C+ L C GS + +    S WW+R+R   +   + RWW+ G    +K+REWSEIVAGP+WKTFIRRF RN    GG   G        F+YD  SY++NFD
Subjt:  CYRLSCFGSRRETAGAPS-WWERVR-ASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGARVG-------KFQYDPLSYAMNFD

Query:  EGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKT
        +G  Q G  ED+      +R++S R+A+   P+ T
Subjt:  EGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKT

AT5G25240.1 unknown protein (e value 2.6e-06, identity 36.73%)
Query:  CGCYRLSCFGSRRETAGAPSWWERVRAS---QAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGARVGKFQYDPLSYAMNFDEG
        CGC     F   R   G      R   S   Q    G W   G   L  L+E SE +AGP+WK FIR F+  R      R   F YD  +Y++NFD+G
Subjt:  CGCYRLSCFGSRRETAGAPSWWERVRAS---QAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGARVGKFQYDPLSYAMNFDEG

AT5G62865.1 unknown protein (e value 1.1e-23, identity 50%)
Query:  CGCYRLSCFGSRRETAGAPSWWERVRA------SQAHG-EGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRN-RSGGGGARVGKFQYDPLSYAMNF
        C C+  S   SR  TA   S W R+R       S  HG E RWW   +R  LK+REWSEIVAGPRWKTFIRRFNR+ R G       KFQYDPLSY++NF
Subjt:  CGCYRLSCFGSRRETAGAPSWWERVRA------SQAHG-EGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRN-RSGGGGARVGKFQYDPLSYAMNF

Query:  DEGSGQIGELEDDMDDY---SGFRNFSARYASIP
        D+        +D+ D+Y    G R+FS R+AS+P
Subjt:  DEGSGQIGELEDDMDDY---SGFRNFSARYASIP
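In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how such a line could be summarized follows. The function name `midline_stats` is hypothetical, and the counts are only meaningful if the midline is passed at its full alignment width, since trailing spaces are easily lost in a text copy.

```python
def midline_stats(midline):
    """Count identities and positives on a BLAST-style alignment midline.

    Letters are identical positions, '+' marks a conservative substitution,
    and spaces are mismatches or gaps. Returns a tuple of
    (identities, positives, aligned_length).
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Toy midline, not taken from this record: 6 letters, 1 '+', 1 space.
identities, positives, length = midline_stats("QYDP+ LS")
print(identities, positives, length)  # 6 7 8
print(round(100 * identities / length, 2))  # 75.0
```

The percentages reported for each hit are computed over the whole alignment, so a single block would have to be combined with the other blocks of the same hit before comparing against the listed value.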


Sequences
CDS sequence
ATGGCGGCGAATACAGATCTAGAAGAGCCTCCTCCTTGCGAAGCGGAGAAACCCAATTCTCTATTGCCGACCACCTCCTGCGGTTGTTACCGGCTATCATGCTTCGGATC
TCGCCGGGAAACCGCCGGAGCGCCGTCGTGGTGGGAGCGGGTTAGGGCATCGCAAGCTCACGGCGAAGGGCGCTGGTGGCTGCGAGGCGTTAGGGTTCTTTTGAAGCTCC
GAGAATGGTCGGAGATCGTGGCTGGACCGAGATGGAAGACTTTCATCCGCCGATTTAATCGGAATCGGAGCGGCGGCGGCGGTGCTAGGGTTGGGAAATTTCAATACGAT
CCGTTGAGTTACGCTATGAATTTCGACGAAGGTTCGGGGCAGATAGGGGAATTGGAGGACGATATGGACGATTACAGTGGATTTCGGAACTTCTCTGCTCGTTACGCTTC
GATTCCGGCGCCGTTGAAGACCGGAGTAATCACCACCGCCACCGTCGCGTAA
mRNA sequence
ATGGCGGCGAATACAGATCTAGAAGAGCCTCCTCCTTGCGAAGCGGAGAAACCCAATTCTCTATTGCCGACCACCTCCTGCGGTTGTTACCGGCTATCATGCTTCGGATC
TCGCCGGGAAACCGCCGGAGCGCCGTCGTGGTGGGAGCGGGTTAGGGCATCGCAAGCTCACGGCGAAGGGCGCTGGTGGCTGCGAGGCGTTAGGGTTCTTTTGAAGCTCC
GAGAATGGTCGGAGATCGTGGCTGGACCGAGATGGAAGACTTTCATCCGCCGATTTAATCGGAATCGGAGCGGCGGCGGCGGTGCTAGGGTTGGGAAATTTCAATACGAT
CCGTTGAGTTACGCTATGAATTTCGACGAAGGTTCGGGGCAGATAGGGGAATTGGAGGACGATATGGACGATTACAGTGGATTTCGGAACTTCTCTGCTCGTTACGCTTC
GATTCCGGCGCCGTTGAAGACCGGAGTAATCACCACCGCCACCGTCGCGTAA
Protein sequence
MAANTDLEEPPPCEAEKPNSLLPTTSCGCYRLSCFGSRRETAGAPSWWERVRASQAHGEGRWWLRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGARVGKFQYD
PLSYAMNFDEGSGQIGELEDDMDDYSGFRNFSARYASIPAPLKTGVITTATVA
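
The relationship between the CDS and the protein sequence can be checked by translating the CDS with the standard genetic code. The sketch below is an illustration rather than the pipeline that produced this record: `translate` is a hypothetical helper, and using the standard nuclear code is an assumption. It translates only the first 30 and the last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 18 nt of the CDS of Sed0019889, copied from above.
print(translate("ATGGCGGCGAATACAGATCTAGAAGAGCCT"))  # MAANTDLEEP
print(translate("ACCGCCACCGTCGCGTAA"))  # TATVA*
```

Both fragments agree with the start (MAANTDLEEP) and end (TATVA) of the protein sequence, with the final TAA codon translated as the stop.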