CuGenDBv2

Sed0025664 (gene) of Chayote v1 genome

Gene ID: Sed0025664
Organism: Sechium edule (Chayote v1)
Description: NHL domain protein
Genome location: LG01:12135582..12136570
RNA-Seq Expression: Sed0025664
Synteny: Sed0025664
Gene Ontology terms: NA
InterPro domains: NA
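
The genome location given above follows a `sequence:start..end` pattern. As a minimal sketch, the Python below parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("LG01:12135582..12136570")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before using the coordinates downstream.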


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6584232.1 hypothetical protein SDJN03_20164, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.1e-74 | % identity: 83.95
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDG FKE+ES GEIDG DALLASRRY C CFPC    RSASDELSWWERAKAKA +TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV
        NRPA VK GKFQYDPISYALNFDEG+NGDVDFEG+E++GGFQ+FS RF+AVPA  KSS AAV
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata] | e value: 8.6e-76 | % identity: 85.19
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDG FKE+ES GEIDG DALLASRRY C CFPC    RSASDELSWWERAKAKA +TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV
        NRPA VKLGKFQYDPISYALNFDEG+NGDVDFEG+E++GGFQNFS RF+AVPA  KSS AAV
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima] | e value: 5.1e-76 | % identity: 85.19
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDG FKE+ES GEIDG DALLASRRYSC CFPC    RSASDELSWWERAKAKA +TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV
        NRPA VKLGKFQYDPISYALNFDEG+NGDV+FEG+E++GGFQNFS RF+AVPA  KSS AAV
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo] | e value: 3.0e-76 | % identity: 85.8
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDG FKE+ES GEIDG DALLASRRYSC CFPC    RSASDELSWWERAKAKA +TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV
        NRPA VKLGKFQYDPISYALNFDEG+NGDVDFEG+E++GGFQNFS RF+AVPA  KSS AAV
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida] | e value: 6.6e-76 | % identity: 84.34
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDGAFKELES GEIDG DALLAS+RYSC CFPC  PRRSASDE+SWWER K KA +TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFN-GGFQNFSGRFAAVPAPVKSSAA-AVTG
        NRPATVKLGKFQYDPISYALNFD+G+NGDVDF+G+E++ GGFQNFS RFAAVP PVKSS A AV G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFN-GGFQNFSGRFAAVPAPVKSSAA-AVTG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LU30 Uncharacterized protein | e value: 4.5e-70 | % identity: 81.07
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        M DRDGAFKELES   IDG DALL S+RYSC CFPC  P RS SDELSWWER K KA +TKFD +D HWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFN--GGFQNFSGRFAAV-PAPVK-SSAAAVTG
        RNRPATVKLGKFQYDPISYALNFDEG+NGDVDF+G+E+N  GGFQNFS RFAA+ PAPVK SS+AAV G
Subjt:  RNRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFN--GGFQNFSGRFAAV-PAPVK-SSAAAVTG

A0A5A7ULQ3 Uncharacterized protein | e value: 2.9e-69 | % identity: 81.82
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDGAFKELES   IDG DALLAS+RYSC CFPC  P RS SDELSWWER K KA +TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEG-NNGDVDFEGEE---FNGGFQNFSGRFAAV-PAPVKSSAA
        NRPATVKLGKFQYDPISYALNFDEG NNGDVDFEG       GGFQNFS RFAAV PAP+KSS++
Subjt:  NRPATVKLGKFQYDPISYALNFDEG-NNGDVDFEGEE---FNGGFQNFSGRFAAV-PAPVKSSAA

A0A6J1ED51 uncharacterized protein LOC111431456 | e value: 4.2e-76 | % identity: 85.19
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDG FKE+ES GEIDG DALLASRRY C CFPC    RSASDELSWWERAKAKA +TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV
        NRPA VKLGKFQYDPISYALNFDEG+NGDVDFEG+E++GGFQNFS RF+AVPA  KSS AAV
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV

A0A6J1GSC5 uncharacterized protein LOC111457052 | e value: 2.6e-70 | % identity: 80.86
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATA---TKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRR
        M DRDGAFKE ES GEI G DA L S RY+CLCFPC  PRRS SDE+SWWERAKA A A      DG+DHWW+GG+RS+KKLREWSEIVAGPRWKTFIRR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATA---TKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRR

Query:  FNRNRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSA
        FNRNRPA VKLGKFQYDPISYALNFDEGNNGDVDFE EE NGGF+NFS RFAAVPAPVKS+A
Subjt:  FNRNRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSA

A0A6J1KJ01 uncharacterized protein LOC111495035 | e value: 2.4e-76 | % identity: 85.19
Query:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        M DRDG FKE+ES GEIDG DALLASRRYSC CFPC    RSASDELSWWERAKAKA +TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV
        NRPA VKLGKFQYDPISYALNFDEG+NGDV+FEG+E++GGFQNFS RF+AVPA  KSS AAV
Subjt:  NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAV

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | e value: 1.9e-20 | % identity: 36.05
Query:  EIDGGDAL---LASRRYSCLCFPCLRPRRSASDELS-WWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---------
        E+D  D +   L ++R  C   PCL   + ++   S WW+R     T  K + D+ WW   IR  +++REWSE+VAGPRWKT+IRRF R+          
Subjt:  EIDGGDAL---LASRRYSCLCFPCLRPRRSASDELS-WWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---------

Query:  -------------PATVKLGKFQYDPISYALNFDEGN-NGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAA
                       +   GKF+YD +SY+LNFD+GN  G  D   +EF   ++++S RFAA   PV +  +
Subjt:  -------------PATVKLGKFQYDPISYALNFDEGN-NGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAA

AT3G48020.1 unknown protein | e value: 1.0e-18 | % identity: 41.6
Query:  SASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGNNGDVDFEGEEFN
        S + + SWW+R            +  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++ +  D D  G    
Subjt:  SASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGNNGDVDFEGEEFN

Query:  GGFQNFSGRFAAVPAPVKSSAAAVT
        GG ++FS R+A+VP     S A ++
Subjt:  GGFQNFSGRFAAVPAPVKSSAAAVT

AT5G14890.1 NHL domain-containing protein | e value: 9.3e-20 | % identity: 39.18
Query:  FKELES--AGEIDGGDAL---LASRRYSCLCFPCLRPRRSASDELS-WWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-
        F  LES    E+D  D +   + ++R  C   PCL   + +    S WW+R +   T  K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF R 
Subjt:  FKELES--AGEIDGGDAL---LASRRYSCLCFPCLRPRRSASDELS-WWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-

Query:  -----------NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAA
                   NRP  V    F+YD  SY+LNFD+G      FE +EF   ++++S RFAA   PV +  +
Subjt:  -----------NRPATVKLGKFQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAA

AT5G25240.1 unknown protein | e value: 1.3e-08 | % identity: 47.37
Query:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGNNG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G
Subjt:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGNNG

AT5G62865.1 unknown protein | e value: 1.9e-20 | % identity: 47.69
Query:  CLCFPCLRPRRSASD-ELSWWERAKA--KATATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF
        C CFP  R  RS++    S W R +    +  +   GD+  WW   IR+  K+REWSEIVAGPRWKTFIRRFNR+ P   +      KFQYDP+SY+LNF
Subjt:  CLCFPCLRPRRSASD-ELSWWERAKA--KATATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF

Query:  DEGNNGDVDFEGEEFNGGFQNFSGRFAAVP
        D+ +  D ++ G    GG ++FS RFA+VP
Subjt:  DEGNNGDVDFEGEEFNGGFQNFSGRFAAVP


Sequences
CDS sequence
ATGGCTGACCGTGACGGAGCTTTCAAAGAATTGGAATCCGCCGGCGAAATTGACGGCGGCGATGCTCTTCTGGCTTCTAGACGATACAGTTGTCTGTGCTTCCCTTGCCT
TAGACCTCGCCGGTCGGCTTCCGACGAGCTTTCTTGGTGGGAACGGGCGAAGGCGAAAGCAACTGCCACAAAGTTCGACGGCGACGATCACTGGTGGTCCGGCGGAATCA
GATCCCTCAAGAAGCTTCGTGAGTGGTCGGAGATCGTCGCCGGGCCGAGATGGAAGACCTTCATTCGCCGGTTCAACCGGAACCGGCCCGCCACCGTGAAGCTTGGGAAA
TTTCAATACGATCCCATCAGTTACGCTTTGAATTTCGACGAGGGCAATAACGGTGATGTGGACTTCGAAGGGGAGGAATTCAACGGTGGGTTTCAAAACTTTTCTGGCCG
GTTTGCTGCCGTGCCGGCACCGGTGAAGTCGTCGGCCGCTGCAGTGACTGGATAG
mRNA sequence
GCTGTAGCTTTCGTATAATCCAACTACCGGCTAACAAACGCTGGGCCCCACCATTTTACCGGCAGTCTCCGGTCTCCCTTTGTTTCTTACTTTTATATACCGGGAGCCCA
TTCGGACTCCGAATCCCTAGTTTTTCTCTTCGCCGCCATGGCTGACCGTGACGGAGCTTTCAAAGAATTGGAATCCGCCGGCGAAATTGACGGCGGCGATGCTCTTCTGG
CTTCTAGACGATACAGTTGTCTGTGCTTCCCTTGCCTTAGACCTCGCCGGTCGGCTTCCGACGAGCTTTCTTGGTGGGAACGGGCGAAGGCGAAAGCAACTGCCACAAAG
TTCGACGGCGACGATCACTGGTGGTCCGGCGGAATCAGATCCCTCAAGAAGCTTCGTGAGTGGTCGGAGATCGTCGCCGGGCCGAGATGGAAGACCTTCATTCGCCGGTT
CAACCGGAACCGGCCCGCCACCGTGAAGCTTGGGAAATTTCAATACGATCCCATCAGTTACGCTTTGAATTTCGACGAGGGCAATAACGGTGATGTGGACTTCGAAGGGG
AGGAATTCAACGGTGGGTTTCAAAACTTTTCTGGCCGGTTTGCTGCCGTGCCGGCACCGGTGAAGTCGTCGGCCGCTGCAGTGACTGGATAGGGTGCTGCGGCGATGTAT
TTTCCTTTTCCGGCGAAGTTGAACCGGCGGTTTCCGGCCGGTGGTGGGATTTTAACGGATAGTTTTTCAACTGATGAGCTAGAACTGGACCGTTTGCGATTAATTAGGAG
GCGCCAATGGCGTTCGTACGCGTGTTTAATATGGCTGTTTACTTTTCTGTAGTGGTGACGTGTAGCAAAAGGATTGGTGTTTATATTCGTATAGTTATATGATTTTGTTT
ATTTTTTCAATTAATTTGTATAATTCATATAAAATCTTACAGCTTGAAAATTACATTTAATTTACTGTGGAATTACAGGTTTACGAACTAATTGTTATTTTTTTGGTTA
Protein sequence
MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK
FQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAVTG
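
The CDS and protein sequences listed above can be cross-checked by translating one into the other. The Python below is a minimal sketch of that check. It assumes the standard nuclear genetic code applies to this gene; `translate`, `CDS`, and `PROTEIN` are illustrative names, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# CDS of Sed0025664, copied from the record above.
CDS = (
    "ATGGCTGACCGTGACGGAGCTTTCAAAGAATTGGAATCCGCCGGCGAAATTGACGGCGGCGATGCTCTTCTGGCTTCTAGACGATACAGTTGTCTGTGCTTCCCTTGCCT"
    "TAGACCTCGCCGGTCGGCTTCCGACGAGCTTTCTTGGTGGGAACGGGCGAAGGCGAAAGCAACTGCCACAAAGTTCGACGGCGACGATCACTGGTGGTCCGGCGGAATCA"
    "GATCCCTCAAGAAGCTTCGTGAGTGGTCGGAGATCGTCGCCGGGCCGAGATGGAAGACCTTCATTCGCCGGTTCAACCGGAACCGGCCCGCCACCGTGAAGCTTGGGAAA"
    "TTTCAATACGATCCCATCAGTTACGCTTTGAATTTCGACGAGGGCAATAACGGTGATGTGGACTTCGAAGGGGAGGAATTCAACGGTGGGTTTCAAAACTTTTCTGGCCG"
    "GTTTGCTGCCGTGCCGGCACCGGTGAAGTCGTCGGCCGCTGCAGTGACTGGATAG"
)

# Protein sequence of Sed0025664, copied from the record above.
PROTEIN = (
    "MADRDGAFKELESAGEIDGGDALLASRRYSCLCFPCLRPRRSASDELSWWERAKAKATATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK"
    "FQYDPISYALNFDEGNNGDVDFEGEEFNGGFQNFSGRFAAVPAPVKSSAAAVTG"
)

print("CDS length (nt):", len(CDS))
print("Translation matches listed protein:", translate(CDS) == PROTEIN)
```

The sketch handles only unambiguous upper-case A, C, G, T input; a sequence containing `N` or lower-case bases would raise a `KeyError` and would need preprocessing first.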