; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0019868 (gene) of Chayote v1 genome

Gene IDSed0019868
OrganismSechium edule (Chayote v1)
DescriptionBEST Arabidopsis thaliana protein match is: NHL domain-containing protein .
Genome locationLG12:30070246..30072135
RNA-Seq ExpressionSed0019868
SyntenySed0019868
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057881.1 putative Serine/arginine repetitive matrix protein 2 [Cucumis melo var. makuwa]2.1e-5468.57Show/hide
Query:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MA+TD   E+ P  EADKS S  P   CGCFRLSCFGSRR     PS               WWARGV V+LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -------GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA
               GGG   RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDF+  RNFSARYASIPAPLKTGGT D S VA
Subjt:  -------GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA

XP_008453108.1 PREDICTED: uncharacterized protein LOC103493921 [Cucumis melo]1.3e-5469.36Show/hide
Query:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MA+TD   E+ P  EADKS S  P   CGCFRLSCFGSRR     PS               WWARGV V+LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -----GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA
             GGG   RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDF+  RNFSARYASIPAPLKTGGT D S VA
Subjt:  -----GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA

XP_022921694.1 uncharacterized protein LOC111429865 [Cucurbita moschata]2.5e-5570.48Show/hide
Query:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY D E+PPP E D S S  P T CGCFRLSC   RR     PS               WWARGV V+LK+REWSEIVAGPRWKTFIRRFNRNR+GG G
Subjt:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA
        A R GKFQYDPLSYAMNFDEGSRQIGELDDD DDF+  RNFSARYASIPAPLKTGGT   D STVA
Subjt:  AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA

XP_022988416.1 uncharacterized protein LOC111485666 [Cucurbita maxima]1.9e-5570.18Show/hide
Query:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY D E+PPP EAD S S  P T CGCFRLSC  SRR     PS               WWARGV V+LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  -----AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA
             A R GKFQYDPLSYAMNFDEGSRQIGELDDD DDF+  RNFSARYASIPAPLKTGGT   D STVA
Subjt:  -----AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA

XP_038878924.1 uncharacterized protein LOC120071012 [Benincasa hispida]6.0e-5773.65Show/hide
Query:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS-GGG
        MA+TD E+  P EADKS S  P   CGCFRLSCFGSRR     PS               WWARGV VLLKLREWSEIVAGPRWKTFIRRFNRNRS GGG
Subjt:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS-GGG

Query:  GAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA
        G  RAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDF+  RNFSARYASIPAPLKTGGT   D S+VA
Subjt:  GAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA

TrEMBL top hitse value%identityAlignment
A0A1S3BVG0 uncharacterized protein LOC1034939216.1e-5569.36Show/hide
Query:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MA+TD   E+ P  EADKS S  P   CGCFRLSCFGSRR     PS               WWARGV V+LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -----GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA
             GGG   RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDF+  RNFSARYASIPAPLKTGGT D S VA
Subjt:  -----GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA

A0A5A7UWM1 Putative Serine/arginine repetitive matrix protein 21.0e-5468.57Show/hide
Query:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MA+TD   E+ P  EADKS S  P   CGCFRLSCFGSRR     PS               WWARGV V+LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -------GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA
               GGG   RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDF+  RNFSARYASIPAPLKTGGT D S VA
Subjt:  -------GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA

A0A5D3BK38 Putative Serine/arginine repetitive matrix protein 26.1e-5569.36Show/hide
Query:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MA+TD   E+ P  EADKS S  P   CGCFRLSCFGSRR     PS               WWARGV V+LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAYTDT--EQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -----GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA
             GGG   RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDF+  RNFSARYASIPAPLKTGGT D S VA
Subjt:  -----GGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA

A0A6J1E188 uncharacterized protein LOC1114298651.2e-5570.48Show/hide
Query:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY D E+PPP E D S S  P T CGCFRLSC   RR     PS               WWARGV V+LK+REWSEIVAGPRWKTFIRRFNRNR+GG G
Subjt:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA
        A R GKFQYDPLSYAMNFDEGSRQIGELDDD DDF+  RNFSARYASIPAPLKTGGT   D STVA
Subjt:  AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA

A0A6J1JH52 uncharacterized protein LOC1114856669.4e-5670.18Show/hide
Query:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY D E+PPP EAD S S  P T CGCFRLSC  SRR     PS               WWARGV V+LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRR-----PS---------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  -----AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA
             A R GKFQYDPLSYAMNFDEGSRQIGELDDD DDF+  RNFSARYASIPAPLKTGGT   D STVA
Subjt:  -----AARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGT--GDFSTVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)8.2e-2033.89Show/hide
Query:  EQPPPREADKSKSQSPATICG---CFRLSCFGSRRPS----------------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGA
        + PP  E D +     A       CF + C  S +PS                      WW RG     ++REWSE+VAGPRWKT+IRRF R+   GGG 
Subjt:  EQPPPREADKSKSQSPATICG---CFRLSCFGSRRPS----------------------WWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGA

Query:  ARA-------------------GKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFST
         R                    GKF+YD LSY++NFD+G+ Q G  DD+       R++S R+A+   P+ T  + DF +
Subjt:  ARA-------------------GKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDFST

AT3G48020.1 unknown protein1.3e-2050.54Show/hide
Query:  RRPSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIP
        + P WW R     LK+REWSEIVAGPRWKTFIRRFNR+   G     + KF+YDP+SY ++F++  +     DDD      +R+FS RYAS+P
Subjt:  RRPSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFSALRNFSARYASIP

AT5G14890.1 NHL domain-containing protein5.3e-1942.03Show/hide
Query:  CFRLSCFGSRRPS------WWAR-------------GVGVLLKLREWSEIVAGPRWKTFIRRFNRNR--SGG--GGAARAG--KFQYDPLSYAMNFDEGS
        CF L C GS +PS      WW R              V   +K+REWSEIVAGP+WKTFIRRF RN   +GG  GG  R     F+YD  SY++NFD+G 
Subjt:  CFRLSCFGSRRPS------WWAR-------------GVGVLLKLREWSEIVAGPRWKTFIRRFNRNR--SGG--GGAARAG--KFQYDPLSYAMNFDEGS

Query:  RQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDF
        +Q G  +D+       R++S R+A+   P+ T  + DF
Subjt:  RQIGELDDDIDDFSALRNFSARYASIPAPLKTGGTGDF

AT5G25240.1 unknown protein1.8e-0639.29Show/hide
Query:  KSQSPATICGCFRLSCFGSRRPSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEG
        +S+S     GC +      RR +W   G   L  L+E SE +AGP+WK FIR F+   SG     R   F YD  +Y++NFD+G
Subjt:  KSQSPATICGCFRLSCFGSRRPSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEG

AT5G62865.1 unknown protein2.6e-2154.26Show/hide
Query:  PSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDF---SALRNFSARYASIP
        P WW R     LK+REWSEIVAGPRWKTFIRRFNR+   G     + KFQYDPLSY++NFD+        DD+ D++     LR+FS R+AS+P
Subjt:  PSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDF---SALRNFSARYASIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTATACAGATACAGAACAGCCTCCGCCACGTGAGGCGGACAAATCCAAATCTCAGTCGCCGGCCACCATCTGCGGTTGTTTCCGGCTATCGTGCTTCGGATCTCG
CCGGCCGTCGTGGTGGGCGCGAGGCGTTGGGGTTCTGTTGAAGCTCCGAGAATGGTCGGAGATCGTCGCCGGACCGAGATGGAAGACTTTCATTCGCCGATTCAATCGGA
ATCGGAGTGGTGGTGGTGGTGCCGCTAGAGCTGGAAAATTCCAGTACGATCCGTTGAGTTACGCTATGAATTTCGACGAAGGTTCGAGGCAGATTGGGGAATTAGACGAT
GATATCGATGATTTCAGTGCACTTCGGAACTTCTCTGCTCGTTACGCTTCGATTCCGGCGCCGTTGAAAACCGGAGGAACCGGGGATTTTTCTACCGTCGCGTAA
mRNA sequenceShow/hide mRNA sequence
TTCGGATCGATTATCGGTTTCAAATTCGTGTAGGCGACCGGGAGTAGCCGGTGACGCCGGTCACAGCTGGTTTCTGTTTCATTTTGCGTTGCGCTGTATAAAACTGCAAG
ATCATCAAATCATCAGATCCGATAATGGCGTATACAGATACAGAACAGCCTCCGCCACGTGAGGCGGACAAATCCAAATCTCAGTCGCCGGCCACCATCTGCGGTTGTTT
CCGGCTATCGTGCTTCGGATCTCGCCGGCCGTCGTGGTGGGCGCGAGGCGTTGGGGTTCTGTTGAAGCTCCGAGAATGGTCGGAGATCGTCGCCGGACCGAGATGGAAGA
CTTTCATTCGCCGATTCAATCGGAATCGGAGTGGTGGTGGTGGTGCCGCTAGAGCTGGAAAATTCCAGTACGATCCGTTGAGTTACGCTATGAATTTCGACGAAGGTTCG
AGGCAGATTGGGGAATTAGACGATGATATCGATGATTTCAGTGCACTTCGGAACTTCTCTGCTCGTTACGCTTCGATTCCGGCGCCGTTGAAAACCGGAGGAACCGGGGA
TTTTTCTACCGTCGCGTAAAGGTGCGCTGCATCTACTAAAATGCGCCTTGTTTACTGTTTAGTGCTGTTATTGTCCACACATTGAGTTGGAACGTTGGCCGTTTGACTTC
ACCGGCGGCGCCGCCTCCTCCGTTAGTCTCAAAAGTACATATCACACATCCTTGACATACCGAAGAATAGGTTAGTGATGTAATGATTTTATTGTTCTAGTGTAAATTAG
TTTAACACCAACGTTATAGGTTCGAACTCGAGAATCGGTATTTGTTATCATTCGGGTGATAGGTACTAAAAAAACACATGTTTGAGTTGATTTTTTTGTAGTTTTTAATG
GTGATCTTTTATATTAAACTATATCCTGATTGAGTTATATAATAATAATAATGATTATGGTGTTGGTTTGTTTAAGTGGGGGTGATTATAGGCCATCAAATTTATGCATT
TGGATTGTCCTTATACACACTGTTTTGCAACGAGATTGGATTTTAGGAAGTAGCTTTAAATTTTGGTGGCATGTCCTGTCAGTCTTTTCAATTGGAAGGGAATGGGGAAC
AAAAGAACCAATTAACAATTGTTTGTTTAGTTCAAGATATTTTGAACTCTTTGCAGAAAGATTGAGGCTTAAGGAAACTATTAGGAAAATCTGGAATATCTTCAAAAAAG
TCATGGTTTACTATATACAGCCTTTTCAAGTTCCGAAGAAGTTGTGTGGTGGTGGGATGCTTGAATTTTGAAGACGAAGGTAACCATGCTCATCCCTAGCAAGGCACCAG
GAGACATTGAAAGAGTAATATCACAGTTGAGTCTCGAATCTGGAACATTCAAGGGATCATTACACTCAAAACCTAAGTTTTCAATTATTGTGTCACCCTTTGGGGACGAA
GCTACTTGCAACTTGAAATTGAGAAATTGTCATGCTTTCAAGTTATATTGATATAATTAAATTTATCATAAAGACAAC
Protein sequenceShow/hide protein sequence
MAYTDTEQPPPREADKSKSQSPATICGCFRLSCFGSRRPSWWARGVGVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAARAGKFQYDPLSYAMNFDEGSRQIGELDD
DIDDFSALRNFSARYASIPAPLKTGGTGDFSTVA