CuGenDBv2

Sed0016646 (gene) of Chayote v1 genome

Gene ID: Sed0016646
Organism: Sechium edule (Chayote v1)
Description: DUF4228 domain protein
Genome location: LG08:2497747..2502112
RNA-Seq Expression: Sed0016646
Synteny: Sed0016646
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits (e value, % identity, alignment)
KAG7023330.1 hypothetical protein SDJN02_14355, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.5e-72; % identity: 84.52)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASC PSMASNGAAKV++LDG LQS+ KPV AAELMIEHSGKFLCDS DL VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SL+  A+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GN+SGFGRIFPVLI+++C S  DVNRLKS D DREN+SSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

XP_008453692.1 PREDICTED: uncharacterized protein LOC103494340 [Cucumis melo] (e value: 7.2e-72; % identity: 85.71)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPS+ASNGAAKV++LDG LQSFTKPVTAAELMIEHSGKFLCDS DLKVGHRIQGLLPDEDLE RRLYFLLPMDLLYSVLT+EEM+SL+ IA+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK+GN+SGFGRIFPVLISE CNS  DV  LK  D D ENQSSK V+RLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

XP_022921761.1 uncharacterized protein LOC111429917 [Cucurbita moschata] (e value: 2.9e-73; % identity: 85.71)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPSMASNGAAKV++LDG LQS+ KPV AAELMIEHSGKFLCDS DL VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEM SL+  A+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GN+SGFGRIFPVLI+++C S  DVNRLKS D DREN+SSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

XP_022987495.1 uncharacterized protein LOC111485040 [Cucurbita maxima] (e value: 1.1e-72; % identity: 85.12)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPSMASNGAAKV++LDG LQS+TK V AAELMIEHSGKFLCDS DL VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SL+  A+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GN+SGFGRIFPVLI+++C S  DVNRLKS D DREN+SSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

XP_038879989.1 uncharacterized protein LOC120071684 [Benincasa hispida] (e value: 9.4e-72; % identity: 85.12)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNS SCAPSMASNGAAKV++LDG LQSFTKPV AAELMIEHSGKFLCDS DLK+GHRIQGLLPDEDLE RRLYFLLPMDLLYSVLT+EEM+SLS IA+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GN+SGFGRIFPVLISE+C S  DV++LK  D DRENQSSK V+RLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BY23 uncharacterized protein LOC103494340 (e value: 3.5e-72; % identity: 85.71)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPS+ASNGAAKV++LDG LQSFTKPVTAAELMIEHSGKFLCDS DLKVGHRIQGLLPDEDLE RRLYFLLPMDLLYSVLT+EEM+SL+ IA+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK+GN+SGFGRIFPVLISE CNS  DV  LK  D D ENQSSK V+RLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

A0A5D3E1W3 DUF4228 domain protein (e value: 3.5e-72; % identity: 85.71)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPS+ASNGAAKV++LDG LQSFTKPVTAAELMIEHSGKFLCDS DLKVGHRIQGLLPDEDLE RRLYFLLPMDLLYSVLT+EEM+SL+ IA+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK+GN+SGFGRIFPVLISE CNS  DV  LK  D D ENQSSK V+RLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1C0W6 uncharacterized protein LOC111007254 (e value: 7.8e-72; % identity: 82.66)
Query:  ALSMGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNI
        + SMGNSASCAPSM SNGAAKV++LDG L+S+TKPV AAELMIE+SGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM+SL+ I
Subjt:  ALSMGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNI

Query:  ASKALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDR--ENQSSKPVQRLMSKQRSWKPALETIAETSC
        A+KALK+GN+SGFGRIFPVLISE+C    +VNRLKS  SDR  EN + KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ASKALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDR--ENQSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1E2A0 uncharacterized protein LOC111429917 (e value: 1.4e-73; % identity: 85.71)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPSMASNGAAKV++LDG LQS+ KPV AAELMIEHSGKFLCDS DL VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEM SL+  A+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GN+SGFGRIFPVLI+++C S  DVNRLKS D DREN+SSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1JH13 uncharacterized protein LOC111485040 (e value: 5.4e-73; % identity: 85.12)
Query:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK
        MGNSASCAPSMASNGAAKV++LDG LQS+TK V AAELMIEHSGKFLCDS DL VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SL+  A+K
Subjt:  MGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASK

Query:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GN+SGFGRIFPVLI+++C S  DVNRLKS D DREN+SSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G18290.1 unknown protein (e value: 1.0e-23; % identity: 37.5)
Query:  MGNSASCAP----SMASNGAAKVVT-LDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRR-LYFLLPMDLLYSVLTIEEMNSL
        MGN++SCAP    + +S+G  K++    G L+ F+KP+  ++++  HSG F+ DS  L++ HR+  + PDE L  RR LY LLP D+L+SVLT EE++ +
Subjt:  MGNSASCAP----SMASNGAAKVVT-LDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRR-LYFLLPMDLLYSVLTIEEMNSL

Query:  SNIASKALKRGNTSGFGRIFPVLISEICNSQ----PDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAET
        SN A++ L +   +   RIFPV I  +   +      VN  ++ D     ++        SK  SW+P LETI E+
Subjt:  SNIASKALKRGNTSGFGRIFPVLISEICNSQ----PDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAET

AT3G03280.1 unknown protein (e value: 1.4e-09; % identity: 33.33)
Query:  MGNSASCAPSMASNG-AAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMNSLSNI
        MGN  SCA +  S+   AKV+  DG ++    P  AAELM+E    FL D+  +KVG +   L  D+DL+     +Y   PM    S     +M   + +
Subjt:  MGNSASCAPSMASNG-AAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMNSLSNI

Query:  ASKALKRGNTSGFGRIFPVLISEICNSQPDVNRL---KSTDSDRENQSSKPVQRLMSKQRSWKPALETIAE
             KR    G  R+ P       N   D  RL   K    D E  S+      +S  +S KP LETIAE
Subjt:  ASKALKRGNTSGFGRIFPVLISEICNSQPDVNRL---KSTDSDRENQSSKPVQRLMSKQRSWKPALETIAE

AT3G50800.1 unknown protein (e value: 1.6e-08; % identity: 33.33)
Query:  AKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASKALKRGNTSG
        AK++  DG LQ F+ PV   +++ ++   F+C+S D+     +  +   EDL    LYF+LP+  L   L  +EM +L+  AS AL +    G
Subjt:  AKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASKALKRGNTSG

AT4G37240.1 unknown protein (e value: 3.7e-10; % identity: 32.71)
Query:  CAPSMASNGA-AKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASKALKRG
        C+ S ++  A AK++  DG +  F  PV    +++++   F+C+S D+     +  +  DE+L+  ++YF LP+  L   L  EEM +L+  AS AL RG
Subjt:  CAPSMASNGA-AKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASKALKRG

Query:  NTSGFGR
           G  R
Subjt:  NTSGFGR

AT5G66580.1 unknown protein (e value: 1.1e-09; % identity: 32.63)
Query:  AAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASKALKRGNTSGF
        +AK++ LDG LQ F+ PV   +++ ++   F+C+S ++     +  +  +E+L   +LYF+LP+  L   L  EEM +L+  AS AL +    G+
Subjt:  AAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLSNIASKALKRGNTSGF


Sequences
CDS sequence
ATGGTGGGGGTGAAGCTTAAGCATAAATTGAAGTACTGTGAAGAGAGATTCCATAATCGGCGCAACTTTCATTTTCTCCAATTTCGCCCAACACACGCGCTCTCAATGGG
GAACTCAGCATCCTGCGCGCCTTCAATGGCCTCCAATGGCGCCGCCAAGGTCGTAACCCTAGACGGGAACTTACAGAGCTTCACGAAGCCGGTGACGGCCGCCGAACTAA
TGATCGAGCATTCCGGCAAGTTCCTCTGCGATTCCGGCGATCTCAAGGTCGGCCACCGGATCCAAGGCCTGTTGCCGGACGAAGATCTCGAGTGCCGCCGGTTGTACTTT
CTTCTTCCGATGGATCTTCTTTACTCTGTGTTGACGATCGAAGAAATGAATTCTCTCAGTAACATCGCTTCAAAGGCTCTGAAACGTGGAAATACGAGTGGATTTGGACG
GATCTTTCCTGTTTTGATCAGCGAAATCTGTAATTCTCAGCCGGATGTGAATCGATTGAAATCTACGGACAGCGATCGAGAGAATCAGAGTTCGAAGCCGGTGCAGAGAT
TGATGTCGAAACAGAGATCGTGGAAGCCGGCGCTCGAAACAATCGCCGAAACTTCGTGCGTGTAG
mRNA sequence
TGCAGGTGTCGATTATCGGCAATCAAAATGACTTCCGCCTCCAACTACAATCGTTTCTTCTTCCTCCCTCCTTCTCATCATGTGTTCAATAACTTGAAAAATGGTGGGGG
TGAAGCTTAAGCATAAATTGAAGTACTGTGAAGAGAGATTCCATAATCGGCGCAACTTTCATTTTCTCCAATTTCGCCCAACACACGCGCTCTCAATGGGGAACTCAGCA
TCCTGCGCGCCTTCAATGGCCTCCAATGGCGCCGCCAAGGTCGTAACCCTAGACGGGAACTTACAGAGCTTCACGAAGCCGGTGACGGCCGCCGAACTAATGATCGAGCA
TTCCGGCAAGTTCCTCTGCGATTCCGGCGATCTCAAGGTCGGCCACCGGATCCAAGGCCTGTTGCCGGACGAAGATCTCGAGTGCCGCCGGTTGTACTTTCTTCTTCCGA
TGGATCTTCTTTACTCTGTGTTGACGATCGAAGAAATGAATTCTCTCAGTAACATCGCTTCAAAGGCTCTGAAACGTGGAAATACGAGTGGATTTGGACGGATCTTTCCT
GTTTTGATCAGCGAAATCTGTAATTCTCAGCCGGATGTGAATCGATTGAAATCTACGGACAGCGATCGAGAGAATCAGAGTTCGAAGCCGGTGCAGAGATTGATGTCGAA
ACAGAGATCGTGGAAGCCGGCGCTCGAAACAATCGCCGAAACTTCGTGCGTGTAGAAACGAAGACAATCGCAATCGCAATCGCAATAGCGAACGCGTATGAAATTATTTT
TTGCTTTTTTTTTCTAAATTCCGAATGTGTATGCGATTAATTGCGAATTCCTCTGCGATTCCGGCGAATTCAAAGGTCGGCCACCAGATCCGCCGTCGATTATACTTTCT
TCTTCCGATGGATCTTCTTTACTCTGTGTTGACTATCGAAGAACTGAGTTCTCTCAGTAACATCGCTTCAAAGGCTCTGAAACGTGGAAATTCCAGTGGATTTGGACGGA
TCTTTGCTGTTTTGATTCCTGAAATTTGTATTTCTCCGTCGGATGTGAATCGGTCGAAATCGAAGGACGGCGATTTTGAGGTTCAGAGTTCGAAGCCGGTGAAGAGATTG
ATTTCGAAACAGAGATCGTGGAAAGCCATGGCTGAAACTTCGTGCCTGTAGATAGAAAGACAATCGCAATAGCAAATGTGTATGAGATTTTCTTTTTTTGAATTGCAAAA
TGTGTATGAGATTAATTGCATTACGAATAATTCGTGCGATTTTGCTTCGGTAAACGTAGATATAATAAACATTGGGGATTTTGCTTATTCTTTTAAAATATATACCTATA
CTTTTTGTGATGAGATT
Protein sequence
MVGVKLKHKLKYCEERFHNRRNFHFLQFRPTHALSMGNSASCAPSMASNGAAKVVTLDGNLQSFTKPVTAAELMIEHSGKFLCDSGDLKVGHRIQGLLPDEDLECRRLYF
LLPMDLLYSVLTIEEMNSLSNIASKALKRGNTSGFGRIFPVLISEICNSQPDVNRLKSTDSDRENQSSKPVQRLMSKQRSWKPALETIAETSCV