; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006474 (gene) of Chayote v1 genome

Gene IDSed0006474
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG05:41098663..41099910
RNA-Seq ExpressionSed0006474
SyntenySed0006474
Gene Ontology termsGO:0009855 - determination of bilateral symmetry (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151872.1 uncharacterized protein LOC101212188 [Cucumis sativus]1.1e-7288.46Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVLICLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQ+ELEKSPPN+QIS+ACL+FT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIGA+ NNK RA+CGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

XP_008455827.1 PREDICTED: uncharacterized protein LOC103495927 [Cucumis melo]8.5e-7389.1Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVLICLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQ+ELEKSPPN+QIS+ACLVFT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIGA+ NNK RA+CGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

XP_022140603.1 uncharacterized protein LOC111011214 isoform X2 [Momordica charantia]1.8e-7085.26Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MK+VGVLICLLVVAMDIVAG+L IEA+IAQNKVKQLRLWIFECR+PS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQEE++KSPPN+Q+SLACL+FT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIG L+NNK RA+CGFTHHHF SIGGILCFVH LFCVAYYV++TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

XP_022944616.1 uncharacterized protein LOC111449022 [Cucurbita moschata]3.0e-7087.18Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVL+CLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS QA++LGL AAG+L LAHVIANLLGGCNCICSQE LEKSPPNKQIS+ACLVFT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAV +S+LVIGAL NNK RA+CGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

XP_038902419.1 uncharacterized protein LOC120089064 [Benincasa hispida]1.7e-7389.74Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        M+LVGVLICLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQEELEKSPPNKQ+S+ACL+FT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIGAL NNK RATCGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

TrEMBL top hitse value%identityAlignment
A0A0A0LR43 Uncharacterized protein5.4e-7388.46Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVLICLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQ+ELEKSPPN+QIS+ACL+FT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIGA+ NNK RA+CGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

A0A1S3C1S7 uncharacterized protein LOC1034959274.1e-7389.1Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVLICLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQ+ELEKSPPN+QIS+ACLVFT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIGA+ NNK RA+CGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

A0A6J1CID0 uncharacterized protein LOC111011214 isoform X28.6e-7185.26Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MK+VGVLICLLVVAMDIVAG+L IEA+IAQNKVKQLRLWIFECR+PS+QAFKLGLGAAGLL LAH+IANLLGGCNCICSQEE++KSPPN+Q+SLACL+FT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAVG+S+LVIG L+NNK RA+CGFTHHHF SIGGILCFVH LFCVAYYV++TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

A0A6J1FUW3 uncharacterized protein LOC1114490221.5e-7087.18Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVL+CLLVVAMDIVAG+L IEADIAQNKVK LRLWIFECRDPS QA++LGL AAG+L LAHVIANLLGGCNCICSQE LEKSPPNKQIS+ACLVFT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        WIILAV +S+LVIGAL NNK RA+CGFTHHHF SIGGILCFVHGLFCVAYYVT+TA
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

A0A6J1I7N7 uncharacterized protein LOC1114698863.3e-7087.74Show/hide
Query:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT
        MKLVGVL+CLLVVAMDIVAG+LAIEADIAQNKVK LRLWIFECRDPS+QAFKLGLGAAGLL LAH+IANLLGG NCIC +EELEKSPPNKQIS+ CLVFT
Subjt:  MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFT

Query:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTST
        WIILAVG+SLLVIGALAN+K R +CGFTHHHF S+GGILCFVHGLFCVAYYV+ST
Subjt:  WIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTST

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)4.7e-2139.22Show/hide
Query:  VLICL-LVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNK---QISLACLVFTW
        +++C+ L V +DIVAG + ++A  AQ  VK  +L   EC+ PS+ AF LG+ A   LA AHV AN++ GC+     + L   P NK     ++ACL   W
Subjt:  VLICL-LVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNK---QISLACLVFTW

Query:  IILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTS
        ++   G  +L  G  +N + R  C FT++H FSIGG +CF+H +    YY++S
Subjt:  IILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTS

AT1G11500.1 Protein of unknown function (DUF1218)9.2e-2541.25Show/hide
Query:  VGVLICLLVVAMDIVAGVLAIEADIAQNKV------KQLRLWIFEC-RDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLAC
        +G L+ ++++  DI A VL IEA+IAQ+K       +  R     C R PS  AF  G+ A  LL + HV+AN+LGGC  I S+++ +++  NK +++A 
Subjt:  VGVLICLLVVAMDIVAGVLAIEADIAQNKV------KQLRLWIFEC-RDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLAC

Query:  LVFTWIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        LV +WI   V  S L+IG LAN++    C   H  FF IGGI C  HG+   AYYV++ A
Subjt:  LVFTWIILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

AT2G32280.1 Protein of unknown function (DUF1218)1.4e-5761.94Show/hide
Query:  KLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFTW
        K+ G+L+CL++V +D+ A +L I+A++AQN+VK +RLW+FECR+PSQ AF+LGLGAA +L +AHV+ NL+GGC CICSQ+E ++S   +QIS+ACLV TW
Subjt:  KLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFTW

Query:  IILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        I+ AVG   +VIG ++N+K R++CGFTHHHF SIGGILCF+H LFCVAYYV++TA
Subjt:  IILAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA

AT4G21310.1 Protein of unknown function (DUF1218)2.8e-5362.75Show/hide
Query:  VGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFTWII
        VG  IC+L++AMD+ AG+L IEA+IAQNKVK L++WIFECRDPS  AFK GL A  LL LAHV AN LGGC C+ S+++LEKS  NKQ+++A L+FTWII
Subjt:  VGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFTWII

Query:  LAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA
        LA+  S+L++G +AN++ R  CG +HH   SIGGILCFVHGLF VAYY+++TA
Subjt:  LAVGISLLVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTGGTTGGCGTGTTGATTTGTTTGCTCGTTGTGGCTATGGATATTGTCGCTGGGGTGCTCGCCATCGAAGCTGATATTGCACAGAATAAGGTGAAGCAATTACG
ACTATGGATATTTGAGTGCAGAGATCCAAGCCAGCAGGCTTTCAAATTGGGATTGGGAGCAGCAGGGCTGTTGGCTTTAGCCCATGTGATTGCTAATTTGTTGGGGGGCT
GCAATTGCATTTGCTCTCAAGAAGAGCTTGAAAAATCTCCTCCAAATAAGCAAATCTCCCTTGCATGCCTTGTTTTCACCTGGATAATTCTAGCGGTGGGAATTTCTTTG
CTGGTGATTGGGGCATTGGCGAACAACAAACGGAGAGCAACATGTGGATTTACACACCATCACTTTTTTTCAATTGGAGGAATTTTGTGTTTTGTTCATGGTTTGTTTTG
TGTTGCTTATTATGTTACTTCCACTGCTGTTTAG
mRNA sequenceShow/hide mRNA sequence
GCTTTTTTCAACCCCATAAATAGTTTGTTTCCTCTGCTCACCAACCTTTTGTTTCAAACCCTTTTGATTTTGTTATCCGACTTAGAAGACAAGGATACCATGAAGTTGGT
TGGCGTGTTGATTTGTTTGCTCGTTGTGGCTATGGATATTGTCGCTGGGGTGCTCGCCATCGAAGCTGATATTGCACAGAATAAGGTGAAGCAATTACGACTATGGATAT
TTGAGTGCAGAGATCCAAGCCAGCAGGCTTTCAAATTGGGATTGGGAGCAGCAGGGCTGTTGGCTTTAGCCCATGTGATTGCTAATTTGTTGGGGGGCTGCAATTGCATT
TGCTCTCAAGAAGAGCTTGAAAAATCTCCTCCAAATAAGCAAATCTCCCTTGCATGCCTTGTTTTCACCTGGATAATTCTAGCGGTGGGAATTTCTTTGCTGGTGATTGG
GGCATTGGCGAACAACAAACGGAGAGCAACATGTGGATTTACACACCATCACTTTTTTTCAATTGGAGGAATTTTGTGTTTTGTTCATGGTTTGTTTTGTGTTGCTTATT
ATGTTACTTCCACTGCTGTTTAGTAATGGAAATCATATGTCACCTGCATTTTTATTTCTAATTAAAGGTGAATGGTCTGTCCCATTCATTCTAGGTTGACCCTTTTTCTT
CATTGTGTTGCATATCAGA
Protein sequenceShow/hide protein sequence
MKLVGVLICLLVVAMDIVAGVLAIEADIAQNKVKQLRLWIFECRDPSQQAFKLGLGAAGLLALAHVIANLLGGCNCICSQEELEKSPPNKQISLACLVFTWIILAVGISL
LVIGALANNKRRATCGFTHHHFFSIGGILCFVHGLFCVAYYVTSTAV