; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017857 (gene) of Chayote v1 genome

Gene IDSed0017857
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG01:10502087..10503469
RNA-Seq ExpressionSed0017857
SyntenySed0017857
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573220.1 hypothetical protein SDJN03_27107, partial [Cucurbita argyrosperma subsp. sororia]7.7e-7482.39Show/hide
Query:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRLGLLF
        MA+NYGF+VCILVMVMDAVAGLLA+RAEK+QN+V L+  S+ V+ECSRK RDDAFSQGLAA ILLGLAH IAKV GGCI IRN QHFQ+S+ANRRLGLLF
Subjt:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRLGLLF

Query:  MIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP
        MI SW  LAIGFSMLVAGTVDNSKRKNSCEISSHGLFL+GGI+CFIHGL TVAYYVSATAAYREE+R  K  PSVP
Subjt:  MIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP

XP_022139993.1 uncharacterized protein LOC111010766 [Momordica charantia]3.1e-7574.04Show/hide
Query:  FLY-----SNSALPSLSFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLA
        FLY     SNS LP+L   I NL +SSE AA+MAQNYGF+VCILVMVMDAVAG+L +RAEK+QNRV+L+  S+ V+ECSRK RDDAFSQGLA  ILLGLA
Subjt:  FLY-----SNSALPSLSFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLA

Query:  HIIAKVFGGCICIRNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRI
        H IAKV G CICIR+ QHFQESSAN+RLGL FMI SW  LAIGFSML+AGTVDNS  KNSCEISS GLFL GGI+CF HGLCTVAYYVSATAA REEQR 
Subjt:  HIIAKVFGGCICIRNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRI

Query:  HKANPSVP
           N S P
Subjt:  HKANPSVP

XP_022954638.1 uncharacterized protein LOC111456841 [Cucurbita moschata]9.8e-7777.6Show/hide
Query:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP
        SFP+  +     F   MA+NYGF+VCILVMVMDAVAGLLA+RAEK+QN+V L+  S+ V+ECSRK RDDAFSQGLAA ILLGLAH IAKV GGCI IRN 
Subjt:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP

Query:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP
        QHFQ+S+ANRRLGLLFMI SW  LAIGFSMLVAGTVDNSKRKNSC+ISSHGLFL+GGI+CFIHGLCTVAYYVSATAAYREE+R  K  PSVP
Subjt:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP

XP_022994470.1 uncharacterized protein LOC111490180 [Cucurbita maxima]3.7e-7677.08Show/hide
Query:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP
        SFP+  +     F   MA+NYGF+VCILVMVMDAVAGLLA+RAEK+QN+V L+  S+  +ECSRK RDDAFSQGLAA ILLGLAH IAKV GGCI IRN 
Subjt:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP

Query:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP
        QHFQ+S+ANRRLGLLFMI SW  LAIGFSMLVAGTVDNSKRKNSC+ISSHGLFL+GGI+CFIHGLCTVAYYVSATAAYREE+R  K  PSVP
Subjt:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP

XP_023542772.1 uncharacterized protein LOC111802580 [Cucurbita pepo subsp. pepo]1.2e-7778.65Show/hide
Query:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP
        SFP+  +     F   MAQNYGF+VCILVMVMDAVAGLLA+RAEK+QN+V L+  S+ V+ECSRK RDDAFSQGLAA ILLGLAH IAKV GGCI IRN 
Subjt:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP

Query:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP
        QHFQ+S+ANRRLGLLFMI SW  LAIGFSMLVAGTVDNSKRKNSCEISSHGLFL+GGI+CFIHGLCTVAYYVSATAAYREE+R  K  PSVP
Subjt:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ0 Uncharacterized protein1.7e-6669.61Show/hide
Query:  LPSLSF-------PIRNLRKSSEFAA--QMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHII
        LP L F       PI NL +SSE AA  +M +NYGF+VCILV+V+DAVAGLL + AEK+QNRV+L   S+ + ECSRK RDDAFS+GLAA+ILLGLAH+I
Subjt:  LPSLSF-------PIRNLRKSSEFAA--QMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHII

Query:  AKVFGG--CICIRNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIH
        AKV GG  CICIRN Q+ QE SAN+ LG LFMI SW  LAIGFS+L+A T+DNSK KNSCEISSHGLFL GGI+CF HGLCTVAYYVSATAAYREEQR  
Subjt:  AKVFGG--CICIRNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIH

Query:  KANP
        K  P
Subjt:  KANP

A0A1S3B4I4 uncharacterized protein LOC1034857086.2e-6172Show/hide
Query:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCIC--IRNPQHFQESSANRRLGL
        M +NYGF+VCILVMV+D VAGLL + AEK+QNRV+L   S+ V ECSRK RDDAFS+GLAAAILLGLAH+IA V GGC C  I N Q+ Q+ SAN+ LGL
Subjt:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCIC--IRNPQHFQESSANRRLGL

Query:  LFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANP
         FMI SW  L IGFS+L+A T+DNSK KNSCEISSHGLFL GGI+CF+HGLCTVAYYVSATAAYREEQR  K +P
Subjt:  LFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANP

A0A6J1CGZ6 uncharacterized protein LOC1110107661.5e-7574.04Show/hide
Query:  FLY-----SNSALPSLSFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLA
        FLY     SNS LP+L   I NL +SSE AA+MAQNYGF+VCILVMVMDAVAG+L +RAEK+QNRV+L+  S+ V+ECSRK RDDAFSQGLA  ILLGLA
Subjt:  FLY-----SNSALPSLSFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLA

Query:  HIIAKVFGGCICIRNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRI
        H IAKV G CICIR+ QHFQESSAN+RLGL FMI SW  LAIGFSML+AGTVDNS  KNSCEISS GLFL GGI+CF HGLCTVAYYVSATAA REEQR 
Subjt:  HIIAKVFGGCICIRNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRI

Query:  HKANPSVP
           N S P
Subjt:  HKANPSVP

A0A6J1GRM8 uncharacterized protein LOC1114568414.7e-7777.6Show/hide
Query:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP
        SFP+  +     F   MA+NYGF+VCILVMVMDAVAGLLA+RAEK+QN+V L+  S+ V+ECSRK RDDAFSQGLAA ILLGLAH IAKV GGCI IRN 
Subjt:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP

Query:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP
        QHFQ+S+ANRRLGLLFMI SW  LAIGFSMLVAGTVDNSKRKNSC+ISSHGLFL+GGI+CFIHGLCTVAYYVSATAAYREE+R  K  PSVP
Subjt:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP

A0A6J1K198 uncharacterized protein LOC1114901801.8e-7677.08Show/hide
Query:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP
        SFP+  +     F   MA+NYGF+VCILVMVMDAVAGLLA+RAEK+QN+V L+  S+  +ECSRK RDDAFSQGLAA ILLGLAH IAKV GGCI IRN 
Subjt:  SFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNP

Query:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP
        QHFQ+S+ANRRLGLLFMI SW  LAIGFSMLVAGTVDNSKRKNSC+ISSHGLFL+GGI+CFIHGLCTVAYYVSATAAYREE+R  K  PSVP
Subjt:  QHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)6.4e-1027.75Show/hide
Query:  VVCILVMV-MDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANR-----RLGLLFM
        +VCI++ V +D VAG + L+A+ +Q  V          EC   ++  AF  G+ A   L  AH+ A V  GC      Q       N+      +  LF+
Subjt:  VVCILVMV-MDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANR-----RLGLLFM

Query:  IFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPS
        I  W +   G  +L  G   N++ +  C  +++ +F +GG +CF+H + +  YY+S+  A    +  H+  P+
Subjt:  IFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPS

AT1G11500.1 Protein of unknown function (DUF1218)3.1e-2835.88Show/hide
Query:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRV----MLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRL
        M    GF+V ++++  D  A +L + AE +Q++       +        C R   D AF++G+AA +LL + H++A V GGC  IR+ Q F+ ++AN+ L
Subjt:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRV----MLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRL

Query:  GLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQ
         + F++ SW    + +S L+ GT+ NS+    C +     FL+GGI C  HG+ T AYYVSA AA +E++
Subjt:  GLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQ

AT2G32280.1 Protein of unknown function (DUF1218)8.3e-3442.14Show/hide
Query:  GFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRLGLLFMIFSW
        G +VC++++ +D  A +L ++AE +QN+V  +   + +FEC R+   DAF  GL AA +L +AH++  + GGC+CI +   FQ SS+ R++ +  ++ +W
Subjt:  GFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRLGLLFMIFSW

Query:  TLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYRE
         + A+GF  +V GT+ NSK ++SC  + H    +GGILCF+H L  VAYYVSATAA  E
Subjt:  TLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYRE

AT4G21310.1 Protein of unknown function (DUF1218)2.9e-4250.3Show/hide
Query:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRLGLLF
        MA+N GF +CIL++ MD  AG+L + AE +QN+V  +   M +FEC R     AF  GLAA ILL LAH+ A   GGC+C+ + Q  ++SSAN++L +  
Subjt:  MAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICIRNPQHFQESSANRRLGLLF

Query:  MIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREE
        +IF+W +LAI FSML+ GT+ NS+ + +C IS H +  +GGILCF+HGL  VAYY+SATA+ RE+
Subjt:  MIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCCTTTATTCCAATTCAGCCCTCCCAAGCCTTTCATTCCCGATTAGAAATCTTCGCAAGAGCTCTGAATTTGCAGCACAAATGGCGCAAAACTACGGCTTTGT
GGTCTGCATTTTGGTCATGGTAATGGACGCTGTTGCTGGCCTACTCGCCCTTCGAGCCGAAAAGTCTCAGAATCGGGTGATGCTACGATTTGGGAGCATGTTGGTGTTTG
AGTGCAGCAGAAAGGCGAGAGATGATGCTTTTAGTCAGGGGCTGGCTGCGGCTATTCTGCTTGGCCTTGCTCATATCATTGCTAAAGTATTTGGTGGGTGCATTTGCATT
AGAAATCCACAGCATTTCCAAGAATCAAGTGCTAACAGGCGATTGGGATTGCTCTTCATGATTTTCTCATGGACACTTTTGGCTATTGGGTTTTCTATGTTGGTGGCTGG
GACGGTTGACAATTCCAAGCGGAAAAACTCTTGTGAGATATCAAGCCATGGGCTGTTTTTAGTAGGTGGGATTCTGTGTTTCATTCATGGCCTATGTACAGTCGCTTATT
ATGTTTCTGCAACAGCAGCTTATAGAGAAGAACAGAGGATACACAAAGCAAATCCTTCTGTTCCATAA
mRNA sequenceShow/hide mRNA sequence
GAAGAAAATACCGAAAGTAAAGAAGCTTTCTTGAAAAATTTGTCGCCATTGATGATTCCTTTCATAAATGTAACAGTGCAACAATCATGGGTTTCCTTTATTCCAATTCA
GCCCTCCCAAGCCTTTCATTCCCGATTAGAAATCTTCGCAAGAGCTCTGAATTTGCAGCACAAATGGCGCAAAACTACGGCTTTGTGGTCTGCATTTTGGTCATGGTAAT
GGACGCTGTTGCTGGCCTACTCGCCCTTCGAGCCGAAAAGTCTCAGAATCGGGTGATGCTACGATTTGGGAGCATGTTGGTGTTTGAGTGCAGCAGAAAGGCGAGAGATG
ATGCTTTTAGTCAGGGGCTGGCTGCGGCTATTCTGCTTGGCCTTGCTCATATCATTGCTAAAGTATTTGGTGGGTGCATTTGCATTAGAAATCCACAGCATTTCCAAGAA
TCAAGTGCTAACAGGCGATTGGGATTGCTCTTCATGATTTTCTCATGGACACTTTTGGCTATTGGGTTTTCTATGTTGGTGGCTGGGACGGTTGACAATTCCAAGCGGAA
AAACTCTTGTGAGATATCAAGCCATGGGCTGTTTTTAGTAGGTGGGATTCTGTGTTTCATTCATGGCCTATGTACAGTCGCTTATTATGTTTCTGCAACAGCAGCTTATA
GAGAAGAACAGAGGATACACAAAGCAAATCCTTCTGTTCCATAA
Protein sequenceShow/hide protein sequence
MGFLYSNSALPSLSFPIRNLRKSSEFAAQMAQNYGFVVCILVMVMDAVAGLLALRAEKSQNRVMLRFGSMLVFECSRKARDDAFSQGLAAAILLGLAHIIAKVFGGCICI
RNPQHFQESSANRRLGLLFMIFSWTLLAIGFSMLVAGTVDNSKRKNSCEISSHGLFLVGGILCFIHGLCTVAYYVSATAAYREEQRIHKANPSVP