CuGenDBv2

Sed0016741 (gene) of Chayote v1 genome

Gene ID: Sed0016741
Organism: Sechium edule (Chayote v1)
Description: FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages;
Genome location: LG04:13570379..13574220
RNA-Seq Expression: Sed0016741
Synteny: Sed0016741
Gene Ontology terms:
GO:0048767 - root hair elongation (biological process)
GO:0071816 - tail-anchored membrane protein insertion into ER membrane (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0043529 - GET complex (cellular component)
GO:0043621 - protein self-association (molecular function)
InterPro domains: IPR028945 - Get1 family


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6596276.1 hypothetical protein SDJN03_09456, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.9e-77; % identity: 88.57)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAE IVEHR SIVAPSIF IV+AFQFLA WL+ LKK+GSNNQVEMELRKSIKQLLREAS LSQPSTFAQAAKLRR AAAKEKELANYQESR+K++K+SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV
         L+SRVLL+SKVLIYIVLVCWFWRASVA+VP HLVQPFGRFLSWRAG TVNDYVKVGIIPWLILSTRVSKFVCRV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV

XP_008448478.1 PREDICTED: uncharacterized protein LOC103490650 [Cucumis melo] (e value: 4.2e-76; % identity: 86.36)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAEGIVEH  SI AP IF IV+ FQFLARWLE LKK GSN+QVE+ELRKSIKQLLREAS LSQPSTFAQAAKLRRLAAAKEKELANYQESRNKE+KTSY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
         L+S+VLL+SKV+IYIVLVCWFWRASVA+VP HLVQPFG+FLSW+AG TVNDYVKVGIIPWLILSTRVSKFVC+VV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV

XP_022945757.1 uncharacterized protein LOC111449636 [Cucurbita moschata] (e value: 9.9e-78; % identity: 89.14)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAE IVEHR SIVAPSIFLIV+AFQFLA WL+ LKK+GSNNQVEMELRKSIKQLLREAS LSQPSTFAQAAKLRR AAAKEKELANYQESR+K++K+SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV
         L+SRVLL+SKVLIYIVLVCWFWRASVA+VP HLVQPFGRFLSWRAG TVNDYVKVGIIPWLILSTRVSKFVCRV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV

XP_022971588.1 uncharacterized protein LOC111470261 [Cucurbita maxima] (e value: 9.9e-78; % identity: 89.14)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAE IVEHR SIVAPSIFLIV+AFQFLA WL+ LKK+GSNNQVEMELRKSIKQLLREAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+K++K+SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV
         L+SRVLL+SKVLIYIVLVCWFWRASVA+VP HLVQPFGRFLSWRAG  VNDYVKVGIIPWLILSTRVSKFVCRV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV

XP_023539852.1 uncharacterized protein LOC111800404 [Cucurbita pepo subsp. pepo] (e value: 3.4e-78; % identity: 89.71)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAE IVEHR SIVAPSIFLIV+AFQFLA WL+ LKK+GSNNQVEMELRKSIKQLLREAS LSQPSTFAQAAKLRR AAAKEKELANYQESR+KE+K+SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV
         L+SRVLL+SKVLIYIVLVCWFWRASVA+VP HLVQPFGRFLSWRAG TVNDYVKVGIIPWLILSTRVSKFVCRV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L4T9 Uncharacterized protein (e value: 1.0e-75; % identity: 85.8)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        ME EGIVEHR SI AP IF IV+ FQFLA+WLE LKK+GSN+QVEMELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESRNKE+KTSY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
         L+S+VLL+SKV+I+IVLVCWFWRASVA+VP HLVQPFG+FLSWRAG TVNDYVKVGIIPWLILSTRVSKFV RVV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV

A0A1S3BKE1 uncharacterized protein LOC103490650 (e value: 2.0e-76; % identity: 86.36)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAEGIVEH  SI AP IF IV+ FQFLARWLE LKK GSN+QVE+ELRKSIKQLLREAS LSQPSTFAQAAKLRRLAAAKEKELANYQESRNKE+KTSY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
         L+S+VLL+SKV+IYIVLVCWFWRASVA+VP HLVQPFG+FLSW+AG TVNDYVKVGIIPWLILSTRVSKFVC+VV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV

A0A6J1G1T6 uncharacterized protein LOC111449636 (e value: 4.8e-78; % identity: 89.14)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAE IVEHR SIVAPSIFLIV+AFQFLA WL+ LKK+GSNNQVEMELRKSIKQLLREAS LSQPSTFAQAAKLRR AAAKEKELANYQESR+K++K+SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV
         L+SRVLL+SKVLIYIVLVCWFWRASVA+VP HLVQPFGRFLSWRAG TVNDYVKVGIIPWLILSTRVSKFVCRV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV

A0A6J1I7A9 uncharacterized protein LOC111470261 (e value: 4.8e-78; % identity: 89.14)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        MEAE IVEHR SIVAPSIFLIV+AFQFLA WL+ LKK+GSNNQVEMELRKSIKQLLREAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+K++K+SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV
         L+SRVLL+SKVLIYIVLVCWFWRASVA+VP HLVQPFGRFLSWRAG  VNDYVKVGIIPWLILSTRVSKFVCRV
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRV

A0A6J1L884 uncharacterized protein LOC111500212 (e value: 1.4e-74; % identity: 86.29)
Query:  EAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSYD
        EAEGIVEHR SI AP IFLIV+AFQFLARWLE LKK GSN+QVEMELRKSIKQLLREAS LSQPSTFAQAAKLRRLAAAKEKELANYQESRNKE+KTSY 
Subjt:  EAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSYD

Query:  LFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
        L+SRVLL+SKV +YI LV WFWR SVA+VP HLVQPFGR LSW+AG  VNDYVKVGIIPWLILSTRVSKFVC+VV
Subjt:  LFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV

SwissProt top hits (columns: e value, % identity, alignment)
Q1H5D2 Protein GET1 (e value: 6.7e-53; % identity: 61.93)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        ME E ++E R  + AP  F++VV FQ L++WL+QLKKKGS N  E ELR  IKQLLREASALSQP+TFAQAAKLRR AA KEKELA Y E  +KE+K SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
        D++ + LL SKV+IY++LV  FWR  +A +   LVQPFG  LSW  G  +  +V VGIIPWLILS RVSK+VCR V
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G16444.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: CHD5-like protein (InterPro:IPR007514); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). (e value: 4.8e-54; % identity: 61.93)
Query:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY
        ME E ++E R  + AP  F++VV FQ L++WL+QLKKKGS N  E ELR  IKQLLREASALSQP+TFAQAAKLRR AA KEKELA Y E  +KE+K SY
Subjt:  MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSY

Query:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
        D++ + LL SKV+IY++LV  FWR  +A +   LVQPFG  LSW  G  +  +V VGIIPWLILS RVSK+VCR V
Subjt:  DLFSRVLLMSKVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
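The % identity values listed with the hits above can be related to the middle line of each alignment. The following is a minimal sketch, assuming the usual BLAST-style convention in which a letter in the middle line marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch. The function name `percent_identity` is hypothetical and is not part of CuGenDBv2 or any BLAST tool.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Return % identity from a BLAST-style middle line.

    Assumption: letters in the middle line are identical positions;
    '+' and spaces are non-identical positions.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# First ten columns of the first GenBank alignment: one mismatch (G).
print(percent_identity("MEAE IVEHR", 10))  # 90.0
```

Under this convention, 155 identical positions over 175 aligned columns would give 88.57, which is the value reported for the first GenBank hit.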


Sequences
CDS sequence
ATGGAAGCTGAAGGAATCGTCGAGCATCGGAGATCGATTGTGGCTCCGTCCATTTTTCTGATCGTCGTTGCTTTTCAGTTTCTTGCTAGATGGTTGGAGCAGCTCAAGAA
GAAAGGTTCAAACAATCAGGTGGAAATGGAGTTGCGCAAATCAATAAAGCAACTTCTGAGGGAGGCAAGCGCCTTATCTCAACCATCTACATTTGCACAAGCTGCAAAAC
TTCGGAGGTTGGCAGCTGCTAAGGAGAAGGAACTGGCAAATTATCAAGAATCGCGGAATAAGGAGATGAAAACATCATATGATTTATTTAGTCGAGTATTGCTGATGTCA
AAGGTTTTGATATATATTGTGCTGGTTTGCTGGTTTTGGAGAGCTTCTGTGGCTTCTGTACCTCCTCATCTTGTGCAGCCATTTGGGAGATTTTTATCTTGGAGGGCTGG
AAGTACCGTAAATGATTATGTGAAGGTTGGAATTATACCATGGTTGATACTGTCAACACGGGTTAGCAAATTTGTTTGTCGAGTGGTCTAG
mRNA sequence
CGATTAACGCGTATTGAGGTAATCATTCATCTCACTACTTGAGGGTAGTTTCGTAACTTTGCTTTTTCCTGAGATATTTTCTCTCGTGCTGAACAAAACGCTGCGTTTTG
TTCCTCCGTAAACTTGCCTGGGCCTTGGGTTTTGGGCTTGGATTTGTGGAGCCCATTTGTTGTGAATTATCGAAGCCCAACTTCTGGCCCAACGTTTTTCTGAATTACTT
CTGATTGGGAGTTTTGATCAACTCACGGTTGTGGTAAAGAAATCCTCTGAAAATGGAAGCTGAAGGAATCGTCGAGCATCGGAGATCGATTGTGGCTCCGTCCATTTTTC
TGATCGTCGTTGCTTTTCAGTTTCTTGCTAGATGGTTGGAGCAGCTCAAGAAGAAAGGTTCAAACAATCAGGTGGAAATGGAGTTGCGCAAATCAATAAAGCAACTTCTG
AGGGAGGCAAGCGCCTTATCTCAACCATCTACATTTGCACAAGCTGCAAAACTTCGGAGGTTGGCAGCTGCTAAGGAGAAGGAACTGGCAAATTATCAAGAATCGCGGAA
TAAGGAGATGAAAACATCATATGATTTATTTAGTCGAGTATTGCTGATGTCAAAGGTTTTGATATATATTGTGCTGGTTTGCTGGTTTTGGAGAGCTTCTGTGGCTTCTG
TACCTCCTCATCTTGTGCAGCCATTTGGGAGATTTTTATCTTGGAGGGCTGGAAGTACCGTAAATGATTATGTGAAGGTTGGAATTATACCATGGTTGATACTGTCAACA
CGGGTTAGCAAATTTGTTTGTCGAGTGGTCTAGTAAACAAGTGATTTAAGGTAAGTATGATGTTGATGATAAACATCACATGCTCTATGTAATGTAATAAATATTTGGAA
AGCGTTTTTTTTTTTTTTCTGAATTTGAGCAAAGGCAGGCTGGAGAAGTTTTTTACCGCCCCACCCGTTTTTTCTAATATAAATATAAAAGGAGGCTGAGCATGGTTATA
T
Protein sequence
MEAEGIVEHRRSIVAPSIFLIVVAFQFLARWLEQLKKKGSNNQVEMELRKSIKQLLREASALSQPSTFAQAAKLRRLAAAKEKELANYQESRNKEMKTSYDLFSRVLLMS
KVLIYIVLVCWFWRASVASVPPHLVQPFGRFLSWRAGSTVNDYVKVGIIPWLILSTRVSKFVCRVV
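
The CDS and protein sequences above are related by codon translation. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper is illustrative and is not a CuGenDBv2 tool. It translates a CDS fragment codon by codon and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS (DNA, reading frame 1) up to the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line wraps and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the Sed0016741 CDS.
print(translate("ATGGAAGCTGAAGGAATCGTCGAGCATCGG"))  # MEAEGIVEHR
```

Passing the full CDS with its line wraps left in place should work as well, since whitespace is removed before translation.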