; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022021 (gene) of Chayote v1 genome

Gene IDSed0022021
OrganismSechium edule (Chayote v1)
DescriptionBetaGal beta-1,3-N-acetylglucosaminyltransferase 2
Genome locationLG06:4422148..4424840
RNA-Seq ExpressionSed0022021
SyntenySed0022021
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588936.1 hypothetical protein SDJN03_17501, partial [Cucurbita argyrosperma subsp. sororia]5.0e-6474.48Show/hide
Query:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A AI++S EE  DWELCNDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKY+ EIEQWEVLSN+LRA+EERTR LR
Subjt:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYRRE EEGTA  L+AS+ + +Q KELSCA MV+DLLSQVE QE II NV KLCD+A  LCKTE D+LKQ LIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

XP_022928186.1 uncharacterized protein LOC111435083 [Cucurbita moschata]3.8e-6474.48Show/hide
Query:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A AI++S+EE  DWELCNDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKY+ EIEQWEVLSN+LRA EERTR L+
Subjt:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYRRE EEGTA  L+AS+ + +Q KELSCA MV+DLLSQVE QEAII NV  LCDIA  LCKTE D+LKQ LIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

XP_022982556.1 uncharacterized protein LOC111481397 [Cucurbita maxima]2.7e-6273.44Show/hide
Query:  MEPVAPAI-TVSIE-EDWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        M+ +A A+ TVS+E EDWELCNDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKYR EIEQWEVLS++LRA+EER R L+
Subjt:  MEPVAPAI-TVSIE-EDWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKAS-ASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYR+E E G ASRL+AS  + +Q KELSCA MV+DLLSQVEA EAII NV KLCDIA  LC TE+++LKQRLIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKAS-ASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

XP_022989565.1 uncharacterized protein LOC111486625 [Cucurbita maxima]3.8e-6474.48Show/hide
Query:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A AI++SIEE  DWELCNDDGFV+KRK+RRLDP EA AARSS      + AEE RRRERRR TLLKVRAKY+ EIEQWEVLSN+LRA+EERTR L+
Subjt:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASASGIQP-KELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYRRE EEGTAS L+AS+  + P KELSC+ MV+DLLSQVE QEAII NV KLCDIA  LCKTE D++KQ LIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKASASGIQP-KELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

XP_023530399.1 uncharacterized protein LOC111792986 [Cucurbita pepo subsp. pepo]2.7e-6273.96Show/hide
Query:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A AI++S+EE  DWEL NDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKY+ EIEQWEVLSN+LRA+EERTR LR
Subjt:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQ RRE EEG A  L+AS+ + +Q KELSCA MV+DLLSQVE QEAII NV KLCDIA  LCKTE D+LKQ LIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

TrEMBL top hitse value%identityAlignment
A0A0A0LFK5 Uncharacterized protein3.6e-6070.31Show/hide
Query:  MEPVAPAI-TVSIEE-DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A A+ TVS+EE DWELCNDDGFV+KRK+RRLDPAE  AARSSA     + AEE RRR+RRR TLLKVRAKY+ EIEQWEVLS +LR +EER RNL+
Subjt:  MEPVAPAI-TVSIEE-DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYRR  E+G  S L+AS+ + ++ KELS A MVDDLLSQVEAQ+A+I NV K CDIA  LC+TE+DRLKQRLIDLPIW SPR L+ASLCDE
Subjt:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

A0A6J1EJL6 uncharacterized protein LOC1114350831.8e-6474.48Show/hide
Query:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A AI++S+EE  DWELCNDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKY+ EIEQWEVLSN+LRA EERTR L+
Subjt:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYRRE EEGTA  L+AS+ + +Q KELSCA MV+DLLSQVE QEAII NV  LCDIA  LCKTE D+LKQ LIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKASA-SGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

A0A6J1F3L3 uncharacterized protein LOC1114420711.1e-6172.92Show/hide
Query:  MEPVAPAI-TVSIE-EDWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        M+ +A A+ TVS+E EDWELCNDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKYR EIEQWEVLS++L+A+EER R L+
Subjt:  MEPVAPAI-TVSIE-EDWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKAS-ASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYR+E E G ASRL+AS  + +Q KELSCA MV+DLLSQVEAQEAII NV KLCDIA  LC TE+++LKQRLIDLPIWASP  LMASLCDE
Subjt:  EQYRREEEEGTASRLKAS-ASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

A0A6J1IZN1 uncharacterized protein LOC1114813971.3e-6273.44Show/hide
Query:  MEPVAPAI-TVSIE-EDWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        M+ +A A+ TVS+E EDWELCNDDGFV+KRK+RRLDPAEA AARSS      + AEE RRRERRR TLLKVRAKYR EIEQWEVLS++LRA+EER R L+
Subjt:  MEPVAPAI-TVSIE-EDWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKAS-ASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYR+E E G ASRL+AS  + +Q KELSCA MV+DLLSQVEA EAII NV KLCDIA  LC TE+++LKQRLIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKAS-ASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

A0A6J1JMQ6 uncharacterized protein LOC1114866251.8e-6474.48Show/hide
Query:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR
        ME +A AI++SIEE  DWELCNDDGFV+KRK+RRLDP EA AARSS      + AEE RRRERRR TLLKVRAKY+ EIEQWEVLSN+LRA+EERTR L+
Subjt:  MEPVAPAITVSIEE--DWELCNDDGFVHKRKKRRLDPAEAAAARSSA-----VPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLR

Query:  EQYRREEEEGTASRLKASASGIQP-KELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
        EQYRRE EEGTAS L+AS+  + P KELSC+ MV+DLLSQVE QEAII NV KLCDIA  LCKTE D++KQ LIDLPIWASPR LMASLCDE
Subjt:  EQYRREEEEGTASRLKASASGIQP-KELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27520.1 unknown protein8.9e-2740.2Show/hide
Query:  APAITVSI--EEDWELCNDDGFVHKRKKRRLDPAEAAAARSSAVP-------AEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLREQ
        +P  T SI  +EDWE   DDGFV+ RKKR      A A  +S  P        EE  RR R++  L+K++ KY+ EI+QWE+LSNS  A++E+    +  
Subjt:  APAITVSI--EEDWELCNDDGFVHKRKKRRLDPAEAAAARSSAVP-------AEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLREQ

Query:  YRRE--EEEGTASRLKASAS--------GIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE
         R E      T S    S+S        G +    S + M+D LL  VE QEA+I  V KLC++   +C+ E++  KQ   DLPIW+SP  LMASLC +
Subjt:  YRRE--EEEGTASRLKASAS--------GIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCAGTCGCACCAGCCATTACGGTCTCCATAGAAGAAGATTGGGAGCTTTGCAACGACGATGGCTTCGTCCACAAGCGGAAGAAGCGCCGCCTGGATCCTGCGGA
AGCGGCTGCCGCTCGCTCGTCGGCGGTTCCGGCGGAGGAGATTCGGCGGCGAGAGCGGCGGAGGGGGACGTTGTTGAAGGTTAGGGCGAAGTACCGGGGCGAGATTGAAC
AGTGGGAGGTTTTGTCGAACAGCTTGCGGGCAGTGGAGGAGAGGACTCGGAATCTGCGGGAACAGTACCGGCGGGAAGAGGAGGAAGGAACGGCGTCGCGTTTGAAGGCT
TCGGCGAGCGGGATTCAGCCGAAGGAGCTCTCGTGCGCGTTAATGGTGGACGATCTTCTCTCTCAGGTGGAAGCTCAGGAAGCCATAATTTGCAATGTTTTCAAGCTCTG
TGATATAGCTGGAGAATTGTGCAAGACGGAAGACGACCGATTGAAACAGCGTCTAATTGACCTTCCCATTTGGGCATCACCCCGCGCGCTTATGGCTTCGCTGTGCGATG
AGTAA
mRNA sequenceShow/hide mRNA sequence
GCCCACAAAATTTCATCAGATTTCAAATTTCGAAAGAAATGAAAACCTGATTGATAGAAGAAGAAGAAGAAGATTGGTTTTGGTTTCATGGAGCCAGTCGCACCAGCCAT
TACGGTCTCCATAGAAGAAGATTGGGAGCTTTGCAACGACGATGGCTTCGTCCACAAGCGGAAGAAGCGCCGCCTGGATCCTGCGGAAGCGGCTGCCGCTCGCTCGTCGG
CGGTTCCGGCGGAGGAGATTCGGCGGCGAGAGCGGCGGAGGGGGACGTTGTTGAAGGTTAGGGCGAAGTACCGGGGCGAGATTGAACAGTGGGAGGTTTTGTCGAACAGC
TTGCGGGCAGTGGAGGAGAGGACTCGGAATCTGCGGGAACAGTACCGGCGGGAAGAGGAGGAAGGAACGGCGTCGCGTTTGAAGGCTTCGGCGAGCGGGATTCAGCCGAA
GGAGCTCTCGTGCGCGTTAATGGTGGACGATCTTCTCTCTCAGGTGGAAGCTCAGGAAGCCATAATTTGCAATGTTTTCAAGCTCTGTGATATAGCTGGAGAATTGTGCA
AGACGGAAGACGACCGATTGAAACAGCGTCTAATTGACCTTCCCATTTGGGCATCACCCCGCGCGCTTATGGCTTCGCTGTGCGATGAGTAACAAAGCAGGCAGCGAAGA
GAAACCGTAGCTGGGAACCGTCCATCTGTAAGATCTTACTATGTATAAAGTTATCGGAATTCTATGAAATCACACCAAACTGTAGCATATTTGTGGAGACCACTTTCTTT
GTTACAATAGAATGTTGAATCCTGAAGTTTGTGCCTCTCCCTCATTGATCATCATGATATTGGAAGAAATTTAGTGTTCTCAATTTATAGAAATGTCAATGAAAATATTG
ACATGTTATTGAAC
Protein sequenceShow/hide protein sequence
MEPVAPAITVSIEEDWELCNDDGFVHKRKKRRLDPAEAAAARSSAVPAEEIRRRERRRGTLLKVRAKYRGEIEQWEVLSNSLRAVEERTRNLREQYRREEEEGTASRLKA
SASGIQPKELSCALMVDDLLSQVEAQEAIICNVFKLCDIAGELCKTEDDRLKQRLIDLPIWASPRALMASLCDE