; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g14440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g14440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBetaGal beta-1,3-N-acetylglucosaminyltransferase 2
Genome locationchr8:10945868..10947223
RNA-Seq ExpressionMoc08g14440
SyntenyMoc08g14440
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588936.1 hypothetical protein SDJN03_17501, partial [Cucurbita argyrosperma subsp. sororia]1.3e-7583.51Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAAD+EAEENRRRERRR TLLKVRAKYQREIEQWEVLS NLRAMEER R+L+EQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+ VQ++ELS ASMVEDLLSQVE QE II +VSKLCD+AEALC+T+ D+LKQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_022152178.1 uncharacterized protein LOC111019962 [Momordica charantia]6.8e-96100Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
        MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_022989565.1 uncharacterized protein LOC111486625 [Cucurbita maxima]1.0e-7582.45Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKRRRLDP EAVAARSSVAQAAD+EAEENRRRERRRKTLLKVRAKYQREIEQWEVLS NLRAMEER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+++ ++ELS +SMVEDLLSQVE QE+II +VSKLCDIAEALC+T+ D++KQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_023526083.1 uncharacterized protein LOC111789675 [Cucurbita pepo subsp. pepo]9.2e-7784.97Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
        MDS  AAAVATV+ E+EDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAAD+EAEENRRRERRRKTLLKVRAKY++EIEQWEVLS NLRAMEER R+L
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYR+EGE G   L EASSL+VVQ +ELS ASMVEDLLSQVEAQE+II +VSKLCDIAEALC T+E++LKQ LIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_038904940.1 uncharacterized protein LOC120091144 [Benincasa hispida]2.9e-7884.57Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAAVATV+ EE+DWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQA D+EAEENRRRE RRKTLLKVRAKYQRE+EQWEVLS NLR MEER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        R+GEEGT  L EASSL+VV+++E+S ASMV+DLLSQVEAQE+IIH+VS+ CDIAEALCQT+EDR KQCLIDLPIW+SPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

TrEMBL top hitse value%identityAlignment
A0A6J1DE60 uncharacterized protein LOC1110199623.3e-96100Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
        MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1EJL6 uncharacterized protein LOC1114350831.4e-7583.51Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAAD+EAEENRRRERRR TLLKVRAKYQREIEQWEVLS NLRA EER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+ VQ++ELS ASMVEDLLSQVE QE+II +VS LCDIAEALC+T+ D+LKQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1F3L3 uncharacterized protein LOC1114420713.5e-7482.9Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
        MDS  AAAVATV+ E+EDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQA D+EAEENRRRERRRKTLLKVRAKY+REIEQWEVLS NL+AMEER R+L
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYR+EGE G     EAS L+VVQ +ELS ASMVEDLLSQVEAQE+II +VSKLCDIAEALC T+E++LKQ LIDLPIWASP ELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1IZN1 uncharacterized protein LOC1114813976.0e-7482.38Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL
        MDS  AAAVATV+ E+EDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAAD+EAEENRRRERRRKTLLKVRAKY++EIEQWEVLS NLRAMEER R+L
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYR+EGE G     EAS L+ VQ +ELS ASMVEDLLSQVEA E+II +VSKLCDIAE+LC T+E++LKQ LIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1JMQ6 uncharacterized protein LOC1114866254.9e-7682.45Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKRRRLDP EAVAARSSVAQAAD+EAEENRRRERRRKTLLKVRAKYQREIEQWEVLS NLRAMEER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+++ ++ELS +SMVEDLLSQVE QE+II +VSKLCDIAEALC+T+ D++KQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27520.1 unknown protein7.6e-2941.71Show/hide
Query:  EEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYRRE--GEEGTV
        +EDWE   DDGFVY RK+R    A+A           D   EE  RR R+++ L+K++ KYQ EI+QWE+LS +  AM+E+  R Q   R E      T+
Subjt:  EEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYRRE--GEEGTV

Query:  PLAEASSLSVVQQREL-------SRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
             SS +    RE        S +SM++ LL  VE QE++I+ VSKLC++ E +C+ +E+  KQ   DLPIW+SP +LMASLC +
Subjt:  PLAEASSLSVVQQREL-------SRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCGAACGCAGCGGCCGCCGTAGCGACGGTCGCCGGAGAAGAAGAGGACTGGGAGCTCTGCAACGACGACGGATTCGTCTACAAGCGGAAGAGGCGCCGG
CTCGACCCGGCGGAAGCCGTCGCAGCTCGCTCGTCGGTGGCCCAAGCGGCGGACATTGAGGCGGAGGAGAATCGGCGGCGAGAGCGCAGGAGGAAGACGTTGTTG
AAGGTTAGGGCGAAGTATCAGAGAGAGATTGAGCAGTGGGAGGTTTTGTCGAGAAACTTGCGGGCGATGGAGGAGCGGAATCGGAGGCTGCAGGAACAGTACCGG
CGGGAAGGGGAAGAAGGAACGGTGCCGCTTGCGGAGGCTTCCTCGTTGTCCGTTGTTCAGCAGAGGGAGCTATCACGCGCGTCAATGGTGGAAGATCTTCTCTCT
CAGGTGGAAGCTCAGGAATCAATCATTCATCATGTTTCAAAGCTTTGTGATATAGCCGAAGCTCTGTGCCAGACACAAGAGGACAGATTGAAACAGTGTCTAATC
GACCTTCCCATTTGGGCATCACCCCGCGAGCTCATGGCCTCACTGTGTGACGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCGAACGCAGCGGCCGCCGTAGCGACGGTCGCCGGAGAAGAAGAGGACTGGGAGCTCTGCAACGACGACGGATTCGTCTACAAGCGGAAGAGGCGCCGG
CTCGACCCGGCGGAAGCCGTCGCAGCTCGCTCGTCGGTGGCCCAAGCGGCGGACATTGAGGCGGAGGAGAATCGGCGGCGAGAGCGCAGGAGGAAGACGTTGTTG
AAGGTTAGGGCGAAGTATCAGAGAGAGATTGAGCAGTGGGAGGTTTTGTCGAGAAACTTGCGGGCGATGGAGGAGCGGAATCGGAGGCTGCAGGAACAGTACCGG
CGGGAAGGGGAAGAAGGAACGGTGCCGCTTGCGGAGGCTTCCTCGTTGTCCGTTGTTCAGCAGAGGGAGCTATCACGCGCGTCAATGGTGGAAGATCTTCTCTCT
CAGGTGGAAGCTCAGGAATCAATCATTCATCATGTTTCAAAGCTTTGTGATATAGCCGAAGCTCTGTGCCAGACACAAGAGGACAGATTGAAACAGTGTCTAATC
GACCTTCCCATTTGGGCATCACCCCGCGAGCTCATGGCCTCACTGTGTGACGAGTAA
Protein sequenceShow/hide protein sequence
MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRRRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRAKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
REGEEGTVPLAEASSLSVVQQRELSRASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE