; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001583 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001583
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBetaGal beta-1,3-N-acetylglucosaminyltransferase 2
Genome locationscaffold879:119097..120456
RNA-Seq ExpressionMS001583
SyntenyMS001583
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588936.1 hypothetical protein SDJN03_17501, partial [Cucurbita argyrosperma subsp. sororia]4.6e-7682.98Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKR RLDPAEAVAARSSVAQAAD+EAEENRRRERRR TLLKVR KYQREIEQWEVLS NLRAMEER R+L+EQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+ VQ++ELSCASMVEDLLSQVE QE II +VSKLCD+AEALC+T+ D+LKQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_022152178.1 uncharacterized protein LOC111019962 [Momordica charantia]4.8e-9498.45Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL
        MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKR RLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVR KYQREIEQWEVLSRNLRAMEERNRRL
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYRREGEEGTVPLAEASSLSVVQQRELS ASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_022989565.1 uncharacterized protein LOC111486625 [Cucurbita maxima]3.5e-7681.91Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKR RLDP EAVAARSSVAQAAD+EAEENRRRERRRKTLLKVR KYQREIEQWEVLS NLRAMEER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+++ ++ELSC+SMVEDLLSQVE QE+II +VSKLCDIAEALC+T+ D++KQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_023526083.1 uncharacterized protein LOC111789675 [Cucurbita pepo subsp. pepo]3.2e-7784.46Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL
        MDS  AAAVATV+ E+EDWELCNDDGFVYKRKR RLDPAEAVAARSSVAQAAD+EAEENRRRERRRKTLLKVR KY++EIEQWEVLS NLRAMEER R+L
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYR+EGE G   L EASSL+VVQ +ELSCASMVEDLLSQVEAQE+II +VSKLCDIAEALC T+E++LKQ LIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

XP_038904940.1 uncharacterized protein LOC120091144 [Benincasa hispida]9.8e-7984.04Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAAVATV+ EE+DWELCNDDGFVYKRKR RLDPAEAVAARSSVAQA D+EAEENRRRE RRKTLLKVR KYQRE+EQWEVLS NLR MEER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        R+GEEGT  L EASSL+VV+++E+SCASMV+DLLSQVEAQE+IIH+VS+ CDIAEALCQT+EDR KQCLIDLPIW+SPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

TrEMBL top hitse value%identityAlignment
A0A6J1DE60 uncharacterized protein LOC1110199622.3e-9498.45Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL
        MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKR RLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVR KYQREIEQWEVLSRNLRAMEERNRRL
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYRREGEEGTVPLAEASSLSVVQQRELS ASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1EJL6 uncharacterized protein LOC1114350834.9e-7682.98Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKR RLDPAEAVAARSSVAQAAD+EAEENRRRERRR TLLKVR KYQREIEQWEVLS NLRA EER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+ VQ++ELSCASMVEDLLSQVE QE+II +VS LCDIAEALC+T+ D+LKQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1F3L3 uncharacterized protein LOC1114420711.2e-7482.38Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL
        MDS  AAAVATV+ E+EDWELCNDDGFVYKRKR RLDPAEAVAARSSVAQA D+EAEENRRRERRRKTLLKVR KY+REIEQWEVLS NL+AMEER R+L
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYR+EGE G     EAS L+VVQ +ELSCASMVEDLLSQVEAQE+II +VSKLCDIAEALC T+E++LKQ LIDLPIWASP ELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1IZN1 uncharacterized protein LOC1114813972.1e-7481.87Show/hide
Query:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL
        MDS  AAAVATV+ E+EDWELCNDDGFVYKRKR RLDPAEAVAARSSVAQAAD+EAEENRRRERRRKTLLKVR KY++EIEQWEVLS NLRAMEER R+L
Subjt:  MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRL

Query:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        QEQYR+EGE G     EAS L+ VQ +ELSCASMVEDLLSQVEA E+II +VSKLCDIAE+LC T+E++LKQ LIDLPIWASPRELMASLCDE
Subjt:  QEQYRREGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

A0A6J1JMQ6 uncharacterized protein LOC1114866251.7e-7681.91Show/hide
Query:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR
        AAA++    EE DWELCNDDGFVYKRKR RLDP EAVAARSSVAQAAD+EAEENRRRERRRKTLLKVR KYQREIEQWEVLS NLRAMEER R+LQEQYR
Subjt:  AAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYR

Query:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
        REGEEGT    EASSL+++ ++ELSC+SMVEDLLSQVE QE+II +VSKLCDIAEALC+T+ D++KQCLIDLPIWASPRELMASLCDE
Subjt:  REGEEGTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27520.1 unknown protein5.8e-2941.71Show/hide
Query:  EEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYRRE--GEEGTV
        +EDWE   DDGFVY RK+ R   A+A           D   EE  RR R+++ L+K++ KYQ EI+QWE+LS +  AM+E+  R Q   R E      T+
Subjt:  EEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYRRE--GEEGTV

Query:  PLAEASSLSVVQQREL-------SCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE
             SS +    RE        S +SM++ LL  VE QE++I+ VSKLC++ E +C+ +E+  KQ   DLPIW+SP +LMASLC +
Subjt:  PLAEASSLSVVQQREL-------SCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCGAACGCAGCGGCCGCCGTAGCGACGGTCGCCGGAGAAGAAGAGGACTGGGAGCTCTGCAACGACGACGGATTCGTCTACAAGCGGAAGAGGTGCCGGCTCGA
CCCGGCGGAAGCCGTCGCAGCTCGCTCGTCGGTGGCCCAAGCGGCGGACATTGAGGCGGAGGAGAATCGGCGGCGAGAGCGCAGGAGGAAGACGTTGTTGAAGGTTAGGG
GGAAGTATCAGAGAGAGATTGAGCAGTGGGAGGTTTTGTCGAGAAACTTGCGGGCGATGGAGGAGCGGAATCGGAGGCTGCAGGAACAGTACCGGCGGGAAGGGGAAGAA
GGAACGGTGCCGCTTGCGGAGGCTTCCTCGTTGTCCGTTGTTCAGCAGAGGGAGCTATCATGCGCGTCAATGGTGGAAGATCTTCTCTCTCAGGTGGAAGCTCAGGAATC
AATCATTCATCATGTTTCAAAGCTTTGTGATATAGCCGAAGCTCTGTGCCAGACACAAGAGGACAGATTGAAACAGTGTCTAATCGACCTTCCCATTTGGGCATCACCCC
GCGAGCTCATGGCCTCACTGTGTGACGAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCGAACGCAGCGGCCGCCGTAGCGACGGTCGCCGGAGAAGAAGAGGACTGGGAGCTCTGCAACGACGACGGATTCGTCTACAAGCGGAAGAGGTGCCGGCTCGA
CCCGGCGGAAGCCGTCGCAGCTCGCTCGTCGGTGGCCCAAGCGGCGGACATTGAGGCGGAGGAGAATCGGCGGCGAGAGCGCAGGAGGAAGACGTTGTTGAAGGTTAGGG
GGAAGTATCAGAGAGAGATTGAGCAGTGGGAGGTTTTGTCGAGAAACTTGCGGGCGATGGAGGAGCGGAATCGGAGGCTGCAGGAACAGTACCGGCGGGAAGGGGAAGAA
GGAACGGTGCCGCTTGCGGAGGCTTCCTCGTTGTCCGTTGTTCAGCAGAGGGAGCTATCATGCGCGTCAATGGTGGAAGATCTTCTCTCTCAGGTGGAAGCTCAGGAATC
AATCATTCATCATGTTTCAAAGCTTTGTGATATAGCCGAAGCTCTGTGCCAGACACAAGAGGACAGATTGAAACAGTGTCTAATCGACCTTCCCATTTGGGCATCACCCC
GCGAGCTCATGGCCTCACTGTGTGACGAG
Protein sequenceShow/hide protein sequence
MDSNAAAAVATVAGEEEDWELCNDDGFVYKRKRCRLDPAEAVAARSSVAQAADIEAEENRRRERRRKTLLKVRGKYQREIEQWEVLSRNLRAMEERNRRLQEQYRREGEE
GTVPLAEASSLSVVQQRELSCASMVEDLLSQVEAQESIIHHVSKLCDIAEALCQTQEDRLKQCLIDLPIWASPRELMASLCDE