; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027591 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027591
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBetaGal beta-1,3-N-acetylglucosaminyltransferase 2
Genome locationchr8:2312795..2314860
RNA-Seq ExpressionLag0027591
SyntenyLag0027591
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581058.1 hypothetical protein SDJN03_21060, partial [Cucurbita argyrosperma subsp. sororia]8.7e-8280.57Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        M+SI AA+ATVS+E+EDWELCN DGFVYKRKRRRLDPAE VAARSSVAQA DL+AEEN RRERRRKTLLKVRAKY+REIEQWEVLS+NL+AMEER RKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDEIQKHCMPR
        E YR+EGE G AS L+ASLL VVQ KELSCASMVE+L+SQVE QEAIIRNVSKLC IA+ALC TEE++LKQRLIDLPIWASPRELMASL    QK  MPR
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDEIQKHCMPR

Query:  FLIAGKMGAIS
        FLI  +MGA++
Subjt:  FLIAGKMGAIS

KAG6588936.1 hypothetical protein SDJN03_17501, partial [Cucurbita argyrosperma subsp. sororia]8.1e-8086.46Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        MESI AAI+  + EE DWELCN DGFVYKRKRRRLDPAEAVAARSSVAQAADL+AEEN RRERRR TLLKVRAKYQREIEQWEVLSNNLRAMEERTRKL+
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YRREGEEGTA  L+AS LN VQEKELSCASMVE+LLSQVEGQE IIRNVSKLC +A+ALCKTE D+LKQ LIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

KAG7017788.1 hypothetical protein SDJN02_19654, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-7984.9Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        M+SI AA+ATVS+E+EDWELCN DGFVYKRKRRRLDPAEAVAARSSVAQA DL+AEEN RRERRRKTLLKVRAKY+REIEQWEVLS+NL+AMEER RKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YR+EGE G AS L+ASLL VVQ KELSC SMVE+LLSQVE QEAIIRNVSKLC IA+ALC TEE++LKQRLIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

XP_022989565.1 uncharacterized protein LOC111486625 [Cucurbita maxima]1.6e-8086.46Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        MESI AAI+    EE DWELCN DGFVYKRKRRRLDP EAVAARSSVAQAADL+AEEN RRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YRREGEEGTAS L+AS LN++ EKELSC+SMVE+LLSQVEGQEAIIRNVSKLC IA+ALCKTE D++KQ LIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

XP_023526083.1 uncharacterized protein LOC111789675 [Cucurbita pepo subsp. pepo]1.6e-8085.94Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        M+SI AA+ATVS+E+EDWELCN DGFVYKRKRRRLDPAEAVAARSSVAQAADL+AEEN RRERRRKTLLKVRAKY++EIEQWEVLS+NLRAMEER RKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YR+EGE G ASLL+AS L VVQ KELSCASMVE+LLSQVE QEAIIRNVSKLC IA+ALC TEE++LKQRLIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

TrEMBL top hitse value%identityAlignment
A0A6J1DE60 uncharacterized protein LOC1110199622.0e-7683.96Show/hide
Query:  AAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQELYRR
        AA+ATV+ EEEDWELCN DGFVYKRKRRRLDPAEAVAARSSVAQAAD++AEEN RRERRRKTLLKVRAKYQREIEQWEVLS NLRAMEER R+LQE YRR
Subjt:  AAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQELYRR

Query:  EGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        EGEEGT  L +AS L+VVQ++ELS ASMVE+LLSQVE QE+II +VSKLC IA+ALC+T+EDRLKQ LIDLPIWASPRELMASLCDE
Subjt:  EGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

A0A6J1EJL6 uncharacterized protein LOC1114350838.7e-8086.98Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        MESI AAI+    EE DWELCN DGFVYKRKRRRLDPAEAVAARSSVAQAADL+AEEN RRERRR TLLKVRAKYQREIEQWEVLSNNLRA EERTRKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YRREGEEGTA  L+AS LN VQEKELSCASMVE+LLSQVEGQEAIIRNVS LC IA+ALCKTE D+LKQ LIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

A0A6J1F3L3 uncharacterized protein LOC1114420712.5e-7984.9Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        M+SI AA+ATVS+E+EDWELCN DGFVYKRKRRRLDPAEAVAARSSVAQA DL+AEEN RRERRRKTLLKVRAKY+REIEQWEVLS+NL+AMEER RKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YR+EGE G AS L+ASLL VVQ KELSCASMVE+LLSQVE QEAIIRNVSKLC IA+ALC TEE++LKQRLIDLPIWASP ELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

A0A6J1IZN1 uncharacterized protein LOC1114813974.3e-7984.38Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        M+SI AA+ATVS+E+EDWELCN DGFVYKRKRRRLDPAEAVAARSSVAQAADL+AEEN RRERRRKTLLKVRAKY++EIEQWEVLS+NLRAMEER RKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YR+EGE G AS L+ASLL  VQ KELSCASMVE+LLSQVE  EAIIRNVSKLC IA++LC TEE++LKQRLIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

A0A6J1JMQ6 uncharacterized protein LOC1114866257.9e-8186.46Show/hide
Query:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
        MESI AAI+    EE DWELCN DGFVYKRKRRRLDP EAVAARSSVAQAADL+AEEN RRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ
Subjt:  MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQ

Query:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
        E YRREGEEGTAS L+AS LN++ EKELSC+SMVE+LLSQVEGQEAIIRNVSKLC IA+ALCKTE D++KQ LIDLPIWASPRELMASLCDE
Subjt:  ELYRREGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27520.1 unknown protein6.9e-2939.7Show/hide
Query:  IEAAIATVSVE-EEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQEL
        +++ ++T S++ +EDWE    DGFVY RK+R    A+A           D   EE  RR R+++ L+K++ KYQ EI+QWE+LSN+  AM+E+  + Q  
Subjt:  IEAAIATVSVE-EEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQEL

Query:  YRRE--GEEGTASLLDASLLNVVQEKEL-------SCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE
         R E      T S    S       +E        S +SM+++LL  VE QEA+I  VSKLC + + +C+ EE+  KQ   DLPIW+SP +LMASLC +
Subjt:  YRRE--GEEGTASLLDASLLNVVQEKEL-------SCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCAATTGAAGCAGCCATAGCTACGGTCTCTGTAGAAGAAGAAGATTGGGAGCTCTGCAACTACGATGGCTTTGTCTACAAGCGCAAGAGGCGCCGGCTG
GACCCGGCGGAAGCCGTCGCAGCTCGTTCGTCGGTGGCTCAGGCGGCGGACCTTCAGGCGGAGGAGAATCTGCGGCGAGAGCGCAGGAGGAAGACGTTGTTGAAG
GTTAGAGCGAAGTATCAGAGAGAGATTGAGCAGTGGGAGGTTTTGTCGAACAACTTGCGGGCGATGGAGGAGAGGACTCGGAAGCTGCAGGAACTGTACAGACGG
GAAGGGGAAGAAGGAACCGCGTCGCTTCTGGACGCTTCCTTGTTGAACGTGGTTCAGGAGAAGGAGCTGTCCTGCGCGTCAATGGTGGAGGAACTTCTCTCTCAG
GTGGAAGGTCAGGAAGCTATCATTCGCAATGTTTCAAAGCTTTGTTATATAGCTGATGCATTGTGCAAGACAGAAGAAGACCGATTGAAACAGCGTCTAATAGAT
CTTCCCATTTGGGCATCACCCCGTGAGCTCATGGCTTCACTTTGTGACGAAATTCAGAAACATTGCATGCCAAGATTCTTGATTGCTGGAAAGATGGGTGCTATA
TCACTCAATGATCCTACACATCTGTTGAGAAAAGCTGAGATTCTCAAACTTATTCTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCAATTGAAGCAGCCATAGCTACGGTCTCTGTAGAAGAAGAAGATTGGGAGCTCTGCAACTACGATGGCTTTGTCTACAAGCGCAAGAGGCGCCGGCTG
GACCCGGCGGAAGCCGTCGCAGCTCGTTCGTCGGTGGCTCAGGCGGCGGACCTTCAGGCGGAGGAGAATCTGCGGCGAGAGCGCAGGAGGAAGACGTTGTTGAAG
GTTAGAGCGAAGTATCAGAGAGAGATTGAGCAGTGGGAGGTTTTGTCGAACAACTTGCGGGCGATGGAGGAGAGGACTCGGAAGCTGCAGGAACTGTACAGACGG
GAAGGGGAAGAAGGAACCGCGTCGCTTCTGGACGCTTCCTTGTTGAACGTGGTTCAGGAGAAGGAGCTGTCCTGCGCGTCAATGGTGGAGGAACTTCTCTCTCAG
GTGGAAGGTCAGGAAGCTATCATTCGCAATGTTTCAAAGCTTTGTTATATAGCTGATGCATTGTGCAAGACAGAAGAAGACCGATTGAAACAGCGTCTAATAGAT
CTTCCCATTTGGGCATCACCCCGTGAGCTCATGGCTTCACTTTGTGACGAAATTCAGAAACATTGCATGCCAAGATTCTTGATTGCTGGAAAGATGGGTGCTATA
TCACTCAATGATCCTACACATCTGTTGAGAAAAGCTGAGATTCTCAAACTTATTCTGAGTTGA
Protein sequenceShow/hide protein sequence
MESIEAAIATVSVEEEDWELCNYDGFVYKRKRRRLDPAEAVAARSSVAQAADLQAEENLRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEERTRKLQELYRR
EGEEGTASLLDASLLNVVQEKELSCASMVEELLSQVEGQEAIIRNVSKLCYIADALCKTEEDRLKQRLIDLPIWASPRELMASLCDEIQKHCMPRFLIAGKMGAI
SLNDPTHLLRKAEILKLILS