; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014866 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014866
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF1262)
Genome locationchr12:5382651..5383329
RNA-Seq ExpressionLag0014866
SyntenyLag0014866
Gene Ontology termsGO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601340.1 hypothetical protein SDJN03_06573, partial [Cucurbita argyrosperma subsp. sororia]1.0e-3351.52Show/hide
Query:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS
        +E  GLD+ LRA+LP+  F LPC+SS PVVVGKWYCPFIF+R+  VD Q++ S YYEMTLEQ  E                      +EVVSV  E  A 
Subjt:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS

Query:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG
        G     DGV W  P+RVGLSLA+VERMKWEE+R GFEWV EG EK+V ++R    +      RFG
Subjt:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG

KAG7032126.1 hypothetical protein SDJN02_06169, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-3351.52Show/hide
Query:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS
        +E  GLD+ LRA+LP+  F LPC+SS PVVVGKWYCPFIF+R+  VD Q++ S YYEMTLEQ  E                      +EVVSV  E  A 
Subjt:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS

Query:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG
        G     DGV W  P+RVGLSLA+VERMKWEE+R GFEWV EG EK+V ++R    +      RFG
Subjt:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG

XP_011655744.1 uncharacterized protein LOC105435579 [Cucumis sativus]4.5e-3451.53Show/hide
Query:  EEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR---------------------LEREVVSVADETVASGG
        E  GLD  LRA+LP   F LPC  S+P+VVGKWYCPFIF+R+G VD Q+  S YYEMTL+Q                      +E+EVVSVA   VA  G
Subjt:  EEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR---------------------LEREVVSVADETVASGG

Query:  RDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG
         + GDG  WF  SRVGLS+AIVER+KWEEE+ GFEWV EG EK+V ++R    +      RFG
Subjt:  RDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG

XP_038892872.1 uncharacterized protein LOC120081783 [Benincasa hispida]7.7e-3453.66Show/hide
Query:  GLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDG--AVDVQMDRSMYYEMTLEQR--------------------LEREVVSVADETVAS-GGR
        GL++ LRA LP+ NF LPCKSS+ VVVGKWYCPFIFIR+G  AV  QM  S+YYE+TL Q                     +EREVVS+A E     G R
Subjt:  GLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDG--AVDVQMDRSMYYEMTLEQR--------------------LEREVVSVADETVAS-GGR

Query:  DDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFGRR
        + GDG+VWF P +VGLSL IVERMKWE+ER GF WVEE +EKKV   R V  K+  K    G +
Subjt:  DDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFGRR

XP_038892947.1 uncharacterized protein LOC120081844 [Benincasa hispida]7.9e-4764.67Show/hide
Query:  EEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQ----------------------RLEREVVSVADETVASG
        E HG+DSVLR+QLP+FNFPLPCK S+PVVVGKWYCP+IFIRDG VDVQMDRSMYYEMTLEQ                       +E+EVV V  +TVA G
Subjt:  EEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQ----------------------RLEREVVSVADETVASG

Query:  GRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR
         RD G+G VWFSP+++GLSLAIVER+KW+EE  GFEWVEEGKEK+V ++R
Subjt:  GRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR

TrEMBL top hitse value%identityAlignment
A0A0A0KQZ6 Toxin_10 domain-containing protein2.2e-3451.53Show/hide
Query:  EEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR---------------------LEREVVSVADETVASGG
        E  GLD  LRA+LP   F LPC  S+P+VVGKWYCPFIF+R+G VD Q+  S YYEMTL+Q                      +E+EVVSVA   VA  G
Subjt:  EEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR---------------------LEREVVSVADETVASGG

Query:  RDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG
         + GDG  WF  SRVGLS+AIVER+KWEEE+ GFEWV EG EK+V ++R    +      RFG
Subjt:  RDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG

A0A0A0KT31 Uncharacterized protein1.4e-3351.18Show/hide
Query:  HIYSEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDG--AVDVQMDRSMYYEMTLEQR--------------------LEREVVSVADET
        H  +  HGL++ LRA+LP+ NF LPCKSS+PV VGKWY PFIFIRDG  AV  QM  S YYE+TL Q                     +EREVVS   E+
Subjt:  HIYSEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDG--AVDVQMDRSMYYEMTLEQR--------------------LEREVVSVADET

Query:  VASGGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSL--RRWVSGKDYRKESRFG
         AS  ++  DG+VWF P +VGLSL +VERMKWEE+R GF+WV+EG+EKKV +   R    +  +K +RFG
Subjt:  VASGGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSL--RRWVSGKDYRKESRFG

A0A6J1D6Y4 uncharacterized protein LOC1110176033.2e-3350.91Show/hide
Query:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR----------------------LEREVVSVADETVAS
        +E  GLD+ LRA+LP+  FP      NPVVVGKWYCPFIF+RDGAV+ QM  S YYEMTL+Q                       +EREV+S+A +  A+
Subjt:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR----------------------LEREVVSVADETVAS

Query:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG
        GGR+ GDGV+WF  S VGLSLAIVER+KWEEER GFE+ +E ++K V ++R    K+  +  RFG
Subjt:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG

A0A6J1GYA9 uncharacterized protein LOC1114586051.1e-3354.3Show/hide
Query:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS
        +E  GLD+ LRA+LP+  F LPC+SS PVVVGKWYCPFIF+R+  VD Q++ S YYEMTLEQ  E                      +EVVSV  E  A 
Subjt:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS

Query:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR
        G     DGV W  P+RVGLSLA+VERMKWEE R GFEWV EG EK+V ++R
Subjt:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR

A0A6J1J260 uncharacterized protein LOC1114806023.2e-3350.91Show/hide
Query:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS
        +E  GLD+ LRA LP+  F LPC+SS PVVVGKWYCPFIF+R+  VD Q++ S YYEMTLEQ  E                      +EVVSV  E  A 
Subjt:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE----------------------REVVSVADETVAS

Query:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG
        GG    DGV W  P+RVGLSLA+V RM+WEE+R GFEWV EG EK+V ++R    +      RFG
Subjt:  GGRDDGDGVVWFSPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13470.1 Protein of unknown function (DUF1262)4.5e-1635.59Show/hide
Query:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE------------REV---VSVADETVASGGRD---
        SE+ GL    +    E    LP   +  VVVGKWY PFIF+++G    Q+  S YY M L QR E            REV   V V  E V   G++   
Subjt:  SEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLE------------REV---VSVADETVASGGRD---

Query:  -----DGDGVVWF--SPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLR------RWVSGKDYRKESRFGRRR
             D +GV WF  +  R+GL   ++ERMKWEEER G++   +    K S R       W S + Y     F  RR
Subjt:  -----DGDGVVWF--SPSRVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLR------RWVSGKDYRKESRFGRRR

AT1G13490.1 Protein of unknown function (DUF1262)1.8e-1734.09Show/hide
Query:  EEHGL--------DSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGA-VDVQMDRSMYYEMTLEQR-----------------------LEREVV
        EE+GL        D+ LR +LP+ +          VVVGKWY PF+F+++G   + QM++SMYY MTL+QR                       +E EVV
Subjt:  EEHGL--------DSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGA-VDVQMDRSMYYEMTLEQR-----------------------LEREVV

Query:  SVADETVASGGRD-DGDGVVWFSPS---RVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESR
         +  + +A   +  + DGVVWFS S   ++GL   ++ERMKWEEER G+   +E +       ++  G  + K  R
Subjt:  SVADETVASGGRD-DGDGVVWFSPS---RVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESR

AT1G13500.1 Protein of unknown function (DUF1262)1.4e-2038.41Show/hide
Query:  LDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR-----------------------LEREVVSVADETVASGGRD
        +D+ LR +LP+ +        N VVVGKWY PF+F+++G    QM +SMYY MTL+QR                       +E EVV +  E +A   + 
Subjt:  LDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR-----------------------LEREVVSVADETVASGGRD

Query:  -DGDGVVWFSPS---RVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR
         + DGVVWF+ S   ++GL   ++ERMKWEEER G  W+ +G E++ S++R
Subjt:  -DGDGVVWFSPS---RVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR

AT1G13510.1 Protein of unknown function (DUF1262)8.2e-1838.19Show/hide
Query:  GLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR----------------------LEREVVSVADETVASGGRD
        GL++ LR +LP             VVVGKWY PFIF+++  V+ Q+ RSMYY MTLEQR                      L+ EVV V  + ++ G   
Subjt:  GLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQR----------------------LEREVVSVADETVASGGRD

Query:  DGDGVVWFS--PSRVGLSLAIVERMKWEEERVGFEWVEEGKEKK
        + +G VWF+    ++GL   +VERMKWEEER G  W  +G  ++
Subjt:  DGDGVVWFS--PSRVGLSLAIVERMKWEEERVGFEWVEEGKEKK

AT1G13530.1 Protein of unknown function (DUF1262)5.2e-2040.67Show/hide
Query:  LDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLER------------EV---VSVADETVASGGRDDG------
        +D+ LR +LP+F           VVVGKWY PF+F+++G    QM +SMYY MTL QR E             EV   V V  E V   G + G      
Subjt:  LDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLER------------EV---VSVADETVASGGRDDG------

Query:  --DGVVWFSPS---RVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR
          DGVVWF  S   ++G+   ++ERMKWEEER G  W ++G E K S++R
Subjt:  --DGVVWFSPS---RVGLSLAIVERMKWEEERVGFEWVEEGKEKKVSLRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTCCTCCACATTTATTCAGAAGAACATGGCTTGGACTCTGTTCTGCGAGCCCAACTCCCTGAATTCAACTTCCCATTGCCCTGTAAGAGCTCCAACCCTGTCGT
CGTCGGGAAATGGTACTGCCCTTTCATCTTCATTCGCGACGGCGCAGTGGATGTTCAGATGGACAGATCCATGTATTACGAAATGACCCTCGAGCAGAGACTGGAGAGGG
AAGTGGTTTCGGTTGCCGATGAGACGGTGGCTTCCGGCGGTAGAGATGACGGTGATGGTGTTGTGTGGTTTAGTCCGTCGAGGGTTGGGTTGAGCTTGGCGATTGTGGAG
AGAATGAAATGGGAGGAGGAGAGAGTTGGATTCGAATGGGTTGAAGAAGGAAAAGAAAAGAAAGTCAGTTTAAGAAGATGGGTCAGTGGAAAAGATTACAGAAAAGAAAG
TCGGTTTGGTCGTCGGCGGCTGAGTTTGGTCGTCGTCGAAGCGGCGGCGGCGGCGGCGACGGGAAGGGTGGCGGCGACAAGGGAAAGAATGGAGAAAGAAAGGGAAAGAA
AGAGAAGGGGTACCGTGGGGTGGGGGAGAGGAGAGAGAGAGGATTTTTTGGGTGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATTCCTCCACATTTATTCAGAAGAACATGGCTTGGACTCTGTTCTGCGAGCCCAACTCCCTGAATTCAACTTCCCATTGCCCTGTAAGAGCTCCAACCCTGTCGT
CGTCGGGAAATGGTACTGCCCTTTCATCTTCATTCGCGACGGCGCAGTGGATGTTCAGATGGACAGATCCATGTATTACGAAATGACCCTCGAGCAGAGACTGGAGAGGG
AAGTGGTTTCGGTTGCCGATGAGACGGTGGCTTCCGGCGGTAGAGATGACGGTGATGGTGTTGTGTGGTTTAGTCCGTCGAGGGTTGGGTTGAGCTTGGCGATTGTGGAG
AGAATGAAATGGGAGGAGGAGAGAGTTGGATTCGAATGGGTTGAAGAAGGAAAAGAAAAGAAAGTCAGTTTAAGAAGATGGGTCAGTGGAAAAGATTACAGAAAAGAAAG
TCGGTTTGGTCGTCGGCGGCTGAGTTTGGTCGTCGTCGAAGCGGCGGCGGCGGCGGCGACGGGAAGGGTGGCGGCGACAAGGGAAAGAATGGAGAAAGAAAGGGAAAGAA
AGAGAAGGGGTACCGTGGGGTGGGGGAGAGGAGAGAGAGAGGATTTTTTGGGTGGGTAA
Protein sequenceShow/hide protein sequence
MGFLHIYSEEHGLDSVLRAQLPEFNFPLPCKSSNPVVVGKWYCPFIFIRDGAVDVQMDRSMYYEMTLEQRLEREVVSVADETVASGGRDDGDGVVWFSPSRVGLSLAIVE
RMKWEEERVGFEWVEEGKEKKVSLRRWVSGKDYRKESRFGRRRLSLVVVEAAAAAATGRVAATRERMEKERERKRRGTVGWGRGEREDFLGG