; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033939 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033939
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4228 domain protein
Genome locationchr3:3117920..3118516
RNA-Seq ExpressionLag0033939
SyntenyLag0033939
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023330.1 hypothetical protein SDJN02_14355, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-7687.5Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSASC PSMASN AAKVLSL+G LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRS+KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC

XP_022135244.1 uncharacterized protein LOC111007254 [Momordica charantia]3.6e-7684.62Show/hide
Query:  FSTFAKHALISMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIE
        F  FAK +  SMGNSASCAPSM SN AAKVLSL+G L+S+TKPVKAAELMIE+SGKFLCDSGDLKVGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+E
Subjt:  FSTFAKHALISMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIE

Query:  EMSSLTYIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSAKPVQRLMSKQRSWKPALETIAETSCT
        EMSSLTYIATKALK GNSSGFG+IFPVLISELCI PS+VNR+KS   DR  EN + KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  EMSSLTYIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSAKPVQRLMSKQRSWKPALETIAETSCT

XP_022921761.1 uncharacterized protein LOC111429917 [Cucurbita moschata]7.2e-7788.69Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSASCAPSMASN AAKVLSL+G LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLTIEEM SLTY ATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRS+KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC

XP_022965466.1 uncharacterized protein LOC111465363 [Cucurbita maxima]2.7e-7690Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSA CAPSMA N AAKVL+L+GNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRI GLL EE LECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSAKPVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFG+IFPVLISELCIFPSDVN +KS DGD E N+S+KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSAKPVQRLMSKQRSWKPALETIAETSCT

XP_022987495.1 uncharacterized protein LOC111485040 [Cucurbita maxima]2.7e-7688.1Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSASCAPSMASN AAKVLSL+G LQS+TK V+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRS+KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC

TrEMBL top hitse value%identityAlignment
A0A6J1C0W6 uncharacterized protein LOC1110072541.7e-7684.62Show/hide
Query:  FSTFAKHALISMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIE
        F  FAK +  SMGNSASCAPSM SN AAKVLSL+G L+S+TKPVKAAELMIE+SGKFLCDSGDLKVGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+E
Subjt:  FSTFAKHALISMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIE

Query:  EMSSLTYIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSAKPVQRLMSKQRSWKPALETIAETSCT
        EMSSLTYIATKALK GNSSGFG+IFPVLISELCI PS+VNR+KS   DR  EN + KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  EMSSLTYIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSAKPVQRLMSKQRSWKPALETIAETSCT

A0A6J1E2A0 uncharacterized protein LOC1114299173.5e-7788.69Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSASCAPSMASN AAKVLSL+G LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLTIEEM SLTY ATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRS+KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC

A0A6J1F5K3 uncharacterized protein LOC1114410403.6e-7488.24Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSA CAPSMASN AAKVL+L+GNLQSF KPVKAAELMIEHSGKFLCDSGDLKVGHRI GLL EE LE  RLYFLLPMDLLYSVLTIEEMSSLTYIATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSAKPVQRLMSKQRSWKPALETIAETSCT
        A+K GNSSGFG+IFPVLISELCIFPSDVN +KS DGD E N+S+KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSAKPVQRLMSKQRSWKPALETIAETSCT

A0A6J1HLR9 uncharacterized protein LOC1114653631.3e-7690Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSA CAPSMA N AAKVL+L+GNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRI GLL EE LECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSAKPVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFG+IFPVLISELCIFPSDVN +KS DGD E N+S+KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSAKPVQRLMSKQRSWKPALETIAETSCT

A0A6J1JH13 uncharacterized protein LOC1114850401.3e-7688.1Show/hide
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK
        MGNSASCAPSMASN AAKVLSL+G LQS+TK V+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATK

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRS+KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G18290.1 unknown protein7.6e-2438.12Show/hide
Query:  MGNSASCAP----SMASNAAAKVLS-LEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRR-LYFLLPMDLLYSVLTIEEMSSL
        MGN++SCAP    + +S+   K+L+   G L+ F+KP+K ++++  HSG F+ DS  L++ HR+  + P+E L  RR LY LLP D+L+SVLT EE+S +
Subjt:  MGNSASCAP----SMASNAAAKVLS-LEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRR-LYFLLPMDLLYSVLTIEEMSSL

Query:  TYIATKALKHGNSSGFGKIFPVLISELCIF---------PSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAET
        +  A + L     +   +IFPV     CIF         PS VN  ++ DG     +        SK  SW+P LETI E+
Subjt:  TYIATKALKHGNSSGFGKIFPVLISELCIF---------PSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAET

AT3G03280.1 unknown protein4.0e-0932.94Show/hide
Query:  MGNSASCAPSMASNA-AAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECR--RLYFLLPMDLLYSVLTIEEMSSLTYI
        MGN  SCA +  S++  AKV+  +G ++    P KAAELM+E    FL D+  +KVG +   L  ++DL+     +Y   PM    S     +M+ L Y+
Subjt:  MGNSASCAPSMASNA-AAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECR--RLYFLLPMDLLYSVLTIEEMSSLTYI

Query:  ATKALKHGNSSGFGKIFPVLISELCIFPSDVNRV--KSTDGDRENRSAKPVQRLMSKQRSWKPALETIAE
          K  K   + G  ++ P           DV  +  K    D E  SA      +S  +S KP LETIAE
Subjt:  ATKALKHGNSSGFGKIFPVLISELCIFPSDVNRV--KSTDGDRENRSAKPVQRLMSKQRSWKPALETIAE

AT4G37240.1 unknown protein1.1e-0928.36Show/hide
Query:  CAPSMASN-AAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATKALKHG
        C+ S ++  A AK++  +G +  F  PVK   +++++   F+C+S D+     +  +  +E+L+  ++YF LP+  L   L  EEM++L   A+ AL  G
Subjt:  CAPSMASN-AAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATKALKHG

Query:  NSSGFGKIFPVLISELCIFP--SDVNRVKSTDGD
           G            C+ P  SD  R++   GD
Subjt:  NSSGFGKIFPVLISELCIFP--SDVNRVKSTDGD

AT5G17350.1 unknown protein2.4e-0930.16Show/hide
Query:  MGNSASCA---PSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECR--RLYFLLPMDLLYSVLTIEEMSSLT
        MGN  S A    S +S++AAKV+  +G +++   P+KAAELM+E    FL D+  LK+G +   L  ++DL+ +   +Y   PM    S     +++ L 
Subjt:  MGNSASCA---PSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECR--RLYFLLPMDLLYSVLTIEEMSSLT

Query:  YIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDG-----------------DRENRSAKPVQRLMSKQRSWKPALETIAETS
          A K  +H   S          S   +     N   S DG                 D E  SA      +S  +S KP LETI E S
Subjt:  YIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDG-----------------DRENRSAKPVQRLMSKQRSWKPALETIAETS

AT5G66580.1 unknown protein1.8e-0931.58Show/hide
Query:  AAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATKALKHGNSSGF
        +AK++ L+G LQ F+ PVK  +++ ++   F+C+S ++     +  +   E+L   +LYF+LP+  L   L  EEM++L   A+ AL      G+
Subjt:  AAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATKALKHGNSSGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAAATTCAAGCACAGCGAGAGAGATTCCATTATCGGCGCAGCTCCAACTTTCTCCACTTTTGCCAAACACGCGCTGATCTCCATGGGGAACTCAGCGTCCTGTGC
GCCTTCAATGGCCTCCAATGCCGCCGCGAAGGTCTTATCCTTAGAAGGAAACTTGCAGAGTTTCACGAAGCCAGTGAAGGCCGCGGAATTGATGATCGAGCATTCCGGAA
AATTCCTCTGCGATTCCGGCGATCTCAAGGTCGGCCATCGGATTCAAGGCCTTCTGCCGGAGGAAGATCTCGAGTGCCGCCGATTGTACTTTCTACTTCCGATGGATCTT
CTTTACTCTGTTTTGACGATCGAAGAAATGAGCTCTCTCACTTACATCGCGACAAAGGCTCTGAAACATGGAAATTCGAGTGGATTTGGAAAGATCTTCCCTGTTTTGAT
CAGTGAACTCTGTATTTTTCCGTCGGATGTGAATCGAGTGAAATCGACGGACGGCGATCGAGAGAATCGAAGTGCGAAGCCGGTGCAGAGATTGATGTCGAAACAGAGAT
CGTGGAAGCCGGCGCTCGAAACCATTGCTGAAACTTCGTGCACATAG
mRNA sequenceShow/hide mRNA sequence
ATGCATAAATTCAAGCACAGCGAGAGAGATTCCATTATCGGCGCAGCTCCAACTTTCTCCACTTTTGCCAAACACGCGCTGATCTCCATGGGGAACTCAGCGTCCTGTGC
GCCTTCAATGGCCTCCAATGCCGCCGCGAAGGTCTTATCCTTAGAAGGAAACTTGCAGAGTTTCACGAAGCCAGTGAAGGCCGCGGAATTGATGATCGAGCATTCCGGAA
AATTCCTCTGCGATTCCGGCGATCTCAAGGTCGGCCATCGGATTCAAGGCCTTCTGCCGGAGGAAGATCTCGAGTGCCGCCGATTGTACTTTCTACTTCCGATGGATCTT
CTTTACTCTGTTTTGACGATCGAAGAAATGAGCTCTCTCACTTACATCGCGACAAAGGCTCTGAAACATGGAAATTCGAGTGGATTTGGAAAGATCTTCCCTGTTTTGAT
CAGTGAACTCTGTATTTTTCCGTCGGATGTGAATCGAGTGAAATCGACGGACGGCGATCGAGAGAATCGAAGTGCGAAGCCGGTGCAGAGATTGATGTCGAAACAGAGAT
CGTGGAAGCCGGCGCTCGAAACCATTGCTGAAACTTCGTGCACATAG
Protein sequenceShow/hide protein sequence
MHKFKHSERDSIIGAAPTFSTFAKHALISMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDL
LYSVLTIEEMSSLTYIATKALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSAKPVQRLMSKQRSWKPALETIAETSCT