CuGenDBv2

Spg022262 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022262
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF4228 domain protein
Genome location: scaffold2:3096107..3096700
RNA-Seq Expression: Spg022262
Synteny: Spg022262
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
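The genome location above can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the record itself does not state this. The function names `parse_location` and `span_length` are made up for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold2:3096107..3096700")
print(seqid, span_length(start, end))  # scaffold2 594
```

Under that assumption the span is 594 bp, which is consistent with 198 codons: the 197-residue protein listed in this record plus a stop codon.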


Homology
GenBank top hits (e value, % identity, alignment)
KAG7023330.1 hypothetical protein SDJN02_14355, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.9e-76; % identity: 87.5)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSASC PSMASN AAKVLSL+G LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EEM SLTY AT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC

XP_022135244.1 uncharacterized protein LOC111007254 [Momordica charantia] (e value: 3.8e-78; % identity: 85.08)
Query:  FPTFAKHALSMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEE
        FP FAK + SMGNSASCAPSM SN AAKVLSL+G L+S+TKPVKAAELMIE+SGKFLCDSGDLKVGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EE
Subjt:  FPTFAKHALSMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEE

Query:  MSSLTYIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSSKPVQRLMSKQRSWKPALETIAETSCT
        MSSLTYIAT+ALK GNSSGFG+IFPVLISELCI PS+VNR+KS   DR  EN + KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  MSSLTYIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSSKPVQRLMSKQRSWKPALETIAETSCT

XP_022921761.1 uncharacterized protein LOC111429917 [Cucurbita moschata] (e value: 1.6e-76; % identity: 88.69)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSASCAPSMASN AAKVLSL+G LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLTIEEM SLTY AT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC

XP_022965466.1 uncharacterized protein LOC111465363 [Cucurbita maxima] (e value: 4.7e-76; % identity: 90)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSA CAPSMA N AAKVL+L+GNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRI GLL EE LECRRLYFLLPMDLLYSVLTIEEMSSLTYIAT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFG+IFPVLISELCIFPSDVN +KS DGD E N+SSKPVQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSCT

XP_022987495.1 uncharacterized protein LOC111485040 [Cucurbita maxima] (e value: 6.1e-76; % identity: 88.1)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSASCAPSMASN AAKVLSL+G LQS+TK V+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EEM SLTY AT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C0W6 uncharacterized protein LOC111007254 (e value: 1.8e-78; % identity: 85.08)
Query:  FPTFAKHALSMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEE
        FP FAK + SMGNSASCAPSM SN AAKVLSL+G L+S+TKPVKAAELMIE+SGKFLCDSGDLKVGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EE
Subjt:  FPTFAKHALSMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEE

Query:  MSSLTYIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSSKPVQRLMSKQRSWKPALETIAETSCT
        MSSLTYIAT+ALK GNSSGFG+IFPVLISELCI PS+VNR+KS   DR  EN + KPVQRLMSKQRSWKPALETIAETSCT
Subjt:  MSSLTYIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDR--ENRSSKPVQRLMSKQRSWKPALETIAETSCT

A0A6J1E2A0 uncharacterized protein LOC111429917 (e value: 7.7e-77; % identity: 88.69)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSASCAPSMASN AAKVLSL+G LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLTIEEM SLTY AT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1F5K3 uncharacterized protein LOC111441040 (e value: 6.1e-74; % identity: 88.24)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSA CAPSMASN AAKVL+L+GNLQSF KPVKAAELMIEHSGKFLCDSGDLKVGHRI GLL EE LE  RLYFLLPMDLLYSVLTIEEMSSLTYIAT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSCT
        A+K GNSSGFG+IFPVLISELCIFPSDVN +KS DGD E N+SSKPVQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSCT

A0A6J1HLR9 uncharacterized protein LOC111465363 (e value: 2.3e-76; % identity: 90)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSA CAPSMA N AAKVL+L+GNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRI GLL EE LECRRLYFLLPMDLLYSVLTIEEMSSLTYIAT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFG+IFPVLISELCIFPSDVN +KS DGD E N+SSKPVQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSCT

A0A6J1JH13 uncharacterized protein LOC111485040 (e value: 2.9e-76; % identity: 88.1)
Query:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE
        MGNSASCAPSMASN AAKVLSL+G LQS+TK V+AAELMIEHSGKFLCDS DL VGHRIQGLLP+EDLECRRLYFLLPMDLLYSVLT+EEM SLTY AT+
Subjt:  MGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATE

Query:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFG+IFPVLI++LCI  SDVNR+KS DGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G18290.1 unknown protein (e value: 4.4e-24; % identity: 38.67)
Query:  MGNSASCAP----SMASNAAAKVLS-LEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRR-LYFLLPMDLLYSVLTIEEMSSL
        MGN++SCAP    + +S+   K+L+   G L+ F+KP+K ++++  HSG F+ DS  L++ HR+  + P+E L  RR LY LLP D+L+SVLT EE+S +
Subjt:  MGNSASCAP----SMASNAAAKVLS-LEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRR-LYFLLPMDLLYSVLTIEEMSSL

Query:  TYIATEALKHGNSSGFGKIFPVLISELCIF---------PSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAET
        +  A E L     +   +IFPV     CIF         PS VN  ++ DG     +        SK  SW+P LETI E+
Subjt:  TYIATEALKHGNSSGFGKIFPVLISELCIF---------PSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAET

AT3G50800.1 unknown protein (e value: 2.6e-08; % identity: 32.26)
Query:  AKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATEALKHGNSSG
        AK++  +G LQ F+ PVK  +++ ++   F+C+S D+     +  +   EDL    LYF+LP+  L   L  +EM++L   A+ AL      G
Subjt:  AKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATEALKHGNSSG

AT4G37240.1 unknown protein (e value: 1.4e-09; % identity: 28.36)
Query:  CAPSMASN-AAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATEALKHG
        C+ S ++  A AK++  +G +  F  PVK   +++++   F+C+S D+     +  +  +E+L+  ++YF LP+  L   L  EEM++L   A+ AL  G
Subjt:  CAPSMASN-AAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATEALKHG

Query:  NSSGFGKIFPVLISELCIFP--SDVNRVKSTDGD
           G            C+ P  SD  R++   GD
Subjt:  NSSGFGKIFPVLISELCIFP--SDVNRVKSTDGD

AT5G17350.1 unknown protein (e value: 2.6e-08; % identity: 29.1)
Query:  MGNSASCA---PSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECR--RLYFLLPMDLLYSVLTIEEMSSLT
        MGN  S A    S +S++AAKV+  +G +++   P+KAAELM+E    FL D+  LK+G +   L  ++DL+ +   +Y   PM    S     +++ L 
Subjt:  MGNSASCA---PSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECR--RLYFLLPMDLLYSVLTIEEMSSLT

Query:  YIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDG-----------------DRENRSSKPVQRLMSKQRSWKPALETIAETS
          A +  +H   S          S   +     N   S DG                 D E  S+      +S  +S KP LETI E S
Subjt:  YIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDG-----------------DRENRSSKPVQRLMSKQRSWKPALETIAETS

AT5G66580.1 unknown protein (e value: 1.8e-09; % identity: 31.58)
Query:  AAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATEALKHGNSSGF
        +AK++ L+G LQ F+ PVK  +++ ++   F+C+S ++     +  +   E+L   +LYF+LP+  L   L  EEM++L   A+ AL      G+
Subjt:  AAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLLYSVLTIEEMSSLTYIATEALKHGNSSGF
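The % identity values in the hit tables above can be approximated from the alignments themselves. In BLAST-style pairwise output, the middle line carries a letter where query and subject are identical, a `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters on that middle line and divides by the alignment length. It assumes the alignment length equals the length of the query line (gap characters included) and that the whole alignment fits on one row; `percent_identity` is a hypothetical helper, not part of any BLAST tool. The example strings are copied from the AT3G50800.1 alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style query line and match line.

    Letters on the match line mark identical residues; '+' and spaces
    do not count. Assumes len(query) is the full alignment length.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query)


# Query and match lines of the AT3G50800.1 hit (leading labels stripped).
query = ("AKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRR"
         "LYFLLPMDLLYSVLTIEEMSSLTYIATEALKHGNSSG")
midline = ("AK++  +G LQ F+ PVK  +++ ++   F+C+S D+     +  +   EDL    "
           "LYF+LP+  L   L  +EM++L   A+ AL      G")

print(round(percent_identity(query, midline), 2))
```

For this hit the match line holds 30 letters over 93 alignment columns, which agrees with the 32.26 listed for AT3G50800.1. For alignments printed over several rows, the rows would need to be concatenated first.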


Sequences
CDS sequence
ATGCATAAATTCAAGCACAGCGAGAGAGATTTCATTATCGGCGCAGCTCCAACTTTCCCCACTTTTGCCAAACACGCGCTCTCCATGGGGAACTCAGCGTCCTGTGCCCC
TTCAATGGCCTCCAATGCCGCGGCGAAGGTCTTATCCTTAGAAGGAAACTTGCAGAGTTTCACGAAGCCAGTGAAGGCCGCGGAATTGATGATCGAGCATTCCGGAAAAT
TCCTCTGCGATTCCGGAGATCTCAAGGTCGGCCATCGGATTCAAGGCCTTCTGCCGGAGGAAGATCTCGAGTGCCGCCGATTGTACTTTCTACTTCCGATGGATCTTCTT
TACTCTGTTTTGACGATCGAAGAAATGAGCTCTCTCACTTACATCGCGACAGAGGCTCTGAAACATGGAAATTCGAGTGGATTCGGAAAGATCTTCCCTGTTTTGATCAG
TGAACTCTGTATTTTTCCGTCGGATGTGAATCGAGTGAAATCGACGGACGGCGATCGAGAGAATCGAAGTTCGAAGCCGGTGCAGAGATTGATGTCGAAACAGAGATCGT
GGAAGCCGGCGCTCGAAACCATTGCTGAAACTTCGTGCACATAG
mRNA sequence
ATGCATAAATTCAAGCACAGCGAGAGAGATTTCATTATCGGCGCAGCTCCAACTTTCCCCACTTTTGCCAAACACGCGCTCTCCATGGGGAACTCAGCGTCCTGTGCCCC
TTCAATGGCCTCCAATGCCGCGGCGAAGGTCTTATCCTTAGAAGGAAACTTGCAGAGTTTCACGAAGCCAGTGAAGGCCGCGGAATTGATGATCGAGCATTCCGGAAAAT
TCCTCTGCGATTCCGGAGATCTCAAGGTCGGCCATCGGATTCAAGGCCTTCTGCCGGAGGAAGATCTCGAGTGCCGCCGATTGTACTTTCTACTTCCGATGGATCTTCTT
TACTCTGTTTTGACGATCGAAGAAATGAGCTCTCTCACTTACATCGCGACAGAGGCTCTGAAACATGGAAATTCGAGTGGATTCGGAAAGATCTTCCCTGTTTTGATCAG
TGAACTCTGTATTTTTCCGTCGGATGTGAATCGAGTGAAATCGACGGACGGCGATCGAGAGAATCGAAGTTCGAAGCCGGTGCAGAGATTGATGTCGAAACAGAGATCGT
GGAAGCCGGCGCTCGAAACCATTGCTGAAACTTCGTGCACATAG
Protein sequence
MHKFKHSERDFIIGAAPTFPTFAKHALSMGNSASCAPSMASNAAAKVLSLEGNLQSFTKPVKAAELMIEHSGKFLCDSGDLKVGHRIQGLLPEEDLECRRLYFLLPMDLL
YSVLTIEEMSSLTYIATEALKHGNSSGFGKIFPVLISELCIFPSDVNRVKSTDGDRENRSSKPVQRLMSKQRSWKPALETIAETSCT
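The relationship between the CDS and the protein sequence above can be checked with a small translation routine. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly, and translates only the first 33 nucleotides of the CDS to keep the example short. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 nt of the CDS listed above.
print(translate("ATGCATAAATTCAAGCACAGCGAGAGAGATTTC"))  # MHKFKHSERDF
```

The output matches the first 11 residues of the protein sequence. Passing the full CDS, with line breaks removed, should reproduce the whole protein if the standard-code assumption holds.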