; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021970 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021970
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNHL domain protein
Genome locationscaffold2:5488769..5489278
RNA-Seq ExpressionSpg021970
SyntenySpg021970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589457.1 hypothetical protein SDJN03_14880, partial [Cucurbita argyrosperma subsp. sororia]5.6e-7884.62Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY DA EPPPCEA  SNSLLP TRC CFRLSC  SRR  A GPSWWERIRASQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
           +PGKF YDPLSYAMNFDEGSR+IGELDDD DDF+GYRNFSARYASIPAPLKTGGT   GKD+STVA
Subjt:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

KAG7023141.1 hypothetical protein SDJN02_14166, partial [Cucurbita argyrosperma subsp. argyrosperma]7.3e-7883.82Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY DA EPPPCEA  SNSLLP TRC CFRLSC  SRR  A GPSWWERIRASQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GG----AKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
        GG     +PGKF YDPLSYAMNFDEGSR+IGELDDD DDF+GYRNFSARYASIPAPLKTGGT   GKD+STVA
Subjt:  GG----AKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

XP_022988416.1 uncharacterized protein LOC111485666 [Cucurbita maxima]1.2e-7783.82Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY DA EPPP EAD SNSLLP TRC CFRLSC  SRR  A GPSWWERIR SQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GG----AKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
        GG     +PGKFQYDPLSYAMNFDEGSR+IGELDDD DDF+GYRNFSARYASIPAPLKTGGT   GKD+STVA
Subjt:  GG----AKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

XP_023516338.1 uncharacterized protein LOC111780229 [Cucurbita pepo subsp. pepo]1.9e-7884.39Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY DA EPPPCEAD SNSLLP TRC CFRLSC  SRR  A GPSWWERIRASQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+G GG
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGA----KPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
        GGA    +PGKFQYDPLSYAMNFDEGSR+IGELDDD DDF+GYRNFSARYASIP PLKTGG  A GKD+STVA
Subjt:  GGA----KPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

XP_038878924.1 uncharacterized protein LOC120071012 [Benincasa hispida]5.0e-7986.39Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MA+TD  E  PCEADKSNSLLP  RC CFRLSCF SRR A  GPSWWER+RASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
        GG + GKFQYDPLSYAMNFDEGSR+IGELDDDIDDF+GYRNFSARYASIPAPLKTGGT    KDVS+VA
Subjt:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

TrEMBL top hitse value%identityAlignment
A0A5D3BK38 Putative Serine/arginine repetitive matrix protein 22.4e-7480.79Show/hide
Query:  MAYTDAG--EPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MA+TD    E P CEADKSNSLLP  RC CFRLSCF SRR A  GPSWWERIRASQVHSEGRWWARGV+V+LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAYTDAG--EPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ----GGGGGGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
            GGGG G + GKFQYDPLSYAMNFDEGSR+IGELDDD+DDF+GYRNFSARYASIPAPLKTGGT    KDVS VA
Subjt:  ----GGGGGGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

A0A6J1E188 uncharacterized protein LOC1114298651.7e-7784.62Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY DA EPPPCE D SNSLLP TRC CFRLSC   RR  A GPSWWERIRASQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GG G
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
         G +PGKFQYDPLSYAMNFDEGSR+IGELDDD DDF+GYRNFSARYASIPAPLKTGGT   GKD+STVA
Subjt:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

A0A6J1EK25 uncharacterized protein LOC1114350613.6e-7583.33Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY  A EPP CEADKSNSLLP T C C RLSCF SRRTAAGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS G  
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTV
          A+PGKFQYDPLSYAMNFDEGS +IGELDDDIDDFSG+RNFSARYASIP  LKT GT   GKD  TV
Subjt:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTV

A0A6J1JH52 uncharacterized protein LOC1114856666.0e-7883.82Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY DA EPPP EAD SNSLLP TRC CFRLSC  SRR  A GPSWWERIR SQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GG----AKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA
        GG     +PGKFQYDPLSYAMNFDEGSR+IGELDDD DDF+GYRNFSARYASIPAPLKTGGT   GKD+STVA
Subjt:  GG----AKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA

A0A6J1KTA4 uncharacterized protein LOC1114974232.5e-7684.52Show/hide
Query:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAY  A EPP CEADKSNSLLP T CSC RLSCF SRRTAAGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS G  
Subjt:  MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTV
          A+PGKFQYDPLSYAMNFDEGS +IGELDDDIDDFSG+RNFSARYASIP  LKTGGT   GKD  TV
Subjt:  GGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)8.7e-2941.67Show/hide
Query:  PPPCEADKSNSL---LPATRCSCFRLSCFASRR-TAAGGPSWWERI-RASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNR----------
        PP  E D ++ +   L A R  CF + C AS + +  GG  WW+RI    ++  + RWW RG R   ++REWSE+VAGPRWKT+IRRF R          
Subjt:  PPPCEADKSNSL---LPATRCSCFRLSCFASRR-TAAGGPSWWERI-RASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNR----------

Query:  --NRSGGGGGGAKP------GKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDV
          N SGG GGGA P      GKF+YD LSY++NFD+G+ + G  DD+      YR++S R+A+   P+ T  +  F  D+
Subjt:  --NRSGGGGGGAKP------GKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDV

AT3G48020.1 unknown protein1.4e-2352.83Show/hide
Query:  SWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSA
        SWW+RI  +  H E RWW   VR  LK+REWSEIVAGPRWKTFIRRFNR+   G        KF+YDP+SY ++F++  ++    DDD     G R+FS 
Subjt:  SWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSA

Query:  RYASIP
        RYAS+P
Subjt:  RYASIP

AT5G14890.1 NHL domain-containing protein6.4e-2440.13Show/hide
Query:  DKSNSLLPATRCSCFRLSCF-ASRRTAAGGPSWWERIR-ASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNR--SGGGGGGAKPGK---
        D+ +  + A R  CF L C  +S+ +   G  WW+RIR   ++  + RWW  G    +K+REWSEIVAGP+WKTFIRRF RN   +GG  GG    +   
Subjt:  DKSNSLLPATRCSCFRLSCF-ASRRTAAGGPSWWERIR-ASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNR--SGGGGGGAKPGK---

Query:  FQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKD
        F+YD  SY++NFD+G ++ G  +D+      YR++S R+A+   P+ T  +  F  D
Subjt:  FQYDPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKD

AT5G25240.1 unknown protein3.9e-0535Show/hide
Query:  ATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQYDPLSYAMNFDEG
        A  C C     F+  R   G      R   S    E R    G   L  L+E SE +AGP+WK FIR F    S G     +   F YD  +Y++NFD+G
Subjt:  ATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQYDPLSYAMNFDEG

AT5G62865.1 unknown protein2.9e-2451.13Show/hide
Query:  RCSCFRLSCFASRRTAAGGPSWWERIRA--SQVHS-----EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQYDPLSYAM
        RC CF  S   SR + A G S W RIR      HS     E RWW   +R  LK+REWSEIVAGPRWKTFIRRFNR+   G    A   KFQYDPLSY++
Subjt:  RCSCFRLSCFASRRTAAGGPSWWERIRA--SQVHS-----EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQYDPLSYAM

Query:  NFDEGSREIGELDDDIDDFSGYRNFSARYASIP
        NFD+   E     D+     G R+FS R+AS+P
Subjt:  NFDEGSREIGELDDDIDDFSGYRNFSARYASIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTATACAGATGCGGGAGAGCCTCCTCCTTGTGAAGCGGATAAATCCAATTCTCTGTTGCCGGCCACCAGGTGCAGTTGTTTCCGGTTATCGTGCTTCGCATCTCG
CCGAACGGCCGCCGGCGGGCCGTCGTGGTGGGAGCGGATTAGGGCATCGCAGGTTCACAGCGAGGGGCGCTGGTGGGCGCGAGGCGTTAGGGTTCTTTTGAAGCTCAGAG
AGTGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACGTTCATCCGCCGATTTAACCGGAATCGGAGTGGTGGTGGTGGTGGTGGTGCAAAGCCTGGAAAATTCCAATAC
GATCCGTTGAGTTACGCCATGAATTTCGACGAAGGTTCGAGGGAGATTGGGGAATTGGACGACGACATCGACGATTTCAGCGGGTATCGGAACTTCTCCGCTCGTTACGC
TTCGATTCCGGCGCCGTTGAAGACTGGAGGAACCGCCGCATTCGGGAAGGATGTTTCTACCGTCGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTATACAGATGCGGGAGAGCCTCCTCCTTGTGAAGCGGATAAATCCAATTCTCTGTTGCCGGCCACCAGGTGCAGTTGTTTCCGGTTATCGTGCTTCGCATCTCG
CCGAACGGCCGCCGGCGGGCCGTCGTGGTGGGAGCGGATTAGGGCATCGCAGGTTCACAGCGAGGGGCGCTGGTGGGCGCGAGGCGTTAGGGTTCTTTTGAAGCTCAGAG
AGTGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACGTTCATCCGCCGATTTAACCGGAATCGGAGTGGTGGTGGTGGTGGTGGTGCAAAGCCTGGAAAATTCCAATAC
GATCCGTTGAGTTACGCCATGAATTTCGACGAAGGTTCGAGGGAGATTGGGGAATTGGACGACGACATCGACGATTTCAGCGGGTATCGGAACTTCTCCGCTCGTTACGC
TTCGATTCCGGCGCCGTTGAAGACTGGAGGAACCGCCGCATTCGGGAAGGATGTTTCTACCGTCGCGTAA
Protein sequenceShow/hide protein sequence
MAYTDAGEPPPCEADKSNSLLPATRCSCFRLSCFASRRTAAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGAKPGKFQY
DPLSYAMNFDEGSREIGELDDDIDDFSGYRNFSARYASIPAPLKTGGTAAFGKDVSTVA