; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036306 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036306
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNHL domain protein
Genome locationscaffold5:45355549..45356046
RNA-Seq ExpressionSpg036306
SyntenySpg036306
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584232.1 hypothetical protein SDJN03_20164, partial [Cucurbita argyrosperma subsp. sororia]3.2e-7887.2Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RYGCFCFPCFG  +SASDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VK GKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQ+FS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata]1.3e-7988.41Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RYGCFCFPCFG  +SASDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima]1.1e-7887.2Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RY CFCFPCFG  +SASDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD +FEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo]6.4e-7987.8Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RY CFCFPCFG  +SASDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida]9.9e-8087.35Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ESIGEIDGADALLASKRY CFCFPCFGPR+SASDE+SWWER K KAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN-GGFQNFSGRFAAVPAPVKSSAAAAATG
        NRPATVKLGKFQYDPISYALNFD+GHNGD DF+G+EY+ GGFQNFS RFAAVP PVKSS A A  G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN-GGFQNFSGRFAAVPAPVKSSAAAAATG

TrEMBL top hitse value%identityAlignment
A0A0A0LU30 Uncharacterized protein2.0e-7383.43Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKE ES   IDGADALL SKRY CFCFPCFGP +S SDELSWWER K KAK+TKFD +D HWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN--GGFQNFSGRFAAV-PAPVKSSAAAAATG
        RNRPATVKLGKFQYDPISYALNFDEGHNGD DF+G+EYN  GGFQNFS RFAA+ PAPVKSS++AA  G
Subjt:  RNRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN--GGFQNFSGRFAAV-PAPVKSSAAAAATG

A0A5A7ULQ3 Uncharacterized protein6.3e-7283.53Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ES   IDGADALLASKRY CFCFPCFGP +S SDELSWWER K KAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG
        NRPATVKLGKFQYDPISYALNFDEGH NGD DFEG  EY   GGFQNFS RFAAV PAP+KSS++ A  G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG

A0A5D3BLD0 Uncharacterized protein5.3e-7182.94Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ES   IDGADALLASKRY CFCFPCFGP +S SDELSWWER K KAK+TKFDG+DHWW+G IRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG
        NRPATVKLGKFQYDPISYALNFDEGH NGD DFEG  EY   GGFQNFS RFAAV PAP+KSS++ A  G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG

A0A6J1ED51 uncharacterized protein LOC1114314566.3e-8088.41Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RYGCFCFPCFG  +SASDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

A0A6J1KJ01 uncharacterized protein LOC1114950355.3e-7987.2Show/hide
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RY CFCFPCFG  +SASDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD +FEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)2.9e-2136.16Show/hide
Query:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPRQSASDELS-WWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---
        +S  I E+D  D +   L +KR  CF  PC    Q ++   S WW+R        K + D+ WW   IR  +++REWSE+VAGPRWKT+IRRF R+    
Subjt:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPRQSASDELS-WWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---

Query:  -------------------PATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA
                             +   GKF+YD +SY+LNFD+G N  G F+ E     ++++S RFAA   PV +  +
Subjt:  -------------------PATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA

AT3G48020.1 unknown protein5.2e-1842.62Show/hide
Query:  SASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGHNGDGDFEGEEYN
        S + + SWW+R            +  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++    D D  G    
Subjt:  SASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGHNGDGDFEGEEYN

Query:  GGFQNFSGRFAAVPAPVKSSAA
        GG ++FS R+A+VP     S A
Subjt:  GGFQNFSGRFAAVPAPVKSSAA

AT5G14890.1 NHL domain-containing protein3.5e-2240.72Show/hide
Query:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPRQSASDELS-WWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-----
        ES  I E+D  D +   + +KR  CF  PC G  Q +    S WW+R +      K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF R     
Subjt:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPRQSASDELS-WWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-----

Query:  -------NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA
               NRP  V    F+YD  SY+LNFD+G    G FE E     ++++S RFAA   PV +  +
Subjt:  -------NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA

AT5G25240.1 unknown protein2.2e-0847.37Show/hide
Query:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGHNG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G
Subjt:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGHNG

AT5G62865.1 unknown protein4.2e-2045.83Show/hide
Query:  CFCFPCF-GPRQSASDELSWWERAKA--KAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF
        C CFP F   R S +   S W R +    +  +   GD+  WW   IR+  K+REWSEIVAGPRWKTFIRRFNR+ P   +      KFQYDP+SY+LNF
Subjt:  CFCFPCF-GPRQSASDELSWWERAKA--KAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF

Query:  DEGHNGDGDFEGEEY--NGGFQNFSGRFAAVPAPVKSSAAAAAT
        D+      D E +EY   GG ++FS RFA+VP     + A + T
Subjt:  DEGHNGDGDFEGEEY--NGGFQNFSGRFAAVPAPVKSSAAAAAT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATCAGAATCGATCGGAGAAATCGACGGCGCCGATGCTCTTTTGGCGTCTAAACGATATGGTTGTTTCTGCTTCCCTTGCTT
TGGACCTCGCCAGTCGGCTTCCGACGAGCTCTCCTGGTGGGAACGGGCGAAGGCGAAGGCGAAAGCCACGAAGTTCGACGGCGACGATCACTGGTGGTCCGGTGGAATCA
GATCCCTGAAGAAGCTTCGTGAATGGTCCGAGATCGTCGCCGGTCCCAGATGGAAGACTTTCATTCGCCGGTTCAACCGGAACCGGCCCGCCACCGTGAAGCTTGGAAAA
TTCCAATACGATCCCATTAGTTACGCTTTGAATTTCGACGAGGGCCATAATGGTGATGGGGATTTCGAAGGGGAGGAATACAACGGGGGGTTTCAAAACTTCTCCGGCCG
GTTTGCTGCCGTGCCGGCGCCGGTGAAGTCGTCGGCGGCGGCGGCGGCGACTGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATCAGAATCGATCGGAGAAATCGACGGCGCCGATGCTCTTTTGGCGTCTAAACGATATGGTTGTTTCTGCTTCCCTTGCTT
TGGACCTCGCCAGTCGGCTTCCGACGAGCTCTCCTGGTGGGAACGGGCGAAGGCGAAGGCGAAAGCCACGAAGTTCGACGGCGACGATCACTGGTGGTCCGGTGGAATCA
GATCCCTGAAGAAGCTTCGTGAATGGTCCGAGATCGTCGCCGGTCCCAGATGGAAGACTTTCATTCGCCGGTTCAACCGGAACCGGCCCGCCACCGTGAAGCTTGGAAAA
TTCCAATACGATCCCATTAGTTACGCTTTGAATTTCGACGAGGGCCATAATGGTGATGGGGATTTCGAAGGGGAGGAATACAACGGGGGGTTTCAAAACTTCTCCGGCCG
GTTTGCTGCCGTGCCGGCGCCGGTGAAGTCGTCGGCGGCGGCGGCGGCGACTGGATAG
Protein sequenceShow/hide protein sequence
MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPRQSASDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK
FQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAATG