CuGenDBv2

Lag0024475 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024475
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: NHL domain protein
Genome location: chr10:3376530..3377027
RNA-Seq Expression: Lag0024475
Synteny: Lag0024475
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits (e value, % identity, alignment)
KAG6584232.1 hypothetical protein SDJN03_20164, partial [Cucurbita argyrosperma subsp. sororia] (e value 9.3e-78, 86.59% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RYGCFCFPCFG  +S SDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VK GKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQ+FS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata] (e value 3.8e-79, 87.8% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RYGCFCFPCFG  +S SDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima] (e value 3.2e-78, 86.59% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RY CFCFPCFG  +S SDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD +FEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo] (e value 1.9e-78, 87.2% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RY CFCFPCFG  +S SDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida] (e value 2.4e-78, 86.14% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ESIGEIDGADALLASKRY CFCFPCFGP +S SDE+SWWER K KAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN-GGFQNFSGRFAAVPAPVKSSAAAAATG
        NRPATVKLGKFQYDPISYALNFD+GHNGD DF+G+EY+ GGFQNFS RFAAVP PVKSS A A  G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN-GGFQNFSGRFAAVPAPVKSSAAAAATG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LU30 Uncharacterized protein (e value 2.5e-73, 83.43% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKE ES   IDGADALL SKRY CFCFPCFGP +S SDELSWWER K KAK+TKFD +D HWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN--GGFQNFSGRFAAV-PAPVKSSAAAAATG
        RNRPATVKLGKFQYDPISYALNFDEGHNGD DF+G+EYN  GGFQNFS RFAA+ PAPVKSS++AA  G
Subjt:  RNRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYN--GGFQNFSGRFAAV-PAPVKSSAAAAATG

A0A5A7ULQ3 Uncharacterized protein (e value 1.8e-71, 83.53% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ES   IDGADALLASKRY CFCFPCFGP +S SDELSWWER K KAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG
        NRPATVKLGKFQYDPISYALNFDEGH NGD DFEG  EY   GGFQNFS RFAAV PAP+KSS++ A  G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG

A0A5D3BLD0 Uncharacterized protein (e value 1.5e-70, 82.94% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ES   IDGADALLASKRY CFCFPCFGP +S SDELSWWER K KAK+TKFDG+DHWW+G IRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG
        NRPATVKLGKFQYDPISYALNFDEGH NGD DFEG  EY   GGFQNFS RFAAV PAP+KSS++ A  G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGH-NGDGDFEG-EEY--NGGFQNFSGRFAAV-PAPVKSSAAAAATG

A0A6J1ED51 uncharacterized protein LOC111431456 (e value 1.8e-79, 87.8% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RYGCFCFPCFG  +S SDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD DFEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

A0A6J1KJ01 uncharacterized protein LOC111495035 (e value 1.5e-78, 86.59% identity)
Query:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ESIGEIDGADALLAS+RY CFCFPCFG  +S SDELSWWERAKAKAK+TKFDG+DHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT
        NRPA VKLGKFQYDPISYALNFDEGHNGD +FEG+EY+GGFQNFS RF+AVPA  KSS AA A+
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAAT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e value 5.0e-21, 36.16% identity)
Query:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPCQ-SPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---
        +S  I E+D  D +   L +KR  CF  PC    Q S      WW+R        K + D+ WW   IR  +++REWSE+VAGPRWKT+IRRF R+    
Subjt:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPCQ-SPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---

Query:  -------------------PATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA
                             +   GKF+YD +SY+LNFD+G N  G F+ E     ++++S RFAA   PV +  +
Subjt:  -------------------PATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA

AT3G48020.1 unknown protein (e value 1.0e-18, 42.74% identity)
Query:  CQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGHNGDGDFEGEE
        C S + + SWW+R            +  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++    D D  G  
Subjt:  CQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGHNGDGDFEGEE

Query:  YNGGFQNFSGRFAAVPAPVKSSAA
          GG ++FS R+A+VP     S A
Subjt:  YNGGFQNFSGRFAAVPAPVKSSAA

AT5G14890.1 NHL domain-containing protein (e value 5.9e-22, 40.72% identity)
Query:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPCQSPSDELS-WWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-----
        ES  I E+D  D +   + +KR  CF  PC G  Q      S WW+R +      K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF R     
Subjt:  ESESIGEIDGADAL---LASKRYGCFCFPCFGPCQSPSDELS-WWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-----

Query:  -------NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA
               NRP  V    F+YD  SY+LNFD+G    G FE E     ++++S RFAA   PV +  +
Subjt:  -------NRPATVKLGKFQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAA

AT5G25240.1 unknown protein (e value 2.2e-08, 47.37% identity)
Query:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGHNG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G
Subjt:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGHNG

AT5G62865.1 unknown protein (e value 2.1e-19, 45.14% identity)
Query:  CFCFPCFGPCQSPSD-ELSWWERAKA--KAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF
        C CFP F   +S +    S W R +    +  +   GD+  WW   IR+  K+REWSEIVAGPRWKTFIRRFNR+ P   +      KFQYDP+SY+LNF
Subjt:  CFCFPCFGPCQSPSD-ELSWWERAKA--KAKATKFDGDD-HWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF

Query:  DEGHNGDGDFEGEEY--NGGFQNFSGRFAAVPAPVKSSAAAAAT
        D+      D E +EY   GG ++FS RFA+VP     + A + T
Subjt:  DEGHNGDGDFEGEEY--NGGFQNFSGRFAAVPAPVKSSAAAAAT


Sequences
CDS sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATCAGAATCGATCGGAGAAATCGACGGCGCCGATGCTCTTTTGGCGTCTAAACGATATGGTTGTTTCTGCTTCCCTTGCTT
TGGACCTTGCCAGTCGCCTTCCGACGAGCTCTCCTGGTGGGAACGGGCGAAGGCGAAGGCGAAAGCCACGAAGTTCGACGGCGACGATCACTGGTGGTCCGGCGGAATCA
GATCCCTGAAGAAGCTCCGCGAATGGTCCGAGATCGTCGCCGGTCCGAGATGGAAGACGTTCATTCGCCGGTTCAACAGGAACCGGCCCGCCACCGTGAAGCTTGGGAAA
TTCCAGTACGATCCCATTAGTTACGCTTTGAATTTCGACGAGGGACATAATGGTGATGGGGATTTCGAAGGGGAGGAATACAACGGGGGGTTTCAAAACTTCTCCGGCCG
GTTTGCTGCCGTTCCGGCGCCGGTGAAGTCGTCGGCGGCGGCGGCGGCGACTGGATAG
mRNA sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATCAGAATCGATCGGAGAAATCGACGGCGCCGATGCTCTTTTGGCGTCTAAACGATATGGTTGTTTCTGCTTCCCTTGCTT
TGGACCTTGCCAGTCGCCTTCCGACGAGCTCTCCTGGTGGGAACGGGCGAAGGCGAAGGCGAAAGCCACGAAGTTCGACGGCGACGATCACTGGTGGTCCGGCGGAATCA
GATCCCTGAAGAAGCTCCGCGAATGGTCCGAGATCGTCGCCGGTCCGAGATGGAAGACGTTCATTCGCCGGTTCAACAGGAACCGGCCCGCCACCGTGAAGCTTGGGAAA
TTCCAGTACGATCCCATTAGTTACGCTTTGAATTTCGACGAGGGACATAATGGTGATGGGGATTTCGAAGGGGAGGAATACAACGGGGGGTTTCAAAACTTCTCCGGCCG
GTTTGCTGCCGTTCCGGCGCCGGTGAAGTCGTCGGCGGCGGCGGCGGCGACTGGATAG
Protein sequence
MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK
FQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAATG
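
The three sequences and the genome location listed in this record can be cross-checked against each other. The following is a minimal Python sketch of such a check and is not part of the CuGenDB record. It assumes the standard genetic code (NCBI translation table 1), and the helper names `translate` and `span_length` are hypothetical, introduced only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), assumed here; bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# CDS and protein copied from the record above (line breaks removed).
CDS = (
    "ATGGGAGACCGTGACGGAGCTTTCAAAGAATCAGAATCGATCGGAGAAATCGACGGCGCCGATGCTCTTTTGGCGTCTAAACGATATGGTTGTTTCTGCTTCCCTTGCTT"
    "TGGACCTTGCCAGTCGCCTTCCGACGAGCTCTCCTGGTGGGAACGGGCGAAGGCGAAGGCGAAAGCCACGAAGTTCGACGGCGACGATCACTGGTGGTCCGGCGGAATCA"
    "GATCCCTGAAGAAGCTCCGCGAATGGTCCGAGATCGTCGCCGGTCCGAGATGGAAGACGTTCATTCGCCGGTTCAACAGGAACCGGCCCGCCACCGTGAAGCTTGGGAAA"
    "TTCCAGTACGATCCCATTAGTTACGCTTTGAATTTCGACGAGGGACATAATGGTGATGGGGATTTCGAAGGGGAGGAATACAACGGGGGGTTTCAAAACTTCTCCGGCCG"
    "GTTTGCTGCCGTTCCGGCGCCGGTGAAGTCGTCGGCGGCGGCGGCGGCGACTGGATAG"
)
PROTEIN = (
    "MGDRDGAFKESESIGEIDGADALLASKRYGCFCFPCFGPCQSPSDELSWWERAKAKAKATKFDGDDHWWSGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK"
    "FQYDPISYALNFDEGHNGDGDFEGEEYNGGFQNFSGRFAAVPAPVKSSAAAAATG"
)
LOCATION = "chr10:3376530..3377027"


def translate(nucleotides: str) -> str:
    """Translate an in-frame nucleotide string codon by codon ('*' marks a stop)."""
    usable = len(nucleotides) - len(nucleotides) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[nucleotides[i:i + 3]] for i in range(0, usable, 3))


def span_length(location: str) -> int:
    """Length in bp of a 'chrom:start..end' span with 1-based inclusive coordinates."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


if __name__ == "__main__":
    print("CDS length (nt):", len(CDS))
    print("Genomic span (bp):", span_length(LOCATION))
    print("Translation matches protein:", translate(CDS) == PROTEIN + "*")
```

If the translated CDS equals the listed protein followed by a stop codon, and the CDS length equals the genomic span, that is consistent with the CDS covering the whole locus without introns or untranslated regions, which would also explain why the CDS and mRNA sequences in this record are identical.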