; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013368 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013368
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionWW domain-containing protein
Genome locationChr02:825152..825864
RNA-Seq ExpressionHG10013368
SyntenyHG10013368
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR036020 - WW domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591744.1 hypothetical protein SDJN03_14090, partial [Cucurbita argyrosperma subsp. sororia]9.1e-7277.03Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS
        MKRKWEDPQ E FNSPT+  IEL LETPLPLEWQRCLDIQSG+I+FYNTKTQKRTSMDPRRK E P    SLGGE LSLDLELNLNCQS       KKKS
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS

Query:  NIDGVMARGL---MKQANGSSPPWLRFER-EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-
          DG+M  GL   +KQ NG  PPWLRFER +QQEM ARVCMQCHLLVMLLKSSPTCPNCKFIHPITD LQ PP   TTTFFIPNSLPT+ +NQT SLK+ 
Subjt:  NIDGVMARGL---MKQANGSSPPWLRFER-EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-

Query:  FPNPIQNKV
        FPNPIQNKV
Subjt:  FPNPIQNKV

KAG7024627.1 hypothetical protein SDJN02_13445 [Cucurbita argyrosperma subsp. argyrosperma]2.6e-7176.56Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS
        MKRKWEDPQ E FNSPT+  IEL LETPLPLEWQRCLDIQSG+I+FYNTKTQKRTSMDPRRK E P    SLGGE LSLDLELNLNCQS       KKKS
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS

Query:  NIDGVMARGL---MKQANGSSPPWLRFER-EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-
          DG+M  GL   +KQ NG  PPWLRFER +QQEM ARVCMQCHLLVMLLKSSPTCPNCKFIHPITD LQ PP   TTTFFIPNSLPT+ +NQT SLK+ 
Subjt:  NIDGVMARGL---MKQANGSSPPWLRFER-EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-

Query:  FPNPIQNKV
        FPNPIQN+V
Subjt:  FPNPIQNKV

XP_004136212.1 uncharacterized protein LOC101215652 [Cucumis sativus]1.6e-7676.79Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESP--TDNRSLGG---EALSLDLELNLNCQSKKK--
        MKRKWEDPQ +NFNSPT DNIELHLETPLPLEWQRCLDIQSGEIHF+NTKTQKRTSMDPRR KLE P  T NRS      +ALSLDLELNLNCQSKKK  
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESP--TDNRSLGG---EALSLDLELNLNCQSKKK--

Query:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPPTT-------TTTTFFIPNSLPTE
        SN +     GL+K QAN  G S PWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDH Q     PPTT       TTTTFF+PNSLPTE
Subjt:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPPTT-------TTTTFFIPNSLPTE

Query:  NNN----QTQSLKT-FPNPIQNKV
        NNN    QT+SLK+ FPNPIQNKV
Subjt:  NNN----QTQSLKT-FPNPIQNKV

XP_008466035.2 PREDICTED: uncharacterized protein LOC103503587 [Cucumis melo]3.0e-7577.63Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESPTDNRSLG----GEALSLDLELNLNCQSKKK---
        MKRKWEDPQ ENFNSPT DNIELHLETPLPLEWQRCLDIQSGEIHF+NTKTQKRTSMDPRR KLE PT   S       +ALSLDLELNLNCQS KK   
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESPTDNRSLG----GEALSLDLELNLNCQSKKK---

Query:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPP--TTTTTTFFIPNSLPTENNN--
        +N D     GLMK QAN  G S PWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDH Q     PP  TTTTTTFF+PNSLPTEN+N  
Subjt:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPP--TTTTTTFFIPNSLPTENNN--

Query:  --QTQSLKT-FPNPIQNKV
          QT+SLK+ FPNPIQNKV
Subjt:  --QTQSLKT-FPNPIQNKV

XP_038899250.1 uncharacterized protein LOC120086593 [Benincasa hispida]6.3e-8988.38Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNIDGVMA
        MKRKWED Q ENFNSPT DNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRK ES T NRS G EALSLDLELNLNCQSKKKS+ DGVM 
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNIDGVMA

Query:  RGLMKQANGSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-FPNPIQNKV
          LMKQAN  S PWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ   T TTTTFF+PNSLPTENNNQTQSLK+ FPNPIQNKV
Subjt:  RGLMKQANGSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-FPNPIQNKV

TrEMBL top hitse value%identityAlignment
A0A0A0LHX2 Uncharacterized protein7.7e-7776.79Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESP--TDNRSLGG---EALSLDLELNLNCQSKKK--
        MKRKWEDPQ +NFNSPT DNIELHLETPLPLEWQRCLDIQSGEIHF+NTKTQKRTSMDPRR KLE P  T NRS      +ALSLDLELNLNCQSKKK  
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESP--TDNRSLGG---EALSLDLELNLNCQSKKK--

Query:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPPTT-------TTTTFFIPNSLPTE
        SN +     GL+K QAN  G S PWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDH Q     PPTT       TTTTFF+PNSLPTE
Subjt:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPPTT-------TTTTFFIPNSLPTE

Query:  NNN----QTQSLKT-FPNPIQNKV
        NNN    QT+SLK+ FPNPIQNKV
Subjt:  NNN----QTQSLKT-FPNPIQNKV

A0A1S3CQA1 uncharacterized protein LOC1035035871.5e-7577.63Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESPTDNRSLG----GEALSLDLELNLNCQSKKK---
        MKRKWEDPQ ENFNSPT DNIELHLETPLPLEWQRCLDIQSGEIHF+NTKTQKRTSMDPRR KLE PT   S       +ALSLDLELNLNCQS KK   
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRR-KLESPTDNRSLG----GEALSLDLELNLNCQSKKK---

Query:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPP--TTTTTTFFIPNSLPTENNN--
        +N D     GLMK QAN  G S PWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDH Q     PP  TTTTTTFF+PNSLPTEN+N  
Subjt:  SNIDGVMARGLMK-QAN--GSSPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ----KPP--TTTTTTFFIPNSLPTENNN--

Query:  --QTQSLKT-FPNPIQNKV
          QT+SLK+ FPNPIQNKV
Subjt:  --QTQSLKT-FPNPIQNKV

A0A6J1CC88 uncharacterized protein LOC1110093952.1e-5870.3Show/hide
Query:  MKRKWED-PQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGE--ALSLDLELNLNCQSKKKSNIDG
        MKRKWED PQ EN +    D IELHLETPLPLEWQRCLDIQSG+IHFYNTKT+KR+  DPRR    P   RS  GE   LSLDLELNLNCQSKKKS  DG
Subjt:  MKRKWED-PQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGE--ALSLDLELNLNCQSKKKSNIDG

Query:  VMARGLMKQANGSSPPWLRFEREQQ-EMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ-KPPTTTTTTFFIPNSLPTENNNQTQSLKTFPNPIQN
        V+  GL  + +   P WLR ERE+Q EMVARVCMQCHLLVM+LKSSPTCPNCKFIHPI   LQ  PPTTTTTT FI  S PT+  +QTQSLK F N IQN
Subjt:  VMARGLMKQANGSSPPWLRFEREQQ-EMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQ-KPPTTTTTTFFIPNSLPTENNNQTQSLKTFPNPIQN

Query:  KV
        KV
Subjt:  KV

A0A6J1F8N8 uncharacterized protein LOC1114431368.3e-7176.08Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS
        MKRKWEDPQ E FNSPT+  IEL LETPLPLEWQRCLDIQSG+I+FYNTKTQKRTSMDPRRK E P    SLGGE LSLDLELNLNCQS       KKKS
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS

Query:  NIDGVMARGL---MKQANGSSPPWLRFE-REQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-
          DG+M  GL   +KQ NG  PPWLRFE  +QQEM ARVCMQCHLLVMLLKSSPTCPNCKFIHPITD LQ PP   TTTFFIPNSLPT+ +NQT SLK+ 
Subjt:  NIDGVMARGL---MKQANGSSPPWLRFE-REQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-

Query:  FPNPIQNKV
        FPNPIQN+V
Subjt:  FPNPIQNKV

A0A6J1IME5 uncharacterized protein LOC1114765642.0e-6975.12Show/hide
Query:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS
        MKRKWEDPQ E FNSPT+  IEL LETPLPLEWQRCLDIQSG+I+FYNTKTQKRTSMDPRRK E    + SLGGE LSLDLELNLNCQS       KKKS
Subjt:  MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQS-------KKKS

Query:  NIDGVMARGL---MKQANGSSPPWLRFER-EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-
          DG+M  GL   +KQ NG  PPWLRFER +QQEM ARVCMQCHLLVMLLKSSPTCPNCKF HPITD LQ PP   TTTFFIPNSLPT+ +NQT SLK+ 
Subjt:  NIDGVMARGL---MKQANGSSPPWLRFER-EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKT-

Query:  FPNPIQNKV
        FPN IQN+V
Subjt:  FPNPIQNKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22250.1 unknown protein1.2e-1033.08Show/hide
Query:  IHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNI-------------DGVMARGLMKQANGS-----SPPWLRFE------R
        + F  +K+     + P  +L      +S   E    DL+LNLN  S   S++               V+     K  +G      SP WL FE      +
Subjt:  IHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNI-------------DGVMARGLMKQANGS-----SPPWLRFE------R

Query:  EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIH
        ++QEM+  VCM+CH+LVML KS+  CPNCKF+H
Subjt:  EQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIH

AT1G28070.1 unknown protein8.6e-0422.14Show/hide
Query:  IELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEA---LSLDLELNLNCQSKKKSNIDGVMARGLMKQANGSSPPWLRF
        +EL     +P   ++CLD+++GEI++ +  +  R   DPR+ +          GE+   +    E++   +S++ S+           +++ SS  + + 
Subjt:  IELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEA---LSLDLELNLNCQSKKKSNIDGVMARGLMKQANGSSPPWLRF

Query:  EREQQEMVARVCMQCHLLVMLLKSSPTCPNC
        E+++  +V   C  C +  M+ K    CP C
Subjt:  EREQQEMVARVCMQCHLLVMLLKSSPTCPNC

AT1G78170.1 unknown protein3.1e-1735Show/hide
Query:  EDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLN-------------------
        E P+ E+  S T D  ELHL TPLP +WQ              TK   RTS + R     P D    G   +SLDLELNL+                   
Subjt:  EDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLN-------------------

Query:  ---CQSKKKSNIDGVMARGLMKQANGSSPPWLRFE--------REQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSL
             S K   +     + ++      SP WL FE         + QEMV  VCM+CH+LVML  S+P CPNCKF+HP  DH       ++T  F P++L
Subjt:  ---CQSKKKSNIDGVMARGLMKQANGSSPPWLRFE--------REQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSL

AT2G33510.1 CONTAINS InterPro DOMAIN/s: WW/Rsp5/WWP (InterPro:IPR001202)7.5e-0826.09Show/hide
Query:  ENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNIDGVMARGLMKQANGS
        E+    ++  +EL+    LP  W++CLD+++GEI++ N K   R   DPR+ + +  D+    G   S   E +    S++ S+     +R   K+    
Subjt:  ENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNIDGVMARGLMKQANGS

Query:  SPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNC
               E E+  +V   C  C +  M+ K    CP C
Subjt:  SPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNC

AT4G08910.1 unknown protein1.1e-0953.7Show/hide
Query:  REQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIP
        R ++EMVARVCM+CH+LVML K+SP CPNCKF+H        P  T+ +  F P
Subjt:  REQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGAAAATGGGAAGACCCACAAGTTGAAAACTTCAATTCTCCCACTGAGGACAATATTGAGCTCCACCTTGAGACTCCATTACCTTTGGAGTGGCAACGATGTCT
GGATATTCAGTCAGGAGAAATACATTTCTATAACACAAAGACTCAAAAAAGAACCTCAATGGATCCAAGAAGAAAATTAGAGAGTCCTACTGATAATAGAAGCCTTGGAG
GTGAGGCCTTGAGCTTGGACCTTGAGCTCAATCTAAATTGCCAATCAAAGAAGAAGAGCAATATTGATGGGGTTATGGCTAGAGGTTTGATGAAACAAGCAAATGGGAGT
TCTCCTCCATGGCTAAGATTTGAAAGAGAGCAACAAGAAATGGTTGCAAGAGTTTGCATGCAGTGTCACTTATTGGTTATGCTTTTGAAGTCCTCACCAACATGCCCCAA
TTGCAAATTCATACACCCAATAACAGATCACCTCCAAAAACCTCCAACTACAACAACAACAACCTTCTTTATACCTAATTCTCTCCCTACTGAGAATAATAATCAAACCC
AGTCCTTAAAGACTTTCCCAAATCCTATTCAAAATAAAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGAAAATGGGAAGACCCACAAGTTGAAAACTTCAATTCTCCCACTGAGGACAATATTGAGCTCCACCTTGAGACTCCATTACCTTTGGAGTGGCAACGATGTCT
GGATATTCAGTCAGGAGAAATACATTTCTATAACACAAAGACTCAAAAAAGAACCTCAATGGATCCAAGAAGAAAATTAGAGAGTCCTACTGATAATAGAAGCCTTGGAG
GTGAGGCCTTGAGCTTGGACCTTGAGCTCAATCTAAATTGCCAATCAAAGAAGAAGAGCAATATTGATGGGGTTATGGCTAGAGGTTTGATGAAACAAGCAAATGGGAGT
TCTCCTCCATGGCTAAGATTTGAAAGAGAGCAACAAGAAATGGTTGCAAGAGTTTGCATGCAGTGTCACTTATTGGTTATGCTTTTGAAGTCCTCACCAACATGCCCCAA
TTGCAAATTCATACACCCAATAACAGATCACCTCCAAAAACCTCCAACTACAACAACAACAACCTTCTTTATACCTAATTCTCTCCCTACTGAGAATAATAATCAAACCC
AGTCCTTAAAGACTTTCCCAAATCCTATTCAAAATAAAGTTTAA
Protein sequenceShow/hide protein sequence
MKRKWEDPQVENFNSPTEDNIELHLETPLPLEWQRCLDIQSGEIHFYNTKTQKRTSMDPRRKLESPTDNRSLGGEALSLDLELNLNCQSKKKSNIDGVMARGLMKQANGS
SPPWLRFEREQQEMVARVCMQCHLLVMLLKSSPTCPNCKFIHPITDHLQKPPTTTTTTFFIPNSLPTENNNQTQSLKTFPNPIQNKV