CuGenDBv2

Lsi03G015840 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G015840
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Polyglutamine tract-binding protein 1
Genome location: chr03:26865933..26869861
RNA-Seq Expression: Lsi03G015840
Synteny: Lsi03G015840
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001202 - WW domain; IPR036020 - WW domain superfamily
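
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the length of the locus. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


chrom, start, end = parse_location("chr03:26865933..26869861")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 3,929 bp on chr03.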


Homology
GenBank top hits | e value | % identity | Alignment
KAA0053859.1 WW domain-containing protein [Cucumis melo var. makuwa] | 8.1e-73 | 71.96
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANG---KNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSS
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCA YGASKPGIVANG   K  FLLLDFDVD        VLLSGNN  GQKIQGQV+E EQSS
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANG---KNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSS

Query:  VAKALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVS
         AKALPEYLKQKLRARGILKEDAEHSN           + TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSDTQLSSAVS
Subjt:  VAKALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVS

Query:  LPEDWMEALDQTAG
        LPEDWMEA+DQT+G
Subjt:  LPEDWMEALDQTAG

XP_004136655.1 uncharacterized protein LOC101203374 [Cucumis sativus] | 1.1e-64 | 66.82
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ I +   +  E GNLEIGNGYGVPGGCAFYGASKPGIVAN                         GNN  GQKIQGQ++EAEQSS +K
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKEDAEHSNS          + TNSDA+SN  LQGEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SS+TQLSSAVSLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEA+DQT+G
Subjt:  DWMEALDQTAG

XP_038906172.1 uncharacterized protein LOC120092051 isoform X1 [Benincasa hispida] | 1.3e-65 | 67.77
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCAFYGASKPG+VA                          GNN IGQKIQGQVRE EQSS  K
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKE+AEHSNS            T+SDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSA SLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEALDQ  G
Subjt:  DWMEALDQTAG

XP_038906174.1 uncharacterized protein LOC120092051 isoform X2 [Benincasa hispida] | 1.3e-65 | 67.77
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCAFYGASKPG+VA                          GNN IGQKIQGQVRE EQSS  K
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKE+AEHSNS            T+SDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSA SLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEALDQ  G
Subjt:  DWMEALDQTAG

XP_038906175.1 uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida] | 1.3e-65 | 67.77
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCAFYGASKPG+VA                          GNN IGQKIQGQVRE EQSS  K
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKE+AEHSNS            T+SDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSA SLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEALDQ  G
Subjt:  DWMEALDQTAG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LFL2 Polyglutamine tract-binding protein 1 | 5.2e-65 | 66.82
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ I +   +  E GNLEIGNGYGVPGGCAFYGASKPGIVAN                         GNN  GQKIQGQ++EAEQSS +K
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKEDAEHSNS          + TNSDA+SN  LQGEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SS+TQLSSAVSLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEA+DQT+G
Subjt:  DWMEALDQTAG

A0A1S4DUP7 uncharacterized protein LOC103486911 isoform X3 | 2.8e-63 | 65.88
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCA YGASKPGIVAN                         GNN  GQKIQGQV+E EQSS AK
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKEDAEHSN             TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSDTQLSSAVSLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEA+DQT+G
Subjt:  DWMEALDQTAG

A0A1S4DVH1 uncharacterized protein LOC103486911 isoform X1 | 3.7e-63 | 65.88
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCA YGASKPGIVAN                         GNN  GQKIQGQV+E EQSS AK
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKEDAEHSN           + TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSDTQLSSAVSLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEA+DQT+G
Subjt:  DWMEALDQTAG

A0A5A7UK56 Polyglutamine tract-binding protein 1 | 3.9e-73 | 71.96
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANG---KNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSS
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCA YGASKPGIVANG   K  FLLLDFDVD        VLLSGNN  GQKIQGQV+E EQSS
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANG---KNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSS

Query:  VAKALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVS
         AKALPEYLKQKLRARGILKEDAEHSN           + TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSDTQLSSAVS
Subjt:  VAKALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVS

Query:  LPEDWMEALDQTAG
        LPEDWMEA+DQT+G
Subjt:  LPEDWMEALDQTAG

A0A5D3DPP7 Polyglutamine tract-binding protein 1 | 2.8e-63 | 65.88
Query:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK
        +L   + H +E+ + +   +  E GNLEIGNGYGVPGGCA YGASKPGIVAN                         GNN  GQKIQGQV+E EQSS AK
Subjt:  ILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAK

Query:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE
        ALPEYLKQKLRARGILKEDAEHSN             TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSDTQLSSAVSLPE
Subjt:  ALPEYLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPE

Query:  DWMEALDQTAG
        DWMEA+DQT+G
Subjt:  DWMEALDQTAG

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G41020.1 WW domain-containing protein | 1.3e-20 | 34.57
Query:  GNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAEHSN
        GN+++GNGYG+PGG A+ G S+                             LSG             + E ++ +  LPEYLKQKL+ARGIL++ A    
Subjt:  GNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAEHSN

Query:  SYLCTSGSNYSSMT-NSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPEDWMEALDQTAG
          + ++  + S+++ N  A          LP GWV+AKDP SG +YYYN+ +G  QWERP E S  T  +  V   E+W+E  D+ +G
Subjt:  SYLCTSGSNYSSMT-NSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPEDWMEALDQTAG

AT2G41020.2 WW domain-containing protein | 1.3e-20 | 34.57
Query:  GNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAEHSN
        GN+++GNGYG+PGG A+ G S+                             LSG             + E ++ +  LPEYLKQKL+ARGIL++ A    
Subjt:  GNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAEHSN

Query:  SYLCTSGSNYSSMT-NSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPEDWMEALDQTAG
          + ++  + S+++ N  A          LP GWV+AKDP SG +YYYN+ +G  QWERP E S  T  +  V   E+W+E  D+ +G
Subjt:  SYLCTSGSNYSSMT-NSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPEDWMEALDQTAG


Sequences
CDS sequence
ATGGTTCTTATCCTCCCTATCTTGCCAACAAAGAACACACATGTAAGTGAACTTCTGATTTTGAAATGTGATGGCCATTACATTGAGTTAGGGAACTTGGAAATTGGAAA
TGGATATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTTGCCAATGGTAAAAACCTTTTCCTTTTGCTTGATTTCGATGTTGACCTTCCAT
TTGGTTCCCTTAACGTAGTCCTGCTTTCAGGAAATAACGCGATTGGCCAGAAAATCCAGGGACAGGTTAGGGAAGCCGAACAAAGTTCTGTTGCCAAAGCATTGCCCGAG
TACCTCAAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCAGAACATAGCAATTCTTATTTGTGTACTTCTGGTAGCAATTATAGTTCAATGACAAATTCTGA
TGCTATTTCAAATCAAACGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTGGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTATTATAATGAAAGTAGTGGGAAGA
GTCAATGGGAAAGGCCCTCTGAATCTTCTTCTGATACGCAACTATCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCACTCGATCAAACAGCAGGTTGGAATGAC
ATATCTGTAAAGTTTAGACTGAGTGGCCTTCAGTTCCAAGTTTCTAGTTTGGAATTGCTGTATCGCAGCTATCAGGAAACTTTCAAACTTGTACAGTCAAAATCCTAA
mRNA sequence
ATGGTTCTTATCCTCCCTATCTTGCCAACAAAGAACACACATGTAAGTGAACTTCTGATTTTGAAATGTGATGGCCATTACATTGAGTTAGGGAACTTGGAAATTGGAAA
TGGATATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTTGCCAATGGTAAAAACCTTTTCCTTTTGCTTGATTTCGATGTTGACCTTCCAT
TTGGTTCCCTTAACGTAGTCCTGCTTTCAGGAAATAACGCGATTGGCCAGAAAATCCAGGGACAGGTTAGGGAAGCCGAACAAAGTTCTGTTGCCAAAGCATTGCCCGAG
TACCTCAAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCAGAACATAGCAATTCTTATTTGTGTACTTCTGGTAGCAATTATAGTTCAATGACAAATTCTGA
TGCTATTTCAAATCAAACGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTGGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTATTATAATGAAAGTAGTGGGAAGA
GTCAATGGGAAAGGCCCTCTGAATCTTCTTCTGATACGCAACTATCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCACTCGATCAAACAGCAGGTTGGAATGAC
ATATCTGTAAAGTTTAGACTGAGTGGCCTTCAGTTCCAAGTTTCTAGTTTGGAATTGCTGTATCGCAGCTATCAGGAAACTTTCAAACTTGTACAGTCAAAATCCTAA
Protein sequence
MVLILPILPTKNTHVSELLILKCDGHYIELGNLEIGNGYGVPGGCAFYGASKPGIVANGKNLFLLLDFDVDLPFGSLNVVLLSGNNAIGQKIQGQVREAEQSSVAKALPE
YLKQKLRARGILKEDAEHSNSYLCTSGSNYSSMTNSDAISNQTLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAVSLPEDWMEALDQTAGWND
ISVKFRLSGLQFQVSSLELLYRSYQETFKLVQSKS
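
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch of that relationship, the snippet below translates a nucleotide string with the standard genetic code and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard code (rather than another translation table) is an assumption, since the record does not say which table was used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS sequence listed in this record.
print(translate("ATGGTTCTTATCCTCCCTATCTTGCCAACAAAGAACACACAT"))
```

Translating the opening of the CDS this way gives `MVLILPILPTKNTH`, which matches the start of the listed protein sequence.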