CuGenDBv2

Lag0022786 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022786
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Polyglutamine tract-binding protein 1
Genome location: chr7:37949199..37955898
RNA-Seq Expression: Lag0022786
Synteny: Lag0022786
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001202 - WW domain; IPR036020 - WW domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_016899708.1 PREDICTED: uncharacterized protein LOC103486911 isoform X1 [Cucumis melo] (e value: 1.9e-51; identity: 79.49%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

XP_016899711.1 PREDICTED: uncharacterized protein LOC103486911 isoform X2 [Cucumis melo] (e value: 1.4e-51; identity: 77.05%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYMLMSYI
        +GLVQAWGYCNHCTR     Y+
Subjt:  IGLVQAWGYCNHCTRYMLMSYI

XP_038906172.1 uncharacterized protein LOC120092051 isoform X1 [Benincasa hispida] (e value: 2.9e-52; identity: 83.76%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+QVTQWEPPVASHQ TLTHSNDNV GSWNNQTLEQ+KCITCG GIT+VQGSRYCN CTS VSTSST G+WQDQSSE NKCMGC GWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

XP_038906174.1 uncharacterized protein LOC120092051 isoform X2 [Benincasa hispida] (e value: 2.9e-52; identity: 83.76%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+QVTQWEPPVASHQ TLTHSNDNV GSWNNQTLEQ+KCITCG GIT+VQGSRYCN CTS VSTSST G+WQDQSSE NKCMGC GWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

XP_038906175.1 uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida] (e value: 2.9e-52; identity: 83.76%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+QVTQWEPPVASHQ TLTHSNDNV GSWNNQTLEQ+KCITCG GIT+VQGSRYCN CTS VSTSST G+WQDQSSE NKCMGC GWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DUP7 uncharacterized protein LOC103486911 isoform X3 (e value: 9.0e-52; identity: 79.49%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

A0A1S4DUQ3 uncharacterized protein LOC103486911 isoform X2 (e value: 6.9e-52; identity: 77.05%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYMLMSYI
        +GLVQAWGYCNHCTR     Y+
Subjt:  IGLVQAWGYCNHCTRYMLMSYI

A0A1S4DVH1 uncharacterized protein LOC103486911 isoform X1 (e value: 9.0e-52; identity: 79.49%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

A0A5A7UK56 Polyglutamine tract-binding protein 1 (e value: 9.0e-52; identity: 79.49%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

A0A5D3DPP7 Polyglutamine tract-binding protein 1 (e value: 9.0e-52; identity: 79.49%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        D+  GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG WN+QTLEQ+KCITCG G+T+VQGSRYCN CTS VSTSST G WQDQSSE NKCMGCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTRYM
        +GLVQAWGYCNHCTR +
Subjt:  IGLVQAWGYCNHCTRYM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G41020.1 WW domain-containing protein (e value: 6.1e-16; identity: 36.52%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        DE  G KY+YN R+ V+QWEPP +  +   T+SN                                     + V+ S+  GK +   S+L +C GCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTR
        +GLVQ WGYC HCTR
Subjt:  IGLVQAWGYCNHCTR

AT2G41020.2 WW domain-containing protein (e value: 6.1e-16; identity: 36.52%)
Query:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
        DE  G KY+YN R+ V+QWEPP +  +   T+SN                                     + V+ S+  GK +   S+L +C GCGGWG
Subjt:  DEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG

Query:  IGLVQAWGYCNHCTR
        +GLVQ WGYC HCTR
Subjt:  IGLVQAWGYCNHCTR


Sequences
CDS sequence
ATGCTAAAATCTCCTCATCTTTCAATAGATGACGAAAAACGAGGCCTTAAATACTACTACAATGTGAGAAGTCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCA
GGTAACTTTGACACACTCAAATGATAATGTTCCTGGGTCTTGGAACAATCAAACTTTGGAGCAAAATAAATGCATCACATGTGGAAGGGGAATCACCGTCGTGCAGGGTT
CAAGATACTGCAACGGTTGTACAAGTGAGGTTTCTACAAGTTCAACCATTGGGAAATGGCAGGATCAATCGTCTGAGCTAAATAAATGCATGGGATGTGGTGGTTGGGGA
ATTGGCCTTGTGCAAGCTTGGGGTTATTGCAATCATTGTACACGGTATATGTTAATGTCCTATATCATTCGGTACTATTACTTGGGGGCATGGATTTTTGGCCTTTGCTT
GGCTAGGCTTTCAAATGCTGATTCTCTCAGCAGTCCACACACCTTTGTCAGCCTCCCTCCCTATATTAAAGGTACTCAGTTTATCGACCAAGGCTGCCCAACTGGCAACC
TCATGATCAAAAACACCTCTGAGAAAGCCCTCGGTTCCAAGACTGTTTTTCCTCGTCCCAGCAATCATAGATGGAGTATTCTTTTATGTGTGGACAGCAAGGCCATCTAT
CTAATGAATGCCTTCAATGATGGACACTTGCTATTCATGAGGAAACTCAGGGTGATGATACTTTCAATTCTGAAGATTCGGATGGATTTGAATTTGCGAAACTCGACTCA
TATGGATATCAACTTTTTTGCATTCTTCAAAGAATCCTTCTAA
mRNA sequence
ATGCTAAAATCTCCTCATCTTTCAATAGATGACGAAAAACGAGGCCTTAAATACTACTACAATGTGAGAAGTCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCA
GGTAACTTTGACACACTCAAATGATAATGTTCCTGGGTCTTGGAACAATCAAACTTTGGAGCAAAATAAATGCATCACATGTGGAAGGGGAATCACCGTCGTGCAGGGTT
CAAGATACTGCAACGGTTGTACAAGTGAGGTTTCTACAAGTTCAACCATTGGGAAATGGCAGGATCAATCGTCTGAGCTAAATAAATGCATGGGATGTGGTGGTTGGGGA
ATTGGCCTTGTGCAAGCTTGGGGTTATTGCAATCATTGTACACGGTATATGTTAATGTCCTATATCATTCGGTACTATTACTTGGGGGCATGGATTTTTGGCCTTTGCTT
GGCTAGGCTTTCAAATGCTGATTCTCTCAGCAGTCCACACACCTTTGTCAGCCTCCCTCCCTATATTAAAGGTACTCAGTTTATCGACCAAGGCTGCCCAACTGGCAACC
TCATGATCAAAAACACCTCTGAGAAAGCCCTCGGTTCCAAGACTGTTTTTCCTCGTCCCAGCAATCATAGATGGAGTATTCTTTTATGTGTGGACAGCAAGGCCATCTAT
CTAATGAATGCCTTCAATGATGGACACTTGCTATTCATGAGGAAACTCAGGGTGATGATACTTTCAATTCTGAAGATTCGGATGGATTTGAATTTGCGAAACTCGACTCA
TATGGATATCAACTTTTTTGCATTCTTCAAAGAATCCTTCTAA
Protein sequence
MLKSPHLSIDDEKRGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNNQTLEQNKCITCGRGITVVQGSRYCNGCTSEVSTSSTIGKWQDQSSELNKCMGCGGWG
IGLVQAWGYCNHCTRYMLMSYIIRYYYLGAWIFGLCLARLSNADSLSSPHTFVSLPPYIKGTQFIDQGCPTGNLMIKNTSEKALGSKTVFPRPSNHRWSILLCVDSKAIY
LMNAFNDGHLLFMRKLRVMILSILKIRMDLNLRNSTHMDINFFAFFKESF
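
The protein sequence above is expected to be the conceptual translation of the CDS sequence. The following is a minimal sketch of how that correspondence could be checked, assuming the standard genetic code (the record does not state which translation table was used). The `translate` helper and its handling of unknown codons are illustrative, not part of CuGenDB.

```python
# Standard genetic code (assumed), with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored so that a wrapped sequence
    can be pasted in directly. Codons with non-ACGT letters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGCTAAAATCTCCTCATCTTTCAATAGAT"))  # MLKSPHLSID
```

Passing the full CDS to `translate` and comparing the result with the listed protein sequence would confirm whether the two are consistent end to end.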