CuGenDBv2

Lag0018806 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018806
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: fibroin heavy chain-like
Genome location: chr5:34797570..34798235
RNA-Seq Expression: Lag0018806
Synteny: Lag0018806
Gene Ontology terms: NA
InterPro domains: NA
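
The genome location listed above can be converted into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for `chrom:start..end` strings, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location):
    """Length of the span in base pairs."""
    # Assumption: coordinates are 1-based and inclusive,
    # so the length is end - start + 1.
    _, start, end = parse_location(location)
    return end - start + 1


chrom, start, end = parse_location("chr5:34797570..34798235")
print(chrom, start, end, span_length("chr5:34797570..34798235"))
```

Under that assumption the span works out to 666 bp.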


Homology
GenBank top hits
KAA0060661.1 hypothetical protein E6C27_scaffold22G005540 [Cucumis melo var. makuwa] | e-value: 1.3e-34 | % identity: 50.68
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYG--HGSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPG-GKGVSNFG--AGPKAGPG-------GVNH
        M+SLKY LLSPFLFLCLS TFA+ + N ++G   GS       P AGPGV+    NIG GPKAGPRAG G G G+S+     GPKAGP        GV+ 
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYG--HGSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPG-GKGVSNFG--AGPKAGPG-------GVNH

Query:  VGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGYRSGFGPGWNGPGVGL---YDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDL
        +  GPRAGP    G  VG G          + P+  G  +G + G G GW  PG  +   Y +C+LGYVCP N    C+KFAY  C+S+NFHPL+AS DL
Subjt:  VGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGYRSGFGPGWNGPGVGL---YDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDL

Query:  HEVEINWAKSKPVEMAQHRKS
        HEV+INWAKSKP   AQH +S
Subjt:  HEVEINWAKSKPVEMAQHRKS

KAG6577377.1 hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 7.4e-38 | % identity: 43.99
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFG----PRAGPGVEGDAINIGAGP-------------KAGPRAGPGGKG-VSNF-------
        M SLKY LLSPF+FLCLS TFAN++ NS+   GSG D+G G    P AGPGVE    N+ AGP             KAGP+AGPG +G VS+        
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFG----PRAGPGVEGDAINIGAGP-------------KAGPRAGPGGKG-VSNF-------

Query:  -GAGPKAGPGGVNHVG-------GGPRAGPGIEGGVN-----VGAGPRAGPGA--------------------------------RRGVDPIVNGVG---
          AGPKAGPG    V         GP+AGPG EG V+     + AGP+AGPGA                                RR VDP++NG+G   
Subjt:  -GAGPKAGPGGVNHVG-------GGPRAGPGIEGGVN-----VGAGPRAGPGA--------------------------------RRGVDPIVNGVG---

Query:  ---VGYRSGF------GPGWNGPGVGL-----YDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH
           +GYRSGF      G  W GPG+G+      ++C LGYVCP   RR C+KF+Y +C+++ FHPL ASM LHEVE+ WAK SKP    Q+
Subjt:  ---VGYRSGF------GPGWNGPGVGL-----YDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH

KGN56231.1 hypothetical protein Csa_011503 [Cucumis sativus] | e-value: 3.3e-38 | % identity: 51.27
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSG------PDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNF----GAGPKAGPG---GVN
        M+SLKY LLSPFLFLCLS TFAN + N + G G G      PD    P AGP V+    N G GPKAGPRAG G  G+SN       GPKAGPG    ++
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSG------PDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNF----GAGPKAGPG---GVN

Query:  HVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGY-----------RSGFGP---GWNGPGVGL---YDDCILGYVCPANERRECNKFAYES
        +VG GPR  P + G  ++ AGPRAGP   +GVDPIV G+GVG            + G  P   GW GPG  +   Y++C+LGYVCP N    C K  Y  
Subjt:  HVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGY-----------RSGFGP---GWNGPGVGL---YDDCILGYVCPANERRECNKFAYES

Query:  CESFNFHPLTASMDLHEVEINWAKSKPVEMAQHRKS
        CES+NF PL+AS +LH+V+INWAKSK VE AQH +S
Subjt:  CESFNFHPLTASMDLHEVEINWAKSKPVEMAQHRKS

XP_022929340.1 fibroin heavy chain-like [Cucurbita moschata] | e-value: 4.6e-40 | % identity: 48.4
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFG----PRAGPGVEGDAINIGAGP-------------KAGPRAGPGGKG-----VSNFGAG
        M SLKY LLSPF+FLCLS TFAN++ NS+   GSG D+G G    P AGPGVE    N+ AGP             KAGP+AGPG +G      +   AG
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFG----PRAGPGVEGDAINIGAGP-------------KAGPRAGPGGKG-----VSNFGAG

Query:  PKAGPGGVNHVGG-------GPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVG------VGYRSGF------GPGWNGPGVGL-----YDDCILGYV
        PKAGPG    V         GPRA PG EGGV+   G     G RR VDP++NG+G      +GYRSGF      G  W GPG+G+      ++C LGYV
Subjt:  PKAGPGGVNHVGG-------GPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVG------VGYRSGF------GPGWNGPGVGL-----YDDCILGYV

Query:  CPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH
        CP   RR C+KF+Y +C+++ FHPL ASM LHEVE+ WAK SKP    Q+
Subjt:  CPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH

XP_023551824.1 fibroin heavy chain-like isoform X2 [Cucurbita pepo subsp. pepo] | e-value: 7.9e-32 | % identity: 49.46
Query:  DIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKG-VSNFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVG------VG
        D+  GP+AGPG EG   ++ AGP+AGP+AGPG +G V+N  AGP            GPRA PG EGGV+   G     G RR VDP++NG+G      +G
Subjt:  DIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKG-VSNFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVG------VG

Query:  YRSGF------GPGWNGPGV---GLYDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH
        YRSGF      G  W GPG+   G+ ++C LGYVCP   RR C+KF+Y +C+S+ FHPL ASM LHEVE+ WAK SKP    Q+
Subjt:  YRSGF------GPGWNGPGV---GLYDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH

TrEMBL top hits
A0A0A0L7X7 Uncharacterized protein | e-value: 1.6e-38 | % identity: 51.27
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSG------PDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNF----GAGPKAGPG---GVN
        M+SLKY LLSPFLFLCLS TFAN + N + G G G      PD    P AGP V+    N G GPKAGPRAG G  G+SN       GPKAGPG    ++
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSG------PDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNF----GAGPKAGPG---GVN

Query:  HVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGY-----------RSGFGP---GWNGPGVGL---YDDCILGYVCPANERRECNKFAYES
        +VG GPR  P + G  ++ AGPRAGP   +GVDPIV G+GVG            + G  P   GW GPG  +   Y++C+LGYVCP N    C K  Y  
Subjt:  HVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGY-----------RSGFGP---GWNGPGVGL---YDDCILGYVCPANERRECNKFAYES

Query:  CESFNFHPLTASMDLHEVEINWAKSKPVEMAQHRKS
        CES+NF PL+AS +LH+V+INWAKSK VE AQH +S
Subjt:  CESFNFHPLTASMDLHEVEINWAKSKPVEMAQHRKS

A0A5A7V4J6 Uncharacterized protein | e-value: 6.3e-35 | % identity: 50.68
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYG--HGSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPG-GKGVSNFG--AGPKAGPG-------GVNH
        M+SLKY LLSPFLFLCLS TFA+ + N ++G   GS       P AGPGV+    NIG GPKAGPRAG G G G+S+     GPKAGP        GV+ 
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYG--HGSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPG-GKGVSNFG--AGPKAGPG-------GVNH

Query:  VGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGYRSGFGPGWNGPGVGL---YDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDL
        +  GPRAGP    G  VG G          + P+  G  +G + G G GW  PG  +   Y +C+LGYVCP N    C+KFAY  C+S+NFHPL+AS DL
Subjt:  VGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGYRSGFGPGWNGPGVGL---YDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDL

Query:  HEVEINWAKSKPVEMAQHRKS
        HEV+INWAKSKP   AQH +S
Subjt:  HEVEINWAKSKPVEMAQHRKS

A0A6I8PB04 Bassoon presynaptic cytomatrix protein | e-value: 6.6e-08 | % identity: 50.41
Query:  GSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPG-GKGVS-------NFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVN
        G+GP  G GPRAGPG  G     G GP AGPRAGPG G G           GAGP+AGPG     G GPR GPG   G   G GPRAGPG      P   
Subjt:  GSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPG-GKGVS-------NFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVN

Query:  GVGVGYRSGFGPGWN---GPGVG
        G G G R+G GPG     GPG G
Subjt:  GVGVGYRSGFGPGWN---GPGVG

A0A6J1EU53 fibroin heavy chain-like | e-value: 2.2e-40 | % identity: 48.4
Query:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFG----PRAGPGVEGDAINIGAGP-------------KAGPRAGPGGKG-----VSNFGAG
        M SLKY LLSPF+FLCLS TFAN++ NS+   GSG D+G G    P AGPGVE    N+ AGP             KAGP+AGPG +G      +   AG
Subjt:  MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFG----PRAGPGVEGDAINIGAGP-------------KAGPRAGPGGKG-----VSNFGAG

Query:  PKAGPGGVNHVGG-------GPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVG------VGYRSGF------GPGWNGPGVGL-----YDDCILGYV
        PKAGPG    V         GPRA PG EGGV+   G     G RR VDP++NG+G      +GYRSGF      G  W GPG+G+      ++C LGYV
Subjt:  PKAGPGGVNHVGG-------GPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVG------VGYRSGF------GPGWNGPGVGL-----YDDCILGYV

Query:  CPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH
        CP   RR C+KF+Y +C+++ FHPL ASM LHEVE+ WAK SKP    Q+
Subjt:  CPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAK-SKPVEMAQH

U3KJI4 Uncharacterized protein | e-value: 8.6e-08 | % identity: 53.57
Query:  GSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGYRS
        G+GP  G GP AGPG  G     GAGP AGP AGPG    +  GAGP AGPG     G GP AGPG   G   GAGP AGPGA  G  P   G G G   
Subjt:  GSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPRAGPGARRGVDPIVNGVGVGYRS

Query:  GFGPGWNGPGVG
        G GPG  GPG G
Subjt:  GFGPGWNGPGVG

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTTCCCTCAAATATCTCCTTCTCTCTCCCTTTCTTTTCCTCTGCTTAAGCTCCACCTTCGCCAACAAACTTCTCAACTCCAACTATGGACACGGTTCTGGTCCCGA
TATCGGGTTTGGACCGAGAGCTGGGCCGGGTGTTGAGGGAGACGCAATCAATATCGGAGCTGGTCCAAAAGCCGGGCCAAGGGCTGGCCCAGGAGGTAAGGGAGTAAGCA
ATTTTGGGGCCGGTCCGAAAGCTGGGCCAGGAGGAGTAAACCATGTCGGGGGTGGCCCGAGAGCTGGGCCGGGAATTGAGGGAGGAGTGAATGTTGGGGCTGGGCCGAGA
GCGGGGCCGGGAGCTAGGAGAGGTGTTGATCCAATTGTTAATGGAGTCGGAGTTGGATATAGGTCAGGATTTGGGCCGGGCTGGAATGGGCCAGGAGTTGGGCTGTACGA
TGATTGCATATTGGGCTACGTTTGTCCAGCGAATGAACGTAGAGAATGCAACAAATTTGCTTATGAAAGTTGCGAATCTTTTAACTTTCATCCATTGACTGCTTCGATGG
ATCTGCACGAAGTTGAAATCAATTGGGCCAAAAGCAAGCCTGTTGAAATGGCCCAACATCGCAAATCTGAATTCCACGTATCACCGCCCACTAAGAAGGCCCAAAATGGT
GTCTAA
mRNA sequence
ATGTCTTCCCTCAAATATCTCCTTCTCTCTCCCTTTCTTTTCCTCTGCTTAAGCTCCACCTTCGCCAACAAACTTCTCAACTCCAACTATGGACACGGTTCTGGTCCCGA
TATCGGGTTTGGACCGAGAGCTGGGCCGGGTGTTGAGGGAGACGCAATCAATATCGGAGCTGGTCCAAAAGCCGGGCCAAGGGCTGGCCCAGGAGGTAAGGGAGTAAGCA
ATTTTGGGGCCGGTCCGAAAGCTGGGCCAGGAGGAGTAAACCATGTCGGGGGTGGCCCGAGAGCTGGGCCGGGAATTGAGGGAGGAGTGAATGTTGGGGCTGGGCCGAGA
GCGGGGCCGGGAGCTAGGAGAGGTGTTGATCCAATTGTTAATGGAGTCGGAGTTGGATATAGGTCAGGATTTGGGCCGGGCTGGAATGGGCCAGGAGTTGGGCTGTACGA
TGATTGCATATTGGGCTACGTTTGTCCAGCGAATGAACGTAGAGAATGCAACAAATTTGCTTATGAAAGTTGCGAATCTTTTAACTTTCATCCATTGACTGCTTCGATGG
ATCTGCACGAAGTTGAAATCAATTGGGCCAAAAGCAAGCCTGTTGAAATGGCCCAACATCGCAAATCTGAATTCCACGTATCACCGCCCACTAAGAAGGCCCAAAATGGT
GTCTAA
Protein sequence
MSSLKYLLLSPFLFLCLSSTFANKLLNSNYGHGSGPDIGFGPRAGPGVEGDAINIGAGPKAGPRAGPGGKGVSNFGAGPKAGPGGVNHVGGGPRAGPGIEGGVNVGAGPR
AGPGARRGVDPIVNGVGVGYRSGFGPGWNGPGVGLYDDCILGYVCPANERRECNKFAYESCESFNFHPLTASMDLHEVEINWAKSKPVEMAQHRKSEFHVSPPTKKAQNG
V
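
The relationship between the CDS and protein sequences above can be checked by translating codons. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1) applies to this gene, and the helper name `translate` is hypothetical. It is run on the first 30 nucleotides and the final 6 nucleotides of the CDS.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, keeping '*' for stop codons."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGTCTTCCCTCAAATATCTCCTTCTCTCT"))  # first 10 codons of the CDS
print(translate("GTCTAA"))                          # last 2 codons of the CDS
```

The first call yields MSSLKYLLLS, matching the start of the protein sequence, and the second yields V followed by the stop symbol, matching its final residue.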