; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041229 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041229
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr13:14040212..14040595
RNA-Seq ExpressionLag0041229
SyntenyLag0041229
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600825.1 hypothetical protein SDJN03_06058, partial [Cucurbita argyrosperma subsp. sororia]1.5e-3570.08Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQ RP YAL  + + F +LSINVV  HG +SSKKLDE+TGGDD  VKCTPCT    PPPPPPPPKKP   YCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FAGA-RRKMAVELPVVALFGMIGFIAL
        FAGA RR++ VEL  VALFG+IGF+ +
Subjt:  FAGA-RRKMAVELPVVALFGMIGFIAL

KAG7031461.1 hypothetical protein SDJN02_05501, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-3571.65Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQ RP YAL I   LF +LSINVV  HG +SSKKLDE+TGGDD  VKCTPCT    PPPPPPPPKKP   YCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FAGA-RRKMAVELPVVALFGMIGFIAL
        FAGA RR++ VEL  VALFG+IGF+ +
Subjt:  FAGA-RRKMAVELPVVALFGMIGFIAL

XP_022155381.1 acrosin-like [Momordica charantia]6.3e-4282.46Show/hide
Query:  LINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDFAGARRKMAVELP
        ++NL LILSINVVTIHGLISS KLD +TGG DS+VKCTPCTRY PPPPPPPPPKKPPS+YCPPPP PPSSFIY+LGPPGNLYPI QDFAG RR++AVELP
Subjt:  LINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDFAGARRKMAVELP

Query:  VVALFGMIGFIALW
        VVAL G++GFIA+W
Subjt:  VVALFGMIGFIALW

XP_022942055.1 probable glycosyltransferase 4 [Cucurbita moschata]6.1e-3772.44Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQ RP YAL IL   F  LS NVV  HG +SSKKLDE+TGGDD SVKCTPCT    PPPPPPPPKKP   YCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FAGA-RRKMAVELPVVALFGMIGFIAL
        FAGA RR++AVEL  VALFG+IGF+ +
Subjt:  FAGA-RRKMAVELPVVALFGMIGFIAL

XP_023546806.1 formin-like protein 20 [Cucurbita pepo subsp. pepo]2.3e-3671.65Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQ RP YAL I       LSINVV  HG +SSKKLDE+TGGDD SVKCTPCT    PPPPPPPPKKP   YCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FAGA-RRKMAVELPVVALFGMIGFIAL
        FAGA RR++AVEL  VALFG+IGF+ +
Subjt:  FAGA-RRKMAVELPVVALFGMIGFIAL

TrEMBL top hitse value%identityAlignment
A0A067FBM4 Uncharacterized protein2.8e-1948.97Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTP-CTRYPPPP--------PPPPP---------PKKPPSTYCPPPPSPPS
        M IQ+     L +L    L+ +I    ++GL  S+KLDE+TG  D  +KCTP CT+ PPPP        PPPPP         PKKPPS YCPPP  PP 
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTP-CTRYPPPP--------PPPPP---------PKKPPSTYCPPPPSPPS

Query:  SFIYVLGPPGNLYPIDQDFAGARRKMAVELPVVALFGMIGFIALW
        SFIY+ GPPGNLYP+D DF GA +K+   LP++   G++GF+ALW
Subjt:  SFIYVLGPPGNLYPIDQDFAGARRKMAVELPVVALFGMIGFIALW

A0A0A0KSV7 Uncharacterized protein4.0e-3471.65Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQ  P Y+L IL + FLILSIN+  IHG ISSKKLDE     DSSVKCTPCTRY   PPPPPPPKKPP  YCPPPP PPSSFIY+LGPP NLYPI+ D
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FAGA-RRKMAVELPVVALFGMIGFIAL
        FA A RR +A+ELPVVA FG+IG IAL
Subjt:  FAGA-RRKMAVELPVVALFGMIGFIAL

A0A6J1DRJ1 acrosin-like3.1e-4282.46Show/hide
Query:  LINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDFAGARRKMAVELP
        ++NL LILSINVVTIHGLISS KLD +TGG DS+VKCTPCTRY PPPPPPPPPKKPPS+YCPPPP PPSSFIY+LGPPGNLYPI QDFAG RR++AVELP
Subjt:  LINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDFAGARRKMAVELP

Query:  VVALFGMIGFIALW
        VVAL G++GFIA+W
Subjt:  VVALFGMIGFIALW

A0A6J1FVH5 probable glycosyltransferase 43.0e-3772.44Show/hide
Query:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQ RP YAL IL   F  LS NVV  HG +SSKKLDE+TGGDD SVKCTPCT    PPPPPPPPKKP   YCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FAGA-RRKMAVELPVVALFGMIGFIAL
        FAGA RR++AVEL  VALFG+IGF+ +
Subjt:  FAGA-RRKMAVELPVVALFGMIGFIAL

A5B8N9 Uncharacterized protein1.1e-1849.6Show/hide
Query:  LINLFLILSINVVTIHGLISSKKLDE--ATGGDDSSVKC-------TPCTRYPPPPP---PPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        L+ +FL+++     ++G  +S+KLDE  A+G  D+ VKC        PC + PPPPP   PPPPPKKP + YCPPPP PP+SF+YV GPPG LYPIDQD+
Subjt:  LINLFLILSINVVTIHGLISSKKLDE--ATGGDDSSVKC-------TPCTRYPPPPP---PPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  AGARRKMAVELPVVALFGMIGFIAL
         GA R   V LP++A  G++  + L
Subjt:  AGARRKMAVELPVVALFGMIGFIAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02405.1 proline-rich family protein5.0e-0532.09Show/hide
Query:  LAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPP---------------PPPPKK---PPSTYCPPPPSPPSSFIYVLGPPG
        LA  I + +++  + + I+   SS ++D         V C PC +  PPPPP               PPPPKK   PPS   PPPP PP ++++   PPG
Subjt:  LAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPP---------------PPPPKK---PPSTYCPPPPSPPSSFIYVLGPPG

Query:  NLYPIDQDFAGARRKMAVELPVVALFGMIGFIAL
        +LYPI+  +  A    +  +  + +FG++ F+ L
Subjt:  NLYPIDQDFAGARRKMAVELPVVALFGMIGFIAL

AT1G23040.1 hydroxyproline-rich glycoprotein family protein9.5e-1239.55Show/hide
Query:  LFLILSINVVT----IHGLISSKKLDEATGGDD-SSVKCTP-CTRYPPPPPPPPPPKKPP----------------STYCPPPPSPPSSFIYVLGPPGNL
        +F ++ I ++T    I+G  SS    EA   D+   +KC+P C + PPPP PPPP   PP                S+YCPPP  PP++F+Y+ GPPGNL
Subjt:  LFLILSINVVT----IHGLISSKKLDEATGGDD-SSVKCTP-CTRYPPPPPPPPPPKKPP----------------STYCPPPPSPPSSFIYVLGPPGNL

Query:  YPIDQDFAGARRK--MAVELPVVALFGMIGFIAL
        YP+D+ F  A  K  M V+L  +  FG++ F+ L
Subjt:  YPIDQDFAGARRK--MAVELPVVALFGMIGFIAL

AT1G70990.1 proline-rich family protein1.6e-1135.04Show/hide
Query:  QRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPP------------PSPPSSFIYVLGPP
        +R    A  +L  +  +    ++      +++KL+E        +KCTPC +  PPP PPPP   PPS  CPPP            P PPS++IY+ GPP
Subjt:  QRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPP------------PSPPSSFIYVLGPP

Query:  GNLYPIDQDFAGARRK--MAVELPVVALFGMIGFIAL
        G LYPIDQ F  A  K    V++  +  FG++ F+ +
Subjt:  GNLYPIDQDFAGARRK--MAVELPVVALFGMIGFIAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCATCCAAAGGCGCCCTTGTTATGCTCTAGCAATCCTCATCAACTTGTTCTTGATTCTTTCCATCAATGTGGTAACCATCCATGGCTTGATCTCTTCCAAGAAGCT
TGACGAGGCGACCGGTGGCGACGATTCGAGCGTCAAGTGTACGCCCTGCACCCGTTATCCACCACCGCCACCTCCGCCGCCTCCACCAAAGAAACCGCCATCGACGTACT
GCCCTCCGCCCCCGTCTCCTCCGTCGTCTTTCATTTACGTTCTCGGCCCGCCGGGAAACTTGTATCCCATTGACCAAGACTTCGCCGGTGCCCGGAGGAAGATGGCCGTG
GAGTTGCCGGTGGTTGCTCTCTTTGGAATGATTGGTTTCATTGCTTTGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCATCCAAAGGCGCCCTTGTTATGCTCTAGCAATCCTCATCAACTTGTTCTTGATTCTTTCCATCAATGTGGTAACCATCCATGGCTTGATCTCTTCCAAGAAGCT
TGACGAGGCGACCGGTGGCGACGATTCGAGCGTCAAGTGTACGCCCTGCACCCGTTATCCACCACCGCCACCTCCGCCGCCTCCACCAAAGAAACCGCCATCGACGTACT
GCCCTCCGCCCCCGTCTCCTCCGTCGTCTTTCATTTACGTTCTCGGCCCGCCGGGAAACTTGTATCCCATTGACCAAGACTTCGCCGGTGCCCGGAGGAAGATGGCCGTG
GAGTTGCCGGTGGTTGCTCTCTTTGGAATGATTGGTTTCATTGCTTTGTGGTGA
Protein sequenceShow/hide protein sequence
MFIQRRPCYALAILINLFLILSINVVTIHGLISSKKLDEATGGDDSSVKCTPCTRYPPPPPPPPPPKKPPSTYCPPPPSPPSSFIYVLGPPGNLYPIDQDFAGARRKMAV
ELPVVALFGMIGFIALW