; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026727 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026727
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionproteoglycan 4
Genome locationchr10:41079646..41085471
RNA-Seq ExpressionLag0026727
SyntenyLag0026727
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573906.1 Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subsp. sororia]1.5e-1247.01Show/hide
Query:  GDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQK
        G+  PP + P    P +      P     +       ++ K +    Q++    KAEEEAPTPAP  VPPP      P +G SG+ KTTPDEKI++PNQK
Subjt:  GDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQK

Query:  LTEKKEIENQNGEKGKESKTEEVGKNREESKIDT
         TEK   ENQN EKGKESKTEEVGKN +  KI T
Subjt:  LTEKKEIENQNGEKGKESKTEEVGKNREESKIDT

KAG7012971.1 Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-1246.15Show/hide
Query:  GDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQK
        G+  PP + P    P +      P     +       ++ K +    Q++    KAEEEAPTPAP  VPPP      P +G SG+ KTTPDEKI++PNQK
Subjt:  GDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQK

Query:  LTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT
         TEK   ENQN EKGKESKTEEVGKN +  KI   T ++ ATT
Subjt:  LTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT

XP_022945513.1 proteoglycan 4 isoform X1 [Cucurbita moschata]1.3e-1162.92Show/hide
Query:  KAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT
        KA EEAPTPAP  VPPP      P +G SG+ KTTPDEKI +PNQK TEK   ENQN EKGKESKTE+VGKN +  KI   T ++ ATT
Subjt:  KAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT

XP_022968176.1 proteoglycan 4 [Cucurbita maxima]7.5e-1253.57Show/hide
Query:  EDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESK
        E+ +  ++ K +    Q++    KAEEEAPT AP  VPPP      P +G SG+ KTTPDEKI +PNQK TEK   ENQN EKGKESKTE+VGKN +  K
Subjt:  EDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESK

Query:  I--DTENRSATT
        I   T +R ATT
Subjt:  I--DTENRSATT

XP_023541033.1 proteoglycan 4 [Cucurbita pepo subsp. pepo]1.3e-1144.76Show/hide
Query:  GDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQK
        G+  PP + P    P +              E+ +  ++ K +    Q++    KAEEEAPTPAP  VPPP      P +G SG+ KT PDEKI++PNQK
Subjt:  GDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQK

Query:  LTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT
         TEK    NQN EKGKESKTEEVGKN +  KI   T ++ ATT
Subjt:  LTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT

TrEMBL top hitse value%identityAlignment
A0A6J1DAL2 inactive protein RESTRICTED TEV MOVEMENT 23.2e-0860.24Show/hide
Query:  AEEEAPTPAPAVVPPPVKSPAKGGSGEVKTT-PDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKIDTENRSAT
        A+EEAP    A      K PA+G SGE KTT  DEKI SP++K TEK+EIENQN E+GKESKTEEV KN+EE K+ T   S T
Subjt:  AEEEAPTPAPAVVPPPVKSPAKGGSGEVKTT-PDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKIDTENRSAT

A0A6J1G142 proteoglycan 4 isoform X16.2e-1262.92Show/hide
Query:  KAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT
        KA EEAPTPAP  VPPP      P +G SG+ KTTPDEKI +PNQK TEK   ENQN EKGKESKTE+VGKN +  KI   T ++ ATT
Subjt:  KAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT

A0A6J1G164 proteoglycan 4 isoform X26.2e-1262.92Show/hide
Query:  KAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT
        KA EEAPTPAP  VPPP      P +G SG+ KTTPDEKI +PNQK TEK   ENQN EKGKESKTE+VGKN +  KI   T ++ ATT
Subjt:  KAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKI--DTENRSATT

A0A6J1GZF3 uncharacterized protein LOC1114586273.0e-0645.38Show/hide
Query:  PSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPVKSPAKGGSGEVKTTPDEKISSPNQKL-TEKKEIENQNGEKGKESKTEEVGKN
        P    L +D + S K K +    Q++    KA EE P  A        +   +G SG+ +TT D KISSP+QK  TEKKEIENQN EKG+ESKTEEV KN
Subjt:  PSILLLHEDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPVKSPAKGGSGEVKTTPDEKISSPNQKL-TEKKEIENQNGEKGKESKTEEVGKN

Query:  REESKIDTENRSATTRSEG
           +KID    S  T   G
Subjt:  REESKIDTENRSATTRSEG

A0A6J1HU50 proteoglycan 43.6e-1253.57Show/hide
Query:  EDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESK
        E+ +  ++ K +    Q++    KAEEEAPT AP  VPPP      P +G SG+ KTTPDEKI +PNQK TEK   ENQN EKGKESKTE+VGKN +  K
Subjt:  EDCTISQKFKPRPCSCQRRLRKTKAEEEAPTPAPAVVPPPV---KSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESK

Query:  I--DTENRSATT
        I   T +R ATT
Subjt:  I--DTENRSATT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGGCCGAGCACATGGTCGGCCTCGGCCATGGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCGAGCTCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGG
ATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTATTTATATACCTCTTCGCCACTGAAGAGAGGATCCCGAA
TTCCATCCCTGAACTATATTCTCTATTCTCTGTTTTCTGCTCTTGCTCTTACTTTTCCACGCCCTACCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGC
ACCACACCGAACCGCCTTCATGCACCAGCCCGTCCGTCCATCTTCATCTTCCACAACCCTCCCTTCCCACCATTGCCCCCCTTGAAAGACTTAATCCTCCATTTCCCACC
ATTTTCCACCCCCCAACACAGAGTGTATCCCAATCCCTACCTCTCGCATCGCGCCGGCGACCCACATCCGCCGTGCCGACGACCTTCTTCCGTCGTGCCGTCGAGACTCT
TGCTTGCCCCACATCCTCCATCGATTCTCCTTTTGCACGAGGACTGTACAATTTCTCAGAAATTCAAACCAAGACCGTGTAGCTGCCAGCGCCGTTTGAGGAAAACGAAG
GCTGAGGAAGAAGCTCCGACGCCGGCGCCGGCGGTGGTGCCACCGCCTGTGAAGAGTCCGGCTAAAGGAGGTTCCGGCGAGGTTAAAACAACACCGGATGAGAAAATAAG
CAGCCCGAATCAGAAACTAACAGAGAAGAAAGAAATTGAAAATCAAAACGGAGAAAAGGGGAAGGAATCTAAAACAGAGGAGGTGGGTAAGAATCGAGAAGAGTCGAAGA
TCGACACCGAAAATCGATCCGCGACTACACGGTCAGAAGGATGTCGTTACCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGGCCGAGCACATGGTCGGCCTCGGCCATGGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCGAGCTCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGG
ATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTATTTATATACCTCTTCGCCACTGAAGAGAGGATCCCGAA
TTCCATCCCTGAACTATATTCTCTATTCTCTGTTTTCTGCTCTTGCTCTTACTTTTCCACGCCCTACCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGC
ACCACACCGAACCGCCTTCATGCACCAGCCCGTCCGTCCATCTTCATCTTCCACAACCCTCCCTTCCCACCATTGCCCCCCTTGAAAGACTTAATCCTCCATTTCCCACC
ATTTTCCACCCCCCAACACAGAGTGTATCCCAATCCCTACCTCTCGCATCGCGCCGGCGACCCACATCCGCCGTGCCGACGACCTTCTTCCGTCGTGCCGTCGAGACTCT
TGCTTGCCCCACATCCTCCATCGATTCTCCTTTTGCACGAGGACTGTACAATTTCTCAGAAATTCAAACCAAGACCGTGTAGCTGCCAGCGCCGTTTGAGGAAAACGAAG
GCTGAGGAAGAAGCTCCGACGCCGGCGCCGGCGGTGGTGCCACCGCCTGTGAAGAGTCCGGCTAAAGGAGGTTCCGGCGAGGTTAAAACAACACCGGATGAGAAAATAAG
CAGCCCGAATCAGAAACTAACAGAGAAGAAAGAAATTGAAAATCAAAACGGAGAAAAGGGGAAGGAATCTAAAACAGAGGAGGTGGGTAAGAATCGAGAAGAGTCGAAGA
TCGACACCGAAAATCGATCCGCGACTACACGGTCAGAAGGATGTCGTTACCGGTGA
Protein sequenceShow/hide protein sequence
MAEAEHMVGLGHGPRPTLGPLVRAELVWSRLVPTASGCPGFAWFDLKRLRNPKKARRMNRYLYTSSPLKRGSRIPSLNYILYSLFSALALTFPRPTVLFADLSIGAGVAS
TTPNRLHAPARPSIFIFHNPPFPPLPPLKDLILHFPPFSTPQHRVYPNPYLSHRAGDPHPPCRRPSSVVPSRLLLAPHPPSILLLHEDCTISQKFKPRPCSCQRRLRKTK
AEEEAPTPAPAVVPPPVKSPAKGGSGEVKTTPDEKISSPNQKLTEKKEIENQNGEKGKESKTEEVGKNREESKIDTENRSATTRSEGCRYR