; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020866 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020866
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionglycine-rich protein A3
Genome locationchr7:2694165..2695404
RNA-Seq ExpressionLag0020866
SyntenyLag0020866
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582545.1 hypothetical protein SDJN03_22547, partial [Cucurbita argyrosperma subsp. sororia]1.7e-5777.65Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP
        MGGGKE +S DKGLFS MA+FAAG HY HPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPA YPP GG+P  A           YP HG YPPAGYP
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP

Query:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHG-GKFKKWK
         PHHY GHG    GMG +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKHG GKFKKWK
Subjt:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHG-GKFKKWK

XP_008438206.1 PREDICTED: glycine-rich protein A3 [Cucumis melo]9.9e-6182.72Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHG
        MGGGKEN+  DKGLFS+MAAFAAGHHYPH HGY PPPPY GAGYPPPGGYPPA      GYPPAGYPP+GGHPHTAYPY GGYPPAGYPGPHHY G+GHG
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHG

Query:  H-HGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHGGK---FKKWK
        H HG+G +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKHGGK   FKKWK
Subjt:  H-HGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHGGK---FKKWK

XP_022147022.1 glycine-rich protein A3 [Momordica charantia]2.0e-6181.6Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGY-PPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGH
        MGGGKE DS+++GLFSH+ AFAAGHHYPH HGY PPPPPY GAGYPPPGGYPPAGYPPPGGYPPAGY PHGGHP +AYPY GGYPPAGYPGPHH+ GHG 
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGY-PPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGH

Query:  GHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGF----FGHGKFKHGKFGKRWKHGGKFKKWK
          HGMG MLAGGAAAAAAAYGAHHL  H RP+GF    F HGKFKHGKFGKRWKHGGKF K+K
Subjt:  GHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGF----FGHGKFKHGKFGKRWKHGGKFKKWK

XP_022924320.1 glycine-rich protein A3-like [Cucurbita moschata]1.7e-5777.65Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP
        MGGGKE +S DKGLFS MA+FAAG HY HPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPA YPP GG+P  A           YP HG YPPAGYP
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP

Query:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHG-GKFKKWK
         PHHY GHG    GMG +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKHG GKFKKWK
Subjt:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHG-GKFKKWK

XP_022980434.1 glycine-rich protein A3-like [Cucurbita maxima]1.7e-5777.65Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP
        MGGGKE +S DKGLFS MA+FAAG HY HPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPA YPP GG+P  A           YP HG YPPAGYP
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP

Query:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKH-GGKFKKWK
         PHHY GHG    GMG +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKH GGKFKKWK
Subjt:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKH-GGKFKKWK

TrEMBL top hitse value%identityAlignment
A0A0A0L4F2 Uncharacterized protein2.7e-5677.11Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHG
        MGGGKEN+  DKGLFS+MAAFAAGHHYPH HGY PPPPY GA YPPPGGYPP       GYPP GYPP+GGHPHTAYP  GGYPPAGYPGPHHY G+GHG
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHG

Query:  H---HGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGF----FGHGKFKHGKFGKRWKHGG-KFKKWK
        +   HG+G +LAGGAAAAAAAYGAHHLA H RP+GF    F HGKFKHGKFGKRWKHGG +FKKWK
Subjt:  H---HGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGF----FGHGKFKHGKFGKRWKHGG-KFKKWK

A0A1S3AWG9 glycine-rich protein A34.8e-6182.72Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHG
        MGGGKEN+  DKGLFS+MAAFAAGHHYPH HGY PPPPY GAGYPPPGGYPPA      GYPPAGYPP+GGHPHTAYPY GGYPPAGYPGPHHY G+GHG
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHG

Query:  H-HGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHGGK---FKKWK
        H HG+G +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKHGGK   FKKWK
Subjt:  H-HGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHGGK---FKKWK

A0A6J1D180 glycine-rich protein A39.6e-6281.6Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGY-PPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGH
        MGGGKE DS+++GLFSH+ AFAAGHHYPH HGY PPPPPY GAGYPPPGGYPPAGYPPPGGYPPAGY PHGGHP +AYPY GGYPPAGYPGPHH+ GHG 
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGY-PPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGH

Query:  GHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGF----FGHGKFKHGKFGKRWKHGGKFKKWK
          HGMG MLAGGAAAAAAAYGAHHL  H RP+GF    F HGKFKHGKFGKRWKHGGKF K+K
Subjt:  GHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGF----FGHGKFKHGKFGKRWKHGGKFKKWK

A0A6J1E983 glycine-rich protein A3-like8.4e-5877.65Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP
        MGGGKE +S DKGLFS MA+FAAG HY HPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPA YPP GG+P  A           YP HG YPPAGYP
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP

Query:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHG-GKFKKWK
         PHHY GHG    GMG +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKHG GKFKKWK
Subjt:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHG-GKFKKWK

A0A6J1IZ97 glycine-rich protein A3-like8.4e-5877.65Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP
        MGGGKE +S DKGLFS MA+FAAG HY HPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPA YPP GG+P  A           YP HG YPPAGYP
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTA-----------YPYHGGYPPAGYP

Query:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKH-GGKFKKWK
         PHHY GHG    GMG +LAGGAAAAAAAYGAHHLA H RP+G FGHGKFKHGKFGKRWKH GGKFKKWK
Subjt:  GPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKH-GGKFKKWK

SwissProt top hitse value%identityAlignment
O16005 Rhodopsin2.7e-0558.06Show/hide
Query:  YPPPPPYAGAGYPPPGGYPPAGYPPP---GGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGP
        YPP        YPP GGYPP GYPPP   GGYPP GYPP    P   YP   GYPP GYP P
Subjt:  YPPPPPYAGAGYPPPGGYPPAGYPPP---GGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGP

P24639 Annexin A74.0e-0456.06Show/hide
Query:  YPHPHGYPPPPPY-AGAGYPPPGGYPP-AGYPPPGGYPP-AGYPPHGGHPHTAYPYHGGYPPAGYP
        YP   GYPP   Y    GYPP  GYPP  GYPP  GYPP  GYPP  G+P   YP   GYPP G P
Subjt:  YPHPHGYPPPPPY-AGAGYPPPGGYPP-AGYPPPGGYPP-AGYPPHGGHPHTAYPYHGGYPPAGYP

P37705 Glycine-rich protein A39.7e-1951.63Show/hide
Query:  MGGGKENDSNDKGLFSHMA-AFAAGHHYPHPHGYPPPP-PYAGAGYPPP-GGYPPAGYPPP-GGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHG
        MGGG  ++  DKGLFS++A   A G HYP P  YPP    Y   GYPP  GGYPP GYPP  GGYPP GYPP G          GGYPP GYP   H+ G
Subjt:  MGGGKENDSNDKGLFSHMA-AFAAGHHYPHPHGYPPPP-PYAGAGYPPP-GGYPPAGYPPP-GGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHG

Query:  ----HGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFG
            H  GH G+  M+AGG AAAAAAYG HH+      +G  GHG + HG  G
Subjt:  ----HGHGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFG

Arabidopsis top hitse value%identityAlignment
AT1G31750.1 proline-rich family protein1.8e-2051.19Show/hide
Query:  GGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPP---PPPYAGAGYPPPGGYPPAGY-PPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHG
        G  ++   DK  FS        HH  H HGYPP   PPP  GA YPPPGGYPP GY PPP GYPPA YPP            G YPPAGYPGP       
Subjt:  GGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPP---PPPYAGAGYPPPGGYPPAGY-PPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHG

Query:  HGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFG----KRWKH----GGKFKKWK
         G  G+G ++AG A AAAAA G HH  HHG  YG  GHGK+K G FG    KR KH    GGK+K+ K
Subjt:  HGHHGMGPMLAGGAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFG----KRWKH----GGKFKKWK

AT4G19200.1 proline-rich family protein1.5e-2247.15Show/hide
Query:  MGGGKE--NDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYP-PHGGHPHTAYPYHGGYPPAGYPGPHHYHGH
        MGGGK+  +D  +KG       F  G HYP   G  PP      GYPP  GYPPAG  PP GYPP  YP   GG+P    P  GGYPPAGYP P  +H  
Subjt:  MGGGKE--NDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYP-PHGGHPHTAYPYHGGYPPAGYPGPHHYHGH

Query:  GHGHHGMGPMLAGGAAAAAAAYGAHHLAH-----------HG-------RPYGFFGHGKFKHGKFGKRWKH--------------GGKFKKWK
        GH   G+G M+AG A AAAAAYGAHH+ H           HG         +G  GHGKFKHGK G ++KH              GGKFKKWK
Subjt:  GHGHHGMGPMLAGGAAAAAAAYGAHHLAH-----------HG-------RPYGFFGHGKFKHGKFGKRWKH--------------GGKFKKWK

AT5G17650.1 glycine/proline-rich protein1.3e-2650Show/hide
Query:  GGKENDSNDKGLFSHMAAFAAGHHYPHPHG-----------YPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGP
        G  +++ +D+G F ++A FA G + PH HG           YP PPP      PPP GYPP  YPP GGYPPAGYPP  G+P   YP H GYP  GYP P
Subjt:  GGKENDSNDKGLFSHMAAFAAGHHYPHPHG-----------YPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGP

Query:  HHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAH----------HGRPYGFFGHGKFKHGKF--GKRWKHG------GK-FKKWK
         H    GH H G+G ++AGG AAAA   GAHH++H          HG  YG+ GHGKFKHGKF  GK  KHG      GK FKKWK
Subjt:  HHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAH----------HGRPYGFFGHGKFKHGKF--GKRWKHG------GK-FKKWK

AT5G45350.1 proline-rich family protein4.6e-2450.75Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGY------PPPGGYPPAGYPP------PGGYPPA----GYPP---HGGHPHTAYPYHG
        MGG  +ND  DKG           H YP P GYPPP  Y  AGY      PPPG YPPAGYPP      PGGYPPA    GYPP   +GG+P    P HG
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGY------PPPGGYPPAGYPP------PGGYPPA----GYPP---HGGHPHTAYPYHG

Query:  GYPPAGYPGPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAH----------------HGRPYGF-FGHGKFKHGKFGKRWKH-------GGKFKKW
        GYPPAGYP   H+ GH     G+G M+AG    AAAAYGAHH+AH                HG  YG+  GHGKFKHGK GK +KH       GGKFKKW
Subjt:  GYPPAGYPGPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAH----------------HGRPYGF-FGHGKFKHGKFGKRWKH-------GGKFKKW

Query:  K
        K
Subjt:  K

AT5G45350.2 proline-rich family protein4.6e-2450.75Show/hide
Query:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGY------PPPGGYPPAGYPP------PGGYPPA----GYPP---HGGHPHTAYPYHG
        MGG  +ND  DKG           H YP P GYPPP  Y  AGY      PPPG YPPAGYPP      PGGYPPA    GYPP   +GG+P    P HG
Subjt:  MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGY------PPPGGYPPAGYPP------PGGYPPA----GYPP---HGGHPHTAYPYHG

Query:  GYPPAGYPGPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAH----------------HGRPYGF-FGHGKFKHGKFGKRWKH-------GGKFKKW
        GYPPAGYP   H+ GH     G+G M+AG    AAAAYGAHH+AH                HG  YG+  GHGKFKHGK GK +KH       GGKFKKW
Subjt:  GYPPAGYPGPHHYHGHGHGHHGMGPMLAGGAAAAAAAYGAHHLAH----------------HGRPYGF-FGHGKFKHGKFGKRWKH-------GGKFKKW

Query:  K
        K
Subjt:  K


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGGAAAGGAAAATGATAGTAATGACAAAGGCCTGTTTTCACATATGGCGGCGTTTGCTGCAGGGCACCACTATCCTCATCCTCATGGATATCCACCACCACC
GCCATACGCCGGAGCCGGATATCCCCCTCCGGGAGGGTACCCCCCAGCTGGGTATCCCCCTCCCGGCGGATATCCTCCGGCTGGCTATCCTCCTCACGGTGGACACCCAC
ATACAGCCTATCCTTATCATGGCGGATACCCTCCCGCTGGCTATCCTGGCCCCCATCATTACCATGGCCATGGACACGGACACCACGGCATGGGGCCAATGTTGGCTGGT
GGAGCCGCTGCTGCAGCCGCTGCTTACGGTGCTCATCATCTAGCTCATCATGGACGCCCATATGGCTTCTTTGGTCACGGAAAATTCAAACATGGGAAATTTGGCAAGCG
TTGGAAGCATGGAGGCAAGTTCAAGAAATGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGGAAAGGAAAATGATAGTAATGACAAAGGCCTGTTTTCACATATGGCGGCGTTTGCTGCAGGGCACCACTATCCTCATCCTCATGGATATCCACCACCACC
GCCATACGCCGGAGCCGGATATCCCCCTCCGGGAGGGTACCCCCCAGCTGGGTATCCCCCTCCCGGCGGATATCCTCCGGCTGGCTATCCTCCTCACGGTGGACACCCAC
ATACAGCCTATCCTTATCATGGCGGATACCCTCCCGCTGGCTATCCTGGCCCCCATCATTACCATGGCCATGGACACGGACACCACGGCATGGGGCCAATGTTGGCTGGT
GGAGCCGCTGCTGCAGCCGCTGCTTACGGTGCTCATCATCTAGCTCATCATGGACGCCCATATGGCTTCTTTGGTCACGGAAAATTCAAACATGGGAAATTTGGCAAGCG
TTGGAAGCATGGAGGCAAGTTCAAGAAATGGAAGTGA
Protein sequenceShow/hide protein sequence
MGGGKENDSNDKGLFSHMAAFAAGHHYPHPHGYPPPPPYAGAGYPPPGGYPPAGYPPPGGYPPAGYPPHGGHPHTAYPYHGGYPPAGYPGPHHYHGHGHGHHGMGPMLAG
GAAAAAAAYGAHHLAHHGRPYGFFGHGKFKHGKFGKRWKHGGKFKKWK