CuGenDBv2

Tan0002824 (gene) of Snake gourd v1 genome

Gene ID: Tan0002824
Organism: Trichosanthes anguina (Snake gourd v1)
Description: glycine-rich protein A3
Genome location: LG10:3244975..3246195
RNA-Seq Expression: Tan0002824
Synteny: Tan0002824
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582545.1 hypothetical protein SDJN03_22547, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-52; % identity: 72.35)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGH--------
        MGGGK++++ DKG+FS MA+FAAG  H++HPHGYPPPP     GYPPPGGYPPAGYPPP GYPPA YPP GG+P  AYP  GGYPP+GYP H        
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGH--------

Query:  ---HHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK
           HHYPGHG    GMGGLLAGGAAAAAAAYGAHHL HARPFG+G HGKFKHGKFGKRWKH  GKFKKWK
Subjt:  ---HHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK

XP_004133933.1 glycine-rich protein A3 [Cucumis sativus] (e value: 1.2e-52; % identity: 73.33)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYG---YPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP-----
        MGGGK+ +  DKG+FS+MAAFAAG HH+ H HGYPPPPYG   YPPPGGYPP GYPP        YPP+GGHPHTAYP  GGYPP+GYPG HHYP     
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYG---YPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP-----

Query:  -GHGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGGKFKKWK
         GHGHG+GGLLAGGAAAAAAAYGAHHL HARPFG+GH    HGKFKHGKFGKRWKH G +FKKWK
Subjt:  -GHGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGGKFKKWK

XP_008438206.1 PREDICTED: glycine-rich protein A3 [Cucumis melo] (e value: 7.8e-55; % identity: 77.02)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPY---GYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP----G
        MGGGK+ +  DKG+FS+MAAFAAG HH+ H HGYPPPPY   GYPPPGGY      PPAGYPPA YPP+GGHPHTAYPY GGYPP+GYPG HHYP    G
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPY---GYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP----G

Query:  HGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHG--GKFKKWK
        HGHG+GGLLAGGAAAAAAAYGAHHL HARPFG+G HGKFKHGKFGKRWKH G  G+FKKWK
Subjt:  HGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHG--GKFKKWK

XP_022147022.1 glycine-rich protein A3 [Momordica charantia] (e value: 5.8e-58; % identity: 76.69)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP-----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYPGHG
        MGGGK+KD+D++G+FSH+ AFAAG HH+ H HGYPPPP      GYPPPGGYPPAGYPPP GYPPA Y PHGGHP +AYPY GGYPP+GYPG HH+PGHG
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP-----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYPGHG

Query:  HGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGG--KFKKWK
        HGMG +LAGGAAAAAAAYGAHHLVHARPFG+GH    HGKFKHGKFGKRWKH G   KFK+WK
Subjt:  HGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGG--KFKKWK

XP_022980434.1 glycine-rich protein A3-like [Cucurbita maxima] (e value: 1.5e-53; % identity: 72.94)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGH--------
        MGGGK++++ DKG+FS MA+FAAG  H++HPHGYPPPP     GYPPPGGYPPAGYPPP GYPPA YPP GG+P  AYP  GGYPP+GYP H        
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGH--------

Query:  ---HHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK
           HHYPGHG    GMGGLLAGGAAAAAAAYGAHHL HARPFG+G HGKFKHGKFGKRWKH GGKFKKWK
Subjt:  ---HHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4F2 Uncharacterized protein (e value: 6.0e-53; % identity: 73.33)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYG---YPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP-----
        MGGGK+ +  DKG+FS+MAAFAAG HH+ H HGYPPPPYG   YPPPGGYPP GYPP        YPP+GGHPHTAYP  GGYPP+GYPG HHYP     
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYG---YPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP-----

Query:  -GHGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGGKFKKWK
         GHGHG+GGLLAGGAAAAAAAYGAHHL HARPFG+GH    HGKFKHGKFGKRWKH G +FKKWK
Subjt:  -GHGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGGKFKKWK

A0A1S3AWG9 glycine-rich protein A3 (e value: 3.8e-55; % identity: 77.02)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPY---GYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP----G
        MGGGK+ +  DKG+FS+MAAFAAG HH+ H HGYPPPPY   GYPPPGGY      PPAGYPPA YPP+GGHPHTAYPY GGYPP+GYPG HHYP    G
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPY---GYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYP----G

Query:  HGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHG--GKFKKWK
        HGHG+GGLLAGGAAAAAAAYGAHHL HARPFG+G HGKFKHGKFGKRWKH G  G+FKKWK
Subjt:  HGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHG--GKFKKWK

A0A6J1D180 glycine-rich protein A3 (e value: 2.8e-58; % identity: 76.69)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP-----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYPGHG
        MGGGK+KD+D++G+FSH+ AFAAG HH+ H HGYPPPP      GYPPPGGYPPAGYPPP GYPPA Y PHGGHP +AYPY GGYPP+GYPG HH+PGHG
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP-----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYPGHG

Query:  HGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGG--KFKKWK
        HGMG +LAGGAAAAAAAYGAHHLVHARPFG+GH    HGKFKHGKFGKRWKH G   KFK+WK
Subjt:  HGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGH----HGKFKHGKFGKRWKHHGG--KFKKWK

A0A6J1E983 glycine-rich protein A3-like (e value: 2.3e-52; % identity: 71.76)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTA-----------YPYHGGYPPSGY
        MGGGK++++ DKG+FS MA+FAAG  H++HPHGYPPPP     GYPPPGGYPPAGYPPP GYPPA YPP GG+P  A           YP HG YPP+GY
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTA-----------YPYHGGYPPSGY

Query:  PGHHHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK
        P  HHYPGHG    GMGGLLAGGAAAAAAAYGAHHL HARPFG+G HGKFKHGKFGKRWKH  GKFKKWK
Subjt:  PGHHHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK

A0A6J1IZ97 glycine-rich protein A3-like (e value: 7.1e-54; % identity: 72.94)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGH--------
        MGGGK++++ DKG+FS MA+FAAG  H++HPHGYPPPP     GYPPPGGYPPAGYPPP GYPPA YPP GG+P  AYP  GGYPP+GYP H        
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPP----YGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGH--------

Query:  ---HHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK
           HHYPGHG    GMGGLLAGGAAAAAAAYGAHHL HARPFG+G HGKFKHGKFGKRWKH GGKFKKWK
Subjt:  ---HHYPGHGH---GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK

SwissProt top hits (e value, % identity, alignment)
P37705 Glycine-rich protein A3 (e value: 3.2e-19; % identity: 49.67)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPP-----PPYGYPPP-GGYPPAGYPPP-AGYPPACYPPHGGHPHTAYPYHGGYPPSGYP--GHH--
        MGGG   +  DKG+FS++A   AG  H+  P  YPP     PP GYPP  GGYPP GYPP   GYPP  YPP G          GGYPP GYP  GHH  
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPP-----PPYGYPPP-GGYPPAGYPPP-AGYPPACYPPHGGHPHTAYPYHGGYPPSGYP--GHH--

Query:  ----HYPGHGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFG
            H+ GHG G+ G++AGG AAAAAAYG HH+       +G HG + HG  G
Subjt:  ----HYPGHGHGMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFG

Arabidopsis top hits (e value, % identity, alignment)
AT1G31750.1 proline-rich family protein (e value: 2.1e-21; % identity: 50.92)
Query:  GGKDKDTDDKGMFSHMAAFAAGAHHHAH---PHGYPPPPYG-YPPPGGYPPAGYPPPA-GYPPACYPPHGGHPHTAYPYHGGYPPSGYPG-HHHYPGHGH
        G  D    DK  FSH        +HH H   P  YPPPP G YPPPGGYPP GYPPP  GYPPA YPP            G YPP+GYPG     PG G 
Subjt:  GGKDKDTDDKGMFSHMAAFAAGAHHHAH---PHGYPPPPYG-YPPPGGYPPAGYPPPA-GYPPACYPPHGGHPHTAYPYHGGYPPSGYPG-HHHYPGHGH

Query:  GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFG----KRWKHH---GGKFKKWK
        G+GGL+AG A AAAAA G HH  H   +G+  HGK+K G FG    KR KH    GGK+K+ K
Subjt:  GMGGLLAGGAAAAAAAYGAHHLVHARPFGYGHHGKFKHGKFG----KRWKHH---GGKFKKWK

AT4G19200.1 proline-rich family protein (e value: 2.4e-25; % identity: 48.66)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYPPPAGYPPACYP-PHGGHPHTAYPYHGGYPPSGY--PGHHHYPGHGHG
        MGGGKDK  D++    H   F  G H+     GY  PP GYPP  GYPPAG  PPAGYPP  YP   GG+P    P  GGYPP+GY  PG HH    G G
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYPPPAGYPPACYP-PHGGHPHTAYPYHGGYPPSGY--PGHHHYPGHGHG

Query:  MGGLLAGGAAAAAAAYGAHHLVH-------------------ARPFGYGHHGKFKHGKFGKRWKH-------------HGGKFKKWK
        +GG++AG A AAAAAYGAHH+ H                   A  FG+G HGKFKHGK G ++KH              GGKFKKWK
Subjt:  MGGLLAGGAAAAAAAYGAHHLVH-------------------ARPFGYGHHGKFKHGKFGKRWKH-------------HGGKFKKWK

AT5G17650.1 glycine/proline-rich protein (e value: 2.1e-21; % identity: 45.86)
Query:  GGKDKDTDDKGMFSHMAAFAAGAH--------HHAHPHG----YPPPPYGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHH
        G    +  D+G F ++A FA G +        HH H +G    YPPP    PPP GYPP  YPP  GYPPA YPP  G+P   YP H GYP  GYP   H
Subjt:  GGKDKDTDDKGMFSHMAAFAAGAH--------HHAHPHG----YPPPPYGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHH

Query:  YPGHGHGMGGLLAGGAAAAAAAYGAHHLV---------HARPFGYGH-------HGKFKHGKFGKR---WKHHGGKFKKWK
           H  G+G ++AGG AAAA   GAHH+          H   +GYG+       HGKFKHGKFGK     KH G  FKKWK
Subjt:  YPGHGHGMGGLLAGGAAAAAAAYGAHHLV---------HARPFGYGH-------HGKFKHGKFGKR---WKHHGGKFKKWK

AT5G45350.1 proline-rich family protein (e value: 8.4e-23; % identity: 45.1)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYP-----------PPAGYPPACYPP----------HGGHP-------HT
        MGG  D D  DKG                  HGYPP   GYPPPG YPPAGYP           PPAGYPP  YPP          +GG+P       + 
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYP-----------PPAGYPPACYPP----------HGGHP-------HT

Query:  AYPYHGGYPPSGYPGHHHYPGHGHGMGGLLAGGAAAAAAAYGAHHLVHA-----------------RPFGYGH-HGKFKHGKFGKRWKH------HGGKF
          P HGGYPP+GYP HH   GH  G+GG++AG    AAAAYGAHH+ H+                   +GYGH HGKFKHGK GK +KH       GGKF
Subjt:  AYPYHGGYPPSGYPGHHHYPGHGHGMGGLLAGGAAAAAAAYGAHHLVHA-----------------RPFGYGH-HGKFKHGKFGKRWKH------HGGKF

Query:  KKWK
        KKWK
Subjt:  KKWK

AT5G45350.2 proline-rich family protein (e value: 8.4e-23; % identity: 45.1)
Query:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYP-----------PPAGYPPACYPP----------HGGHP-------HT
        MGG  D D  DKG                  HGYPP   GYPPPG YPPAGYP           PPAGYPP  YPP          +GG+P       + 
Subjt:  MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYP-----------PPAGYPPACYPP----------HGGHP-------HT

Query:  AYPYHGGYPPSGYPGHHHYPGHGHGMGGLLAGGAAAAAAAYGAHHLVHA-----------------RPFGYGH-HGKFKHGKFGKRWKH------HGGKF
          P HGGYPP+GYP HH   GH  G+GG++AG    AAAAYGAHH+ H+                   +GYGH HGKFKHGK GK +KH       GGKF
Subjt:  AYPYHGGYPPSGYPGHHHYPGHGHGMGGLLAGGAAAAAAAYGAHHLVHA-----------------RPFGYGH-HGKFKHGKFGKRWKH------HGGKF

Query:  KKWK
        KKWK
Subjt:  KKWK


Sequences
CDS sequence
ATGGGAGGTGGGAAGGACAAGGACACTGATGACAAAGGCATGTTTTCACATATGGCGGCGTTTGCTGCAGGGGCGCACCACCATGCTCATCCTCATGGATATCCACCGCC
GCCATATGGATATCCCCCTCCGGGAGGGTACCCTCCGGCTGGATATCCCCCACCCGCCGGGTATCCTCCGGCTTGCTATCCTCCTCACGGTGGACACCCTCATACAGCCT
ATCCATATCATGGCGGATACCCTCCCTCAGGCTATCCTGGCCACCATCATTACCCTGGCCATGGACACGGCATGGGGGGATTGTTGGCTGGTGGAGCAGCTGCTGCAGCC
GCTGCATACGGTGCTCATCATCTTGTTCATGCACGCCCATTTGGCTATGGTCATCACGGAAAGTTCAAACATGGGAAATTTGGCAAGCGTTGGAAGCATCATGGAGGCAA
GTTCAAGAAATGGAAGTGA
mRNA sequence
ATGGGAGGTGGGAAGGACAAGGACACTGATGACAAAGGCATGTTTTCACATATGGCGGCGTTTGCTGCAGGGGCGCACCACCATGCTCATCCTCATGGATATCCACCGCC
GCCATATGGATATCCCCCTCCGGGAGGGTACCCTCCGGCTGGATATCCCCCACCCGCCGGGTATCCTCCGGCTTGCTATCCTCCTCACGGTGGACACCCTCATACAGCCT
ATCCATATCATGGCGGATACCCTCCCTCAGGCTATCCTGGCCACCATCATTACCCTGGCCATGGACACGGCATGGGGGGATTGTTGGCTGGTGGAGCAGCTGCTGCAGCC
GCTGCATACGGTGCTCATCATCTTGTTCATGCACGCCCATTTGGCTATGGTCATCACGGAAAGTTCAAACATGGGAAATTTGGCAAGCGTTGGAAGCATCATGGAGGCAA
GTTCAAGAAATGGAAGTGA
Protein sequence
MGGGKDKDTDDKGMFSHMAAFAAGAHHHAHPHGYPPPPYGYPPPGGYPPAGYPPPAGYPPACYPPHGGHPHTAYPYHGGYPPSGYPGHHHYPGHGHGMGGLLAGGAAAAA
AAYGAHHLVHARPFGYGHHGKFKHGKFGKRWKHHGGKFKKWK
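
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `translate` and `CODON_TABLE` are illustrative and are not part of CuGenDB. To keep the example short it translates only the first 33 and last 15 nucleotides of the CDS, both of which are in frame.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Ignore any trailing one or two bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS sequence above (hypothetical variable names).
cds_start = "ATGGGAGGTGGGAAGGACAAGGACACTGATGAC"  # first 33 nucleotides
cds_end = "AAGAAATGGAAGTGA"                     # last 15 nucleotides

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS (with line breaks, which the function strips) should give a string that can be compared directly with the protein sequence on this page.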