CuGenDBv2

Tan0002028 (gene) of Snake gourd v1 genome

Gene ID: Tan0002028
Organism: Trichosanthes anguina (Snake gourd v1)
Description: glycine-rich cell wall structural protein 2
Genome location: LG01:4216994..4220028
RNA-Seq Expression: Tan0002028
Synteny: Tan0002028
Gene Ontology terms: NA
InterPro domains: NA
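
The genome location field packs the linkage group and both coordinates into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and using a hypothetical helper named `parse_location`, the span of the locus could be computed like this:

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG01:4216994..4220028")
# Assumption: 1-based inclusive coordinates, so the length is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before the value is used downstream.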


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146895.1 glycine-rich cell wall structural protein 2 [Cucumis sativus] (e value: 2.2e-18; identity: 50.67%)
Query:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW---GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG--
        + SSS V  GA+ RKLLNFPDMSW  P+GGG    GNPTG YG  HGP  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G  
Subjt:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW---GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG--

Query:  -------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
               GGGY   + Y    GGG+GGG GG  G+EY   SP TT+D+++
Subjt:  -------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

XP_016901488.1 PREDICTED: glycine-rich cell wall structural protein 2 [Cucumis melo] (e value: 2.6e-19; identity: 51.68%)
Query:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW--GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---
        + SSS V G A+ RKLLNFPDMSW  PNGGG   GNPTG YG  H P  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G   
Subjt:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW--GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---

Query:  ------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
              GGGY   + Y    GGG GGG GG  G+EY   SPTTT+D+++
Subjt:  ------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

XP_022141166.1 putative glycine-rich cell wall structural protein 1 [Momordica charantia] (e value: 2.9e-18; identity: 53.02%)
Query:  ASGRKLLNFPDMSWKPNGG---GWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---------GGGY
        ASGRKLLNFPDMSW P+GG   G GNP G+YG AHGP  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G         GGGY
Subjt:  ASGRKLLNFPDMSWKPNGG---GWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---------GGGY

Query:  WPRTAY-------SGGG----NGGGCGGSGGNEYHSPSPTTTKD-ESKH
           + Y       +GGG     GGG GGSGGNEY   S T TKD  SKH
Subjt:  WPRTAY-------SGGG----NGGGCGGSGGNEYHSPSPTTTKD-ESKH

XP_022942617.1 glycine-rich cell wall structural protein 2-like [Cucurbita moschata] (e value: 1.3e-18; identity: 51.37%)
Query:  MASSSAVHGGASGRKLLNFPDMSWKPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG-----PTH-FKKGYGYGFGP-SSVHQGWG------
        + +SSAV      RKLLNFP MSW PNG G GNPTG YG +HGP  NW  NWG    P S WGYG     P++ F KGYGYGFG  S    GWG      
Subjt:  MASSSAVHGGASGRKLLNFPDMSWKPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG-----PTH-FKKGYGYGFGP-SSVHQGWG------

Query:  ---GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
           GGGY   + Y    GGGNGGG   S G EYH  S T +KD++K
Subjt:  ---GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

XP_038899364.1 glycine-rich cell wall structural protein 2-like [Benincasa hispida] (e value: 9.0e-20; identity: 53.74%)
Query:  ASSSAVHGGASGRKLLNFPDMSW-KPN-GGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFG-PSSVHQGWG-----
        A+SS V G A+ RKLLNFPDMSW  PN GGG GNPTG YG AHGP  NW  NWG    P S WGYG      PT F KGYGYG+G  S    G+G     
Subjt:  ASSSAVHGGASGRKLLNFPDMSW-KPN-GGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFG-PSSVHQGWG-----

Query:  ----GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
            GGGY   + Y    G G+GGG GG  G EY   SPTTTKD+++
Subjt:  ----GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KXP2 Uncharacterized protein (e value: 1.1e-18; identity: 50.67%)
Query:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW---GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG--
        + SSS V  GA+ RKLLNFPDMSW  P+GGG    GNPTG YG  HGP  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G  
Subjt:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW---GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG--

Query:  -------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
               GGGY   + Y    GGG+GGG GG  G+EY   SP TT+D+++
Subjt:  -------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

A0A1S4DZT0 glycine-rich cell wall structural protein 2 (e value: 1.3e-19; identity: 51.68%)
Query:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW--GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---
        + SSS V G A+ RKLLNFPDMSW  PNGGG   GNPTG YG  H P  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G   
Subjt:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW--GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---

Query:  ------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
              GGGY   + Y    GGG GGG GG  G+EY   SPTTT+D+++
Subjt:  ------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

A0A5A7TRC6 Glycine-rich cell wall structural protein 2 (e value: 1.3e-19; identity: 51.68%)
Query:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW--GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---
        + SSS V G A+ RKLLNFPDMSW  PNGGG   GNPTG YG  H P  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G   
Subjt:  MASSSAVHGGASGRKLLNFPDMSW-KPNGGGW--GNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---

Query:  ------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
              GGGY   + Y    GGG GGG GG  G+EY   SPTTT+D+++
Subjt:  ------GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

A0A6J1CI83 putative glycine-rich cell wall structural protein 1 (e value: 1.4e-18; identity: 53.02%)
Query:  ASGRKLLNFPDMSWKPNGG---GWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---------GGGY
        ASGRKLLNFPDMSW P+GG   G GNP G+YG AHGP  NW  NWG    P S WG+G      PT F KGYGYGFG  S    G+G         GGGY
Subjt:  ASGRKLLNFPDMSWKPNGG---GWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP-SSVHQGWG---------GGGY

Query:  WPRTAY-------SGGG----NGGGCGGSGGNEYHSPSPTTTKD-ESKH
           + Y       +GGG     GGG GGSGGNEY   S T TKD  SKH
Subjt:  WPRTAY-------SGGG----NGGGCGGSGGNEYHSPSPTTTKD-ESKH

A0A6J1FQS0 glycine-rich cell wall structural protein 2-like (e value: 6.3e-19; identity: 51.37%)
Query:  MASSSAVHGGASGRKLLNFPDMSWKPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG-----PTH-FKKGYGYGFGP-SSVHQGWG------
        + +SSAV      RKLLNFP MSW PNG G GNPTG YG +HGP  NW  NWG    P S WGYG     P++ F KGYGYGFG  S    GWG      
Subjt:  MASSSAVHGGASGRKLLNFPDMSWKPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG-----PTH-FKKGYGYGFGP-SSVHQGWG------

Query:  ---GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK
           GGGY   + Y    GGGNGGG   S G EYH  S T +KD++K
Subjt:  ---GGGYWPRTAY---SGGGNGGGCGGSGGNEYHSPSPTTTKDESK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G61660.1 glycine-rich protein (e value: 7.4e-04; identity: 39.58%)
Query:  KPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP---SSVHQGWGGGGYWPRTAYSGGGNGGGCGGSGG
        +P     GN  G+ G   GP  NW  NWG    P S WGYG      PT + +G GYG+G    S    G+G GG   R    G G+G G  G GG
Subjt:  KPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYG------PTHFKKGYGYGFGP---SSVHQGWGGGGYWPRTAYSGGGNGGGCGGSGG


Sequences
CDS sequence
ATGGCATCTTCCTCCGCCGTCCACGGCGGCGCTAGTGGCCGAAAACTGCTCAACTTTCCAGACATGTCTTGGAAACCCAATGGAGGTGGATGGGGGAACCCCACTGGGGA
CTATGGCCATGCCCATGGTCCTAACTCTAACTGGTACGACAATTGGGGTTCGAGTTTCGGACCGAGTAGCGACTGGGGTTACGGCCCAACCCATTTCAAAAAAGGTTATG
GATATGGATTTGGACCGAGTTCAGTGCATCAAGGTTGGGGCGGCGGTGGATATTGGCCCAGAACCGCCTACAGCGGCGGCGGCAATGGTGGCGGATGCGGTGGTTCGGGT
GGCAACGAGTATCATTCACCTTCACCAACCACTACAAAGGACGAAAGCAAGCATGAGTAG
mRNA sequence
ATGGCATCTTCCTCCGCCGTCCACGGCGGCGCTAGTGGCCGAAAACTGCTCAACTTTCCAGACATGTCTTGGAAACCCAATGGAGGTGGATGGGGGAACCCCACTGGGGA
CTATGGCCATGCCCATGGTCCTAACTCTAACTGGTACGACAATTGGGGTTCGAGTTTCGGACCGAGTAGCGACTGGGGTTACGGCCCAACCCATTTCAAAAAAGGTTATG
GATATGGATTTGGACCGAGTTCAGTGCATCAAGGTTGGGGCGGCGGTGGATATTGGCCCAGAACCGCCTACAGCGGCGGCGGCAATGGTGGCGGATGCGGTGGTTCGGGT
GGCAACGAGTATCATTCACCTTCACCAACCACTACAAAGGACGAAAGCAAGCATGAGTAG
Protein sequence
MASSSAVHGGASGRKLLNFPDMSWKPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYGPTHFKKGYGYGFGPSSVHQGWGGGGYWPRTAYSGGGNGGGCGGSG
GNEYHSPSPTTTKDESKHE
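
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is illustrative and not part of CuGenDB. It also reports the glycine fraction of the translated protein.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein sequences copied from the record above.
CDS = (
    "ATGGCATCTTCCTCCGCCGTCCACGGCGGCGCTAGTGGCCGAAAACTGCTCAACTTTCCAGACATGTCTTGGAAACCCAATGGAGGTGGATGGGGGAACCCCACTGGGGA"
    "CTATGGCCATGCCCATGGTCCTAACTCTAACTGGTACGACAATTGGGGTTCGAGTTTCGGACCGAGTAGCGACTGGGGTTACGGCCCAACCCATTTCAAAAAAGGTTATG"
    "GATATGGATTTGGACCGAGTTCAGTGCATCAAGGTTGGGGCGGCGGTGGATATTGGCCCAGAACCGCCTACAGCGGCGGCGGCAATGGTGGCGGATGCGGTGGTTCGGGT"
    "GGCAACGAGTATCATTCACCTTCACCAACCACTACAAAGGACGAAAGCAAGCATGAGTAG"
)
PROTEIN = (
    "MASSSAVHGGASGRKLLNFPDMSWKPNGGGWGNPTGDYGHAHGPNSNWYDNWGSSFGPSSDWGYGPTHFKKGYGYGFGPSSVHQGWGGGGYWPRTAYSGGGNGGGCGGSG"
    "GNEYHSPSPTTTKDESKHE"
)


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


translated = translate(CDS)
print("translation matches record:", translated == PROTEIN)

glycine_fraction = translated.count("G") / len(translated)
print(f"glycine fraction: {glycine_fraction:.2f}")
```

A high glycine fraction would be in line with the "glycine-rich" description, although a composition check of this kind says nothing about the protein's function.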