; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015887 (gene) of Snake gourd v1 genome

Gene IDTan0015887
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglycine-rich cell wall structural protein 2-like
Genome locationLG09:67850809..67851926
RNA-Seq ExpressionTan0015887
SyntenyTan0015887
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589746.1 hypothetical protein SDJN03_15169, partial [Cucurbita argyrosperma subsp. sororia]2.1e-3572.83Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---
        M SIR +AVLC V+CMSAIESQGR+ RKDLGL+LGGLGVG+GVG+GLGLG G GSGSGSGSGSGSGSGS SSS SSS S   S  GSEAGSYAGSYA   
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---

Query:  -RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGG--GEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYREGGNN
          S SGRN RNGGSG    YGEGS RG G G GG  GEGYGEGRGYGEGSGR           EGYGEGRGYGEGR Y EGGNN
Subjt:  -RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGG--GEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYREGGNN

XP_022135257.1 glycine-rich cell wall structural protein 2-like [Momordica charantia]3.6e-2770.89Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---
        MASIR +A +C V+C  AIESQ R+ RKDLGL+LGGLG+G+G GI  GLG+GGGSGSG+G+GSGSGSGSGS SSSSSHSSSSSYGGS AGS AGSYA   
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---

Query:  ---RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD
            SGSGRN RNGGSGSGS YGEGS RG+G G G G GYGEGRGYG     G GG +
Subjt:  ---RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD

XP_022988596.1 glycine-rich cell wall structural protein 2-like isoform X3 [Cucurbita maxima]2.1e-2761.46Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---------------
        MASIR +++L L+L +S I S+ R+ R DLGL+LGG+GVG+G G+GLGLG  GGSGSGSGSGSGSGSGS SSSSS S SSSS  G               
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---------------

Query:  ------GSEAGSYAGSYA----RSGSGRNDRNGGSGSGSEYGEGSDRGSGSG--DGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYR
              GSEAGSYAGSYA     S SGRN RNGGSG    YGEGS RG G G   GGGEGYGEGRGYGEGSGR           EGYG GRGYGEGR Y 
Subjt:  ------GSEAGSYAGSYA----RSGSGRNDRNGGSGSGSEYGEGSDRGSGSG--DGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYR

Query:  EGGNN
        EGGNN
Subjt:  EGGNN

XP_023515819.1 putative glycine-rich cell wall structural protein 1 isoform X3 [Cucurbita pepo subsp. pepo]1.3e-2474.84Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---GSEAGSYAGSYA
        MASI  L V+C +L  SAI SQ R+ RKDLGL+LGGLGVG+G GIGLGLG G GSGSGSGSGSGSGSGS SSSSSSS+SSSS  G   GSEAGSYAGS A
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---GSEAGSYAGSYA

Query:  RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD
         S SGRN RNGGSGSGS YG GS  G GSG+GGGEGYGEG GYGE  GRG GGG+
Subjt:  RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD

XP_023515820.1 putative glycine-rich cell wall structural protein 1 [Cucurbita pepo subsp. pepo]3.7e-3269.78Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---
        M SIR +AVLC V+CMSAIESQGR+ RKDLGL+LGGLGVG+GVG+GLGLG G GSGSGSGSGSGSGSGS SSS SSS S   S  GSEAGSYAGSYA   
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---

Query:  -RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYREGGNN
          S SGRN RNGGSG           G G G G GEGYGEGRGYGEGSGR           EGYGEGRGYGEGR Y EGGNN
Subjt:  -RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYREGGNN

TrEMBL top hitse value%identityAlignment
A0A0A0LTK9 Uncharacterized protein2.6e-2371.7Show/hide
Query:  MASIRVLA-VLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---GSEAGSYAGSY
        MASI+ LA V+CL+L  SAI S+GR+ RKDLG++LGG+GVG+G GIGLG+   GGSGSGSGSGSGSGSGSGSSSSSSS+SSSSS G   GSEAGSYAGSY
Subjt:  MASIRVLA-VLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---GSEAGSYAGSY

Query:  A--RSGSGR-NDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD
        A  R+GSG   +RNGGSG GS YG GS RG GS D  GEGYGEG GYGE  GRG GGG+
Subjt:  A--RSGSGR-NDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD

A0A6J1C269 glycine-rich cell wall structural protein 2-like1.7e-2770.89Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---
        MASIR +A +C V+C  AIESQ R+ RKDLGL+LGGLG+G+G GI  GLG+GGGSGSG+G+GSGSGSGSGS SSSSSHSSSSSYGGS AGS AGSYA   
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYA---

Query:  ---RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD
            SGSGRN RNGGSGSGS YGEGS RG+G G G G GYGEGRGYG     G GG +
Subjt:  ---RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD

A0A6J1E521 putative glycine-rich cell wall structural protein 1 isoform X33.1e-2474.19Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---GSEAGSYAGSYA
        MASI  L V+C +L  S I SQ R+ RKDLGL+LGGLGVG+G GIGLGLG G GSGSGSGSGSGSGSGS SSSSSSS+SSSS  G   GSEAGSYAGS A
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---GSEAGSYAGSYA

Query:  RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD
         SGSG N RNGGSGSGS YG GS  G GSG+GGGEGYGEG GYGE  GRG GGG+
Subjt:  RSGSGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGD

A0A6J1E720 glycine-rich protein DOT1-like8.1e-2573.43Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYARSG
        M SIR +AVLC V+CMSAIESQGR+ RKDLGL+LGGLGVG+GVG+GLGLG G GSGSGSGSGSGSGSGSGS S SSS SSSSSYGGS AGS AGSYA S 
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYARSG

Query:  SGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEG
        +G     GGS SG  Y     R  GSG G G GYGEGRGYGEG
Subjt:  SGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEG

A0A6J1JHN1 glycine-rich cell wall structural protein 2-like isoform X31.0e-2761.46Show/hide
Query:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---------------
        MASIR +++L L+L +S I S+ R+ R DLGL+LGG+GVG+G G+GLGLG  GGSGSGSGSGSGSGSGS SSSSS S SSSS  G               
Subjt:  MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG---------------

Query:  ------GSEAGSYAGSYA----RSGSGRNDRNGGSGSGSEYGEGSDRGSGSG--DGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYR
              GSEAGSYAGSYA     S SGRN RNGGSG    YGEGS RG G G   GGGEGYGEGRGYGEGSGR           EGYG GRGYGEGR Y 
Subjt:  ------GSEAGSYAGSYA----RSGSGRNDRNGGSGSGSEYGEGSDRGSGSG--DGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYR

Query:  EGGNN
        EGGNN
Subjt:  EGGNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30450.1 glycine-rich protein2.3e-1161.11Show/hide
Query:  MASIRVLAVLCLVLCMSAIE-SQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG-----GSEAGSYAG
        MA    L +L LV+  S +  ++ R+ RKDLG++LG  G+G+G+G+GLG+GLGGGSGSG+G+GSGSGSGS SSSSSSS SSSSS G     GS AGS+AG
Subjt:  MASIRVLAVLCLVLCMSAIE-SQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG-----GSEAGSYAG

Query:  SYARSGSG
        S A SGSG
Subjt:  SYARSGSG

AT4G30460.1 glycine-rich protein9.5e-1857.99Show/hide
Query:  LAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG------GSEAGSYAGSYARSG
        L ++ L+L  S + S+ R+ RKDLGL+LGG+G G+G+GIG+G   GGGSGSG+G+GSGSG G  SSSSSSS SSSSS G      GSEAGSYAGS+A SG
Subjt:  LAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYG------GSEAGSYAGSYARSG

Query:  SGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEG
        SG     G SGSG        RG GSG GG      G G G G G GRGGG G   GEGYGEG GYG G
Subjt:  SGRNDRNGGSGSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCATCAGGGTTCTTGCTGTCCTTTGTTTAGTACTCTGTATGTCTGCGATTGAATCACAAGGCCGAATCATGAGGAAGGACTTGGGCCTTAATCTTGGTGGTTT
AGGAGTTGGAGTTGGAGTTGGTATAGGCTTGGGGTTAGGCTTAGGTGGTGGAAGTGGGTCGGGCTCTGGATCCGGGTCCGGATCTGGCTCCGGCTCCGGCTCGAGTTCAT
CTTCATCATCACACTCGTCAAGCTCTAGCTATGGCGGCTCCGAAGCTGGTTCCTATGCTGGTTCTTATGCAAGGTCAGGTTCAGGAAGGAATGATAGGAACGGTGGTTCG
GGGTCAGGCTCGGAATATGGCGAGGGCTCCGACAGAGGAAGCGGTAGTGGTGATGGCGGAGGAGAAGGTTACGGGGAGGGTCGTGGCTACGGCGAGGGTTCTGGCAGAGG
AAGAGGTGGCGGTGACGGTGACAGTAGAGGAGAAGGTTATGGGGAGGGTCGTGGATATGGGGAGGGACGTTGTTACCGAGAAGGCGGTAACAATTGA
mRNA sequenceShow/hide mRNA sequence
TCTCAATCGGCCTATAAATAGCTTCTCCAATCTTTCAACCGCTATAATACATTACCAAGTCAAGTTCACAATCCCATTCGATCGAGTCCATATCAGAAAATGGCCTCCAT
CAGGGTTCTTGCTGTCCTTTGTTTAGTACTCTGTATGTCTGCGATTGAATCACAAGGCCGAATCATGAGGAAGGACTTGGGCCTTAATCTTGGTGGTTTAGGAGTTGGAG
TTGGAGTTGGTATAGGCTTGGGGTTAGGCTTAGGTGGTGGAAGTGGGTCGGGCTCTGGATCCGGGTCCGGATCTGGCTCCGGCTCCGGCTCGAGTTCATCTTCATCATCA
CACTCGTCAAGCTCTAGCTATGGCGGCTCCGAAGCTGGTTCCTATGCTGGTTCTTATGCAAGGTCAGGTTCAGGAAGGAATGATAGGAACGGTGGTTCGGGGTCAGGCTC
GGAATATGGCGAGGGCTCCGACAGAGGAAGCGGTAGTGGTGATGGCGGAGGAGAAGGTTACGGGGAGGGTCGTGGCTACGGCGAGGGTTCTGGCAGAGGAAGAGGTGGCG
GTGACGGTGACAGTAGAGGAGAAGGTTATGGGGAGGGTCGTGGATATGGGGAGGGACGTTGTTACCGAGAAGGCGGTAACAATTGAAATCAAAAGTAAATAAGTGTGAGA
TATTTTGAATAGAACAATACCTAGTTGCACACCAAAGTATGGAAATCTTTCTACTCCATCATTCAATTGTTGACAATAAGAAAAGAGAAAATATTTTTTACCTTTTTTTT
TTAGTTGTGCTCCTCTCTCTTCGTTGTCAATTGAATGGTGGAGATATGAAGATTTCTGCAAACGAAAGTGTAGCTAAGTCTTTATCTTTCAAATGAAATCTCAATGCTAA
CTATAAATAGTTTTGTGTCATTGTGTGTGCTTATGATTTTCCTATAGTTTGAGGCAGCTTCAGATAAGTAAAATAAAAGTTGGTGTGTATTTTTACTGTACTGTTATGTG
TATACCTAAATATTGCTTTTGACTTGGGGCTTTTACTTCCACGGGGCTTGGACTGAGCTTTTAGGTAATAAAATAGGAAGATAATTAAGTGGCATGTTCCATTGTAATGA
AATCAAACCATTTGTGGC
Protein sequenceShow/hide protein sequence
MASIRVLAVLCLVLCMSAIESQGRIMRKDLGLNLGGLGVGVGVGIGLGLGLGGGSGSGSGSGSGSGSGSGSSSSSSSHSSSSSYGGSEAGSYAGSYARSGSGRNDRNGGS
GSGSEYGEGSDRGSGSGDGGGEGYGEGRGYGEGSGRGRGGGDGDSRGEGYGEGRGYGEGRCYREGGNN