; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017313 (gene) of Snake gourd v1 genome

Gene IDTan0017313
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionclassical arabinogalactan protein 26
Genome locationLG06:7543807..7544358
RNA-Seq ExpressionTan0017313
SyntenyTan0017313
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR039346 - Classical arabinogalactan protein 25/26


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575865.1 hypothetical protein SDJN03_26504, partial [Cucurbita argyrosperma subsp. sororia]1.3e-4583.7Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP
        MASIFSLYIPIFMAYTASIL FSFASYPN+FPPLISSISAAPEFSP PSPSP  DISPLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  +APAPE+P PP
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP

Query:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        SQSLPFSAAA LNSG      W+AL AA VAE+CR
Subjt:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR

XP_022953817.1 classical arabinogalactan protein 26 [Cucurbita moschata]2.9e-4582.96Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP
        MASIFSLYIPIFMAYTASIL FSFASYPN+FPPLISSISAAPEFSP PSPSP  DI+PLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  +APAPE+P PP
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP

Query:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        SQSLPFSAAA LNSG      W+AL AA VAE+CR
Subjt:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR

XP_022991733.1 classical arabinogalactan protein 26 [Cucurbita maxima]1.1e-4482.22Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP
        MASIFSLYIPIFMAYTASI AFSFASYPN+FPPLISSISAAPEFSP PSPSP  DISPLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  + PAPE+P PP
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP

Query:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        SQSLPFSAAA LNSG      W+AL AA V E+CR
Subjt:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR

XP_023548939.1 classical arabinogalactan protein 26-like [Cucurbita pepo subsp. pepo]1.1e-4482.22Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP
        MASIFSLYIPIFMAYTASI  FSFASYPN+FPPLISSISAAPEFSP PSPSP  DISPLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  +APAPE+P PP
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP

Query:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        SQSLPFSA A LNSG      W+AL AA VAE+CR
Subjt:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR

XP_038876736.1 classical arabinogalactan protein 26-like [Benincasa hispida]5.2e-3976.64Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP----DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPS
        MASIFSLYIPIFMAYTASIL FS ASYPNY P LISSISAAPEFSP PSPSP    DISPLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  +APAPE+  
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP----DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPS

Query:  PPSQSLPFSAAAALNSGRLFW---LALAAAFVAELCR
        PPSQSLPFSAA +LN     W   +AL  A   ELCR
Subjt:  PPSQSLPFSAAAALNSGRLFW---LALAAAFVAELCR

TrEMBL top hitse value%identityAlignment
A0A0A0K510 Uncharacterized protein2.4e-3776.12Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPP-LISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPPS
        MASIFSLYIPIFMAYTAS+  FSFASYPN FP  LISSISAAPEFSP P+P+ DISPLFPTPG A+LPPSSLPTIPSSPSPPNPDF  +APAPE+P  PS
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPP-LISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPPS

Query:  QSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        QSLPFS AAALNSG      ++AL    VAEL R
Subjt:  QSLPFSAAAALNSGR---LFWLALAAAFVAELCR

A0A1S3BRG9 classical arabinogalactan protein 268.4e-3571.94Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPP----LISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPS
        MASIFSLYIPIFMAYTAS   F+FASYPNYFP     + SSISAAPEFSP P+PS DISPLFPTPG A+LPPSSLPTIPSSPSPPNPDF   APAPE+P 
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPP----LISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPS

Query:  PPSQSLPFSAAAALNSGRLFW-----LALAAAFVAELCR
         P+ S+PFSAAAALNS  + W     LAL  A  AEL R
Subjt:  PPSQSLPFSAAAALNSGRLFW-----LALAAAFVAELCR

A0A5A7UQZ3 Classical arabinogalactan protein 268.4e-3571.94Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPP----LISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPS
        MASIFSLYIPIFMAYTAS   F+FASYPNYFP     + SSISAAPEFSP P+PS DISPLFPTPG A+LPPSSLPTIPSSPSPPNPDF   APAPE+P 
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPP----LISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPS

Query:  PPSQSLPFSAAAALNSGRLFW-----LALAAAFVAELCR
         P+ S+PFSAAAALNS  + W     LAL  A  AEL R
Subjt:  PPSQSLPFSAAAALNSGRLFW-----LALAAAFVAELCR

A0A6J1GPB2 classical arabinogalactan protein 261.4e-4582.96Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP
        MASIFSLYIPIFMAYTASIL FSFASYPN+FPPLISSISAAPEFSP PSPSP  DI+PLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  +APAPE+P PP
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP

Query:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        SQSLPFSAAA LNSG      W+AL AA VAE+CR
Subjt:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR

A0A6J1JX46 classical arabinogalactan protein 265.3e-4582.22Show/hide
Query:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP
        MASIFSLYIPIFMAYTASI AFSFASYPN+FPPLISSISAAPEFSP PSPSP  DISPLFPTPGGA+LPPSSLPTIPSSPSPPNPDF  + PAPE+P PP
Subjt:  MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSP--DISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPP

Query:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR
        SQSLPFSAAA LNSG      W+AL AA V E+CR
Subjt:  SQSLPFSAAAALNSGR---LFWLALAAAFVAELCR

SwissProt top hitse value%identityAlignment
Q94F57 Classical arabinogalactan protein 267.9e-0641.27Show/hide
Query:  IPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPE---------PSPSPDISPLFPTPGGASLPPSS-----LPTIPSSPSPPNPDFTVSAPAPEL
        + +F A+T  +L+    +  + F   +S+ISAAP F PE         P+ SPD SPLFPTPG + + PS      +PTIPSS SPPNPD     P  E+
Subjt:  IPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPE---------PSPSPDISPLFPTPGGASLPPSS-----LPTIPSSPSPPNPDFTVSAPAPEL

Query:  PSPPSQSLPFSAAAALNSGRLFWLAL
         SP    LP S++  L S +L  L L
Subjt:  PSPPSQSLPFSAAAALNSGRLFWLAL

Arabidopsis top hitse value%identityAlignment
AT2G47930.1 arabinogalactan protein 265.6e-0741.27Show/hide
Query:  IPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPE---------PSPSPDISPLFPTPGGASLPPSS-----LPTIPSSPSPPNPDFTVSAPAPEL
        + +F A+T  +L+    +  + F   +S+ISAAP F PE         P+ SPD SPLFPTPG + + PS      +PTIPSS SPPNPD     P  E+
Subjt:  IPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPE---------PSPSPDISPLFPTPGGASLPPSS-----LPTIPSSPSPPNPDFTVSAPAPEL

Query:  PSPPSQSLPFSAAAALNSGRLFWLAL
         SP    LP S++  L S +L  L L
Subjt:  PSPPSQSLPFSAAAALNSGRLFWLAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATTTTCTCTCTCTACATTCCCATTTTCATGGCCTATACAGCTTCTATTTTGGCTTTCTCTTTCGCTTCTTATCCCAATTACTTCCCTCCCCTCATTTCCTC
CATCTCCGCCGCGCCGGAATTTTCGCCTGAGCCTTCCCCGTCGCCGGACATTTCTCCCCTCTTTCCGACCCCGGGCGGCGCGTCCCTTCCGCCGTCCTCCTTGCCGACCA
TCCCCTCCAGTCCCAGCCCTCCCAACCCGGATTTTACGGTCTCCGCACCGGCGCCGGAGCTGCCTTCACCGCCGTCTCAGTCCTTGCCGTTCTCCGCTGCTGCTGCTCTC
AACTCCGGTCGGTTGTTTTGGCTGGCTCTTGCGGCGGCTTTCGTGGCGGAGCTCTGCCGGTGGTGGTGTTAG
mRNA sequenceShow/hide mRNA sequence
CCATCACAGACACCCCCTCCCTTTTCAGCTTTAATCATAACCCCTTTTCTGCTCTGGTTTTTTCCTTCTCCGTTCTTCATTCTTCTTCTTCCTCTCAAATTCTTACCCAT
TTCTTTTTTTTTTTTGTTTCTCTCTCTCTCATCATTGTAAATGGCTTCCATTTTCTCTCTCTACATTCCCATTTTCATGGCCTATACAGCTTCTATTTTGGCTTTCTCTT
TCGCTTCTTATCCCAATTACTTCCCTCCCCTCATTTCCTCCATCTCCGCCGCGCCGGAATTTTCGCCTGAGCCTTCCCCGTCGCCGGACATTTCTCCCCTCTTTCCGACC
CCGGGCGGCGCGTCCCTTCCGCCGTCCTCCTTGCCGACCATCCCCTCCAGTCCCAGCCCTCCCAACCCGGATTTTACGGTCTCCGCACCGGCGCCGGAGCTGCCTTCACC
GCCGTCTCAGTCCTTGCCGTTCTCCGCTGCTGCTGCTCTCAACTCCGGTCGGTTGTTTTGGCTGGCTCTTGCGGCGGCTTTCGTGGCGGAGCTCTGCCGGTGGTGGTGTT
AG
Protein sequenceShow/hide protein sequence
MASIFSLYIPIFMAYTASILAFSFASYPNYFPPLISSISAAPEFSPEPSPSPDISPLFPTPGGASLPPSSLPTIPSSPSPPNPDFTVSAPAPELPSPPSQSLPFSAAAAL
NSGRLFWLALAAAFVAELCRWWC