; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006273 (gene) of Snake gourd v1 genome

Gene IDTan0006273
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionlysine-rich arabinogalactan protein 18-like
Genome locationLG05:1966557..1967048
RNA-Seq ExpressionTan0006273
SyntenyTan0006273
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605547.1 hypothetical protein SDJN03_02864, partial [Cucurbita argyrosperma subsp. sororia]9.5e-5982.74Show/hide
Query:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS
        MAKSVAFCCLLL+     M+AAFSLEQP EVPPSPSPESAADSPP PSPTPF HAP+SSP +SPL SPPAPPPSD TP+PSPA VPSPSP PA S TADS
Subjt:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS

Query:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        D  NS ANGGG E+  SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

XP_022958106.1 lysine-rich arabinogalactan protein 18-like [Cucurbita moschata]1.8e-5782.14Show/hide
Query:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS
        MAKSVAFCCLLL+     M+AAFSLEQP EVPPSPSPESAADSPP PSPTPF HAP+SSP +SPL SPPAPP SD TP PSPA VPSPSP PA S TADS
Subjt:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS

Query:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        D  NS ANGGG E+  SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

XP_022995398.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima]2.5e-5983.93Show/hide
Query:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS
        MAKSVAFCCLLL+     M+AAFSLEQP EVPPSPSPESAADSPP PSPTPF HAP+SSP +SPL SPPAPPPSD TP+PSPA VPSPSP PA S TADS
Subjt:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS

Query:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        D  NS ANGGG E+ ASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

XP_023513399.1 classical arabinogalactan protein 2-like [Cucurbita pepo subsp. pepo]1.1e-5480.37Show/hide
Query:  MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNS
        MAK +A+CC LLLMN AFSLEQ  +VPPSPSPESAA+ PP  SPTPF HAP SSPV+SPL+SPPAPPPSD  P+PSPA  PSPS  PA SP ADSD  NS
Subjt:  MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNS

Query:  TANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
         +N GGEES +SKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
Subjt:  TANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

XP_023534499.1 alpha carbonic anhydrase 8-like [Cucurbita pepo subsp. pepo]1.8e-5782.14Show/hide
Query:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS
        MAKSVAF CLLL+     M+AAFSLEQP EVPPSPSPESAADSPP PSPTPF HAP+SSP +SPL SPPAPPPSD TP+PSPA VPSPSP PA S TADS
Subjt:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS

Query:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        D  NS ANGGG E+  SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

TrEMBL top hitse value%identityAlignment
A0A6J1CIM1 classical arabinogalactan protein 1-like4.8e-4873.45Show/hide
Query:  MAKSVAFCCLL----LLMNAAFSLE-----QPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDF----TPTPSPAPVPSPSPTP
        MAKSVAFCCLL    LL+N   SLE      PA   PSPSP+SAADSPP PSP PF HAPS+SP +SPLNSPPAPPPSD      PTPSP+P PSPSPTP
Subjt:  MAKSVAFCCLL----LLMNAAFSLE-----QPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDF----TPTPSPAPVPSPSPTP

Query:  AHSPTADSDSSNSTANGGGEES-VASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        A SP  DSD S S A+GG  ES  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  AHSPTADSDSSNSTANGGGEES-VASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

A0A6J1FYT3 alpha carbonic anhydrase 8-like9.0e-5580.37Show/hide
Query:  MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNS
        MAK +AFCC LLLMN AFSLEQ  EVPPSP PESAA  PP  SPTPF HAP+SSPV+SPL+SPPAPPPSD  P+PSPA  PSPS  PA SP AD D  NS
Subjt:  MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNS

Query:  TANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
         +N GGEES +SKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
Subjt:  TANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

A0A6J1H149 lysine-rich arabinogalactan protein 18-like8.7e-5882.14Show/hide
Query:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS
        MAKSVAFCCLLL+     M+AAFSLEQP EVPPSPSPESAADSPP PSPTPF HAP+SSP +SPL SPPAPP SD TP PSPA VPSPSP PA S TADS
Subjt:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS

Query:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        D  NS ANGGG E+  SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

A0A6J1JA87 proline-rich receptor-like protein kinase PERK121.4e-5277.91Show/hide
Query:  MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNS
        MAK +AFCC LLLMN AFSLEQ  EVPPS  PESA   PP  SP PF H P+SSPV+SPL+SPPAPPPSD  P+PSPA VPSPSP  A +P ADSD  NS
Subjt:  MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNS

Query:  TANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
         +N GG ES +SKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
Subjt:  TANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

A0A6J1K7T9 lysine-rich arabinogalactan protein 18-like1.2e-5983.93Show/hide
Query:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS
        MAKSVAFCCLLL+     M+AAFSLEQP EVPPSPSPESAADSPP PSPTPF HAP+SSP +SPL SPPAPPPSD TP+PSPA VPSPSP PA S TADS
Subjt:  MAKSVAFCCLLLL-----MNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADS

Query:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
        D  NS ANGGG E+ ASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DSSNSTANGGGEESVASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

SwissProt top hitse value%identityAlignment
Q9FPQ6 Vegetative cell wall protein gp16.3e-0559.15Show/hide
Query:  PAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTA
        PA VPPSP+P S A SPP PSP P + +PS SP  SP  S P+P PS   P+PSP+P+PSPSP P+ SP A
Subjt:  PAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTA

Arabidopsis top hitse value%identityAlignment
AT2G28440.1 proline-rich family protein2.1e-0838.51Show/hide
Query:  SLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSP-------LNSPPAPPPSDFTPT----PSPAPVPSPS---------------PTPAHSPT
        S E  +  PPS SPE  ADSP  PS +P +++P  SP  SP         SPP PPP   +P+    P PAPVP+PS               P+PA SP 
Subjt:  SLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSP-------LNSPPAPPPSDFTPT----PSPAPVPSPS---------------PTPAHSPT

Query:  ADSDSSNSTANGGGEESVASKG---GMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL
                 ++  GEE    +G   GM+G +KAGIA+G I     + IG +VYKKR+DN+ R++Y       FL
Subjt:  ADSDSSNSTANGGGEESVASKG---GMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL

AT3G45230.1 hydroxyproline-rich glycoprotein family protein2.2e-1345.58Show/hide
Query:  PSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLN-SPPAPPPSDFTPTPSPAPVPSPSP-------------TPAHSPTA-DSDSSNSTANGGGEESVAS
        P+PSP+  ADSP   +  P      +SP +SP+  S P  P ++ +P+PSPA  PS SP             +P+ SP A D + S+ T   G +    S
Subjt:  PSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLN-SPPAPPPSDFTPTPSPAPVPSPSP-------------TPAHSPTA-DSDSSNSTANGGGEESVAS

Query:  KGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAAR
         GGM+GGKK G+A G IAA C VG+ G VYKKRQ+NIRRS+YG AAR
Subjt:  KGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAAR

AT5G60630.1 FUNCTIONS IN: molecular_function unknown3.8e-0541.74Show/hide
Query:  SSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSN---STANGGGEESVASKGGMNGGKKAGIA-VGVIAAACFVGIGGIVYKKRQD
        S+  +   +    + P S+   +PSP  + +    P+ S   +S+  N   ST  GG E SV  + G  GGKK GIA VG IAAA  VG GG V KKR++
Subjt:  SSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSN---STANGGGEESVASKGGMNGGKKAGIA-VGVIAAACFVGIGGIVYKKRQD

Query:  NIRRSQYGNAARSSF
        NIRRS+YG A+   F
Subjt:  NIRRSQYGNAARSSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATCCGTTGCTTTTTGTTGCCTTCTTCTCTTGATGAATGCCGCTTTCTCTTTGGAACAACCAGCAGAAGTTCCGCCTTCTCCATCCCCTGAATCTGCTGCGGA
TTCTCCTCCATTTCCCTCTCCTACTCCATTCTCTCACGCTCCTTCCAGTTCTCCGGTGAAATCGCCTTTGAACTCTCCTCCTGCTCCTCCGCCCTCAGATTTCACTCCCA
CTCCATCTCCAGCGCCTGTTCCATCTCCTTCGCCGACTCCGGCCCATTCACCTACCGCCGACAGCGATTCTAGTAACAGCACTGCCAATGGCGGTGGAGAAGAGTCAGTA
GCCTCCAAAGGCGGGATGAATGGAGGCAAGAAGGCTGGAATTGCAGTTGGAGTGATTGCTGCAGCGTGTTTCGTCGGTATTGGAGGAATCGTGTACAAGAAGCGTCAAGA
CAACATTCGCCGATCTCAGTACGGGAACGCTGCTAGGTCGTCGTTCCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAATCCGTTGCTTTTTGTTGCCTTCTTCTCTTGATGAATGCCGCTTTCTCTTTGGAACAACCAGCAGAAGTTCCGCCTTCTCCATCCCCTGAATCTGCTGCGGA
TTCTCCTCCATTTCCCTCTCCTACTCCATTCTCTCACGCTCCTTCCAGTTCTCCGGTGAAATCGCCTTTGAACTCTCCTCCTGCTCCTCCGCCCTCAGATTTCACTCCCA
CTCCATCTCCAGCGCCTGTTCCATCTCCTTCGCCGACTCCGGCCCATTCACCTACCGCCGACAGCGATTCTAGTAACAGCACTGCCAATGGCGGTGGAGAAGAGTCAGTA
GCCTCCAAAGGCGGGATGAATGGAGGCAAGAAGGCTGGAATTGCAGTTGGAGTGATTGCTGCAGCGTGTTTCGTCGGTATTGGAGGAATCGTGTACAAGAAGCGTCAAGA
CAACATTCGCCGATCTCAGTACGGGAACGCTGCTAGGTCGTCGTTCCTATGA
Protein sequenceShow/hide protein sequence
MAKSVAFCCLLLLMNAAFSLEQPAEVPPSPSPESAADSPPFPSPTPFSHAPSSSPVKSPLNSPPAPPPSDFTPTPSPAPVPSPSPTPAHSPTADSDSSNSTANGGGEESV
ASKGGMNGGKKAGIAVGVIAAACFVGIGGIVYKKRQDNIRRSQYGNAARSSFL