; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0294 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0294
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionclassical arabinogalactan protein 1-like
Genome locationMC02:2672401..2674616
RNA-Seq ExpressionMC02g0294
SyntenyMC02g0294
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605547.1 hypothetical protein SDJN03_02864, partial [Cucurbita argyrosperma subsp. sororia]2.71e-5569.06Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP
        MAKSVAFCCLLL FV++ ++   SLE P++V       PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL      TPSPSP+  PSP
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP

Query:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP PAPS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022140708.1 classical arabinogalactan protein 1-like [Momordica charantia]4.18e-103100Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022995398.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima]4.71e-5670.17Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP
        MAKSVAFCCLLL FV++ ++   SLE P++V       PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL      TPSPSP+  PSP
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP

Query:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP PAPS   DSD   S+A+GG  E+  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_023534499.1 alpha carbonic anhydrase 8-like [Cucurbita pepo subsp. pepo]1.27e-5368.51Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP
        MAKSVAF CLLL FV++ ++   SLE P++V       PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL      TPSPSP+  PSP
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP

Query:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP PAPS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_038901486.1 early nodulin-20-like isoform X1 [Benincasa hispida]5.91e-5470.22Show/hide
Query:  MAKSVAFCCLL-LFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKS+ FC LL +FVSL INV+ SLE L    P    PSPSP+S  +SPPLPSP PFPHAP++SP ESPL+SPPAPPPSDL      T SPSPSPSPSP+
Subjt:  MAKSVAFCCLL-LFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        P+PSP  DSD S S +SGG  +S  ASKGGM GGKKAGIA GVIAA  FVGIGG VYKKRQ+NIRRSQYG+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

TrEMBL top hitse value%identityAlignment
A0A6J1CIM1 classical arabinogalactan protein 1-like2.02e-103100Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1FYT3 alpha carbonic anhydrase 8-like2.65e-4965.54Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAK +AFCC LL    L+NV  SLE  +   P     SP P+SAA  PPL SP PFPHAP++SP ESPL+SPPAPPPSDL     P+PSP+  PSPS  P
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSP  D D   S+++ G  ES  +SKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1H149 lysine-rich arabinogalactan protein 18-like8.69e-5468.16Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSP
        MAKSVAFCCLLL FV++ ++   SLE P++V       PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPP SDL     P PSP+  PSPSP
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSP

Query:  TPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
         PAPS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  TPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1JA87 proline-rich receptor-like protein kinase PERK128.70e-4863.84Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAK +AFCC LL    L+NV  SLE  +   P+       P+SA   PPL SP PFPH P++SP ESPL+SPPAPPPSDL     P+PSP+  PSPSP  
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        AP+P  DSD   S+++ G VES  +SKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1K7T9 lysine-rich arabinogalactan protein 18-like2.28e-5670.17Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP
        MAKSVAFCCLLL FV++ ++   SLE P++V       PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL      TPSPSP+  PSP
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLE-PLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPS--PSP

Query:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP PAPS   DSD   S+A+GG  E+  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28440.1 proline-rich family protein3.4e-0737.02Show/hide
Query:  DVTAPAGGSPSP---SPQSAADSPPLPSPAPFPHAPSASPPESPLNSP--------PAPPP---------SDLPPRPAPTPSPS-------PSPS----P
        +V +P   S SP   SPQ  + SP   SP P   +P A+ P+SP +SP        P+PPP         S   P PAP P+PS       P P     P
Subjt:  DVTAPAGGSPSP---SPQSAADSPPLPSPAPFPHAPSASPPESPLNSP--------PAPPP---------SDLPPRPAPTPSPS-------PSPS----P

Query:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP P+P      D+  S A+G E+        GM+G +KAGIA G I  V  + IG +VYKKR++N+ R++Y       FL
Subjt:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

AT3G45230.1 hydroxyproline-rich glycoprotein family protein6.3e-1452.35Show/hide
Query:  SPSPSPQSAADSPPLPSPAPFPHAPSASPPESPL--NSPPAPPPSDLP-PRPAPTPSPSP-------SPSPSPTPAPSPNRDSDVSKSVASGGEVES-GA
        SP+PSP   ADSP + +  P       SP ESP+  +SPP P     P P PA +PS SP       SPS S +P+PSP   SDV+ S  +G E E   +
Subjt:  SPSPSPQSAADSPPLPSPAPFPHAPSASPPESPL--NSPPAPPPSDLP-PRPAPTPSPSP-------SPSPSPTPAPSPNRDSDVSKSVASGGEVES-GA

Query:  ASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAAR
         S GGM+GGKK G+AFG IAAVC VG+ G VYKKRQ NIRRS+YG AAR
Subjt:  ASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATCCGTTGCATTCTGTTGCCTTCTTCTGTTCGTTTCGCTGTTGATCAATGTCACTGTCTCTTTGGAACCACTCGATGTGACTGCGCCGGCGGGTGGTTCGCC
TTCTCCGTCCCCTCAATCTGCCGCCGATTCTCCTCCACTGCCCTCCCCGGCCCCATTCCCCCACGCTCCTTCCGCCTCTCCACCGGAGTCGCCTTTGAACTCTCCTCCTG
CGCCTCCGCCCTCAGATCTTCCTCCCCGTCCGGCTCCGACGCCTTCTCCTTCTCCTTCTCCTTCTCCGTCCCCGACTCCTGCTCCTTCGCCTAACAGAGACAGCGATGTC
AGTAAGAGCGTCGCCAGTGGCGGTGAAGTGGAATCGGGAGCAGCCTCCAAAGGCGGGATGAACGGAGGCAAGAAGGCTGGAATTGCATTTGGAGTGATTGCCGCAGTGTG
TTTCGTCGGAATTGGAGGAATCGTGTACAAGAAGCGCCAAAACAACATTCGCCGATCTCAGTACGGGGACGCCGCTAGGTCTTCCTTCCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAATCCGTTGCATTCTGTTGCCTTCTTCTGTTCGTTTCGCTGTTGATCAATGTCACTGTCTCTTTGGAACCACTCGATGTGACTGCGCCGGCGGGTGGTTCGCC
TTCTCCGTCCCCTCAATCTGCCGCCGATTCTCCTCCACTGCCCTCCCCGGCCCCATTCCCCCACGCTCCTTCCGCCTCTCCACCGGAGTCGCCTTTGAACTCTCCTCCTG
CGCCTCCGCCCTCAGATCTTCCTCCCCGTCCGGCTCCGACGCCTTCTCCTTCTCCTTCTCCTTCTCCGTCCCCGACTCCTGCTCCTTCGCCTAACAGAGACAGCGATGTC
AGTAAGAGCGTCGCCAGTGGCGGTGAAGTGGAATCGGGAGCAGCCTCCAAAGGCGGGATGAACGGAGGCAAGAAGGCTGGAATTGCATTTGGAGTGATTGCCGCAGTGTG
TTTCGTCGGAATTGGAGGAATCGTGTACAAGAAGCGCCAAAACAACATTCGCCGATCTCAGTACGGGGACGCCGCTAGGTCTTCCTTCCTATGAGGATCAAAATGGAGAC
GGAAGCTATGGTTCAGAGAAGTCGCAACCTAAACTCATTGTACACGCAACAATATACACTCCCTGCATCAAAAAATCATGTATCCCGTAGGCCAACTCCGTTTCGACTTA
ATTTCTGCTTGATGTTTCTGAGTTTCCGCCAACTGTAGGATGCTGGCGGGGAATGGCATTGTGTAGTGAACTTATAGCTATGAATGGTCGTATTGGTTGCGCTATTCTCT
GAGGGTCAAGGCGCTGTTGTGTTGAAGGTAGCCCCCATCGTCTTCTCCATCTTAAAAATTCAACGGTAATTGAACTTCCGCTCATGACAATGCCAAACCCAGTTAATGTA
GCCAGTACAATGGCCAGAACTGCTTGGACACCAACCTGTAAACCATTCTAAAGATCAATCTCCAACATCAAAATAAGTCTGCTACTGGGCTTATTTGTACAAGGCATCCT
ATTGTTAATTCAGCAGCAAAATGAAGATATATTAGGTTTTAGTTTTCAGATAATGCCATTCGATAAATGATCCAAAAGAATACGAGCTGTGGATTGTAAAACTCAATCCC
TGCTAACAAATGTCCACTATGGATAATATCAATAGGAAAAAAAAAGATAGATTTATCTCTTACCACGGAGTAAAATATGTGCGAGAAGAGAACCACCATGGCAAATTGTA
CTGTTGCATACACCCATATGAATCTTCTCTTCACTGCAAACGAAATTTTTTGTTCAGGGGTACTGAAGTAAACTAACTGTCTATAATAGCAACAAACTGTCACAGAAGAA
TCCCAAAGTCTACCAAATCAAGACAGTTTCAGCCAAGTCAGGATTGACTGAACTCTATGTTTCAGGCTTATTGAATATTAGTGCAATGTGATAGATTAGCAAATATGAGT
TGAATCCTCA
Protein sequenceShow/hide protein sequence
MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTPAPSPNRDSDV
SKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL