; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g03540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g03540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionclassical arabinogalactan protein 1-like
Genome locationchr2:2670999..2671532
RNA-Seq ExpressionMoc02g03540
SyntenyMoc02g03540
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605547.1 hypothetical protein SDJN03_02864, partial [Cucurbita argyrosperma subsp. sororia]3.6e-4368.54Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKSVAFCCLLL FV++ ++   SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL     P+PSP+  PSPSP 
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PAPS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022140708.1 classical arabinogalactan protein 1-like [Momordica charantia]8.1e-80100Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022958106.1 lysine-rich arabinogalactan protein 18-like [Cucurbita moschata]6.7e-4267.98Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKSVAFCCLLL FV++ ++   SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPP SDL     P PSP+  PSPSP 
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PAPS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022995398.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima]9.4e-4469.66Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKSVAFCCLLL FV++ ++   SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL     P+PSP+  PSPSP 
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PAPS   DSD   S+A+GG  E+  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_038901486.1 early nodulin-20-like isoform X1 [Benincasa hispida]3.0e-4270.22Show/hide
Query:  MAKSVAFCCLL-LFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKS+ FC LL +FVSL INV+ SLE L    P    PSPSP+S  +SPPLPSP PFPHAP++SP ESPL+SPPAPPPSDL      T SPSPSPSPSP+
Subjt:  MAKSVAFCCLL-LFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        P+PSP  DSD S S +SGG  E   ASKGGM GGKKAGIA GVIAA  FVGIGG VYKKRQ+NIRRSQYG+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

TrEMBL top hitse value%identityAlignment
A0A6J1CIM1 classical arabinogalactan protein 1-like3.9e-80100Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1FYT3 alpha carbonic anhydrase 8-like9.8e-3965.54Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAK +AFCC L    LL+NV  SLE  +        PSP P+SAA  PPL SP PFPHAP++SP ESPL+SPPAPPPSDL     P+PSP+  PSPS  P
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSP  D D   S+++ G  ES  +SKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1H149 lysine-rich arabinogalactan protein 18-like3.3e-4267.98Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKSVAFCCLLL FV++ ++   SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPP SDL     P PSP+  PSPSP 
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PAPS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1JA87 proline-rich receptor-like protein kinase PERK121.4e-3764.41Show/hide
Query:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP
        MAK +AFCC L    LL+NV  SLE  +        PS  P+SA   PPL SP PFPH P++SP ESPL+SPPAPPPSDL     P+PSP+  PSPSP  
Subjt:  MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        AP+P  DSD   S+++ G VES  +SKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1K7T9 lysine-rich arabinogalactan protein 18-like4.5e-4469.66Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT
        MAKSVAFCCLLL FV++ ++   SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL     P+PSP+  PSPSP 
Subjt:  MAKSVAFCCLLL-FVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPT

Query:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PAPS   DSD   S+A+GG  E+  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28440.1 proline-rich family protein3.4e-0737.02Show/hide
Query:  DVTAPAGGSPSP---SPQSAADSPPLPSPAPFPHAPSASPPESPLNSP--------PAPPP---------SDLPPRPAPTPSPS-------PSPS----P
        +V +P   S SP   SPQ  + SP   SP P   +P A+ P+SP +SP        P+PPP         S   P PAP P+PS       P P     P
Subjt:  DVTAPAGGSPSP---SPQSAADSPPLPSPAPFPHAPSASPPESPLNSP--------PAPPP---------SDLPPRPAPTPSPS-------PSPS----P

Query:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP P+P      D+  S A+G E+        GM+G +KAGIA G I  V  + IG +VYKKR++N+ R++Y       FL
Subjt:  SPTPAPSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

AT3G45230.1 hydroxyproline-rich glycoprotein family protein6.3e-1452.35Show/hide
Query:  SPSPSPQSAADSPPLPSPAPFPHAPSASPPESPL--NSPPAPPPSDLP-PRPAPTPSPSP-------SPSPSPTPAPSPNRDSDVSKSVASGGEVES-GA
        SP+PSP   ADSP + +  P       SP ESP+  +SPP P     P P PA +PS SP       SPS S +P+PSP   SDV+ S  +G E E   +
Subjt:  SPSPSPQSAADSPPLPSPAPFPHAPSASPPESPL--NSPPAPPPSDLP-PRPAPTPSPSP-------SPSPSPTPAPSPNRDSDVSKSVASGGEVES-GA

Query:  ASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAAR
         S GGM+GGKK G+AFG IAAVC VG+ G VYKKRQ NIRRS+YG AAR
Subjt:  ASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATCCGTTGCATTCTGTTGCCTTCTTCTGTTCGTTTCGCTGTTGATCAATGTCACTGTCTCTTTGGAACCACTCGATGTGACTGCGCCGGCGGGTGGTTCGCC
TTCTCCGTCCCCTCAATCTGCCGCCGATTCTCCTCCACTGCCCTCCCCGGCCCCATTCCCCCACGCTCCTTCCGCCTCTCCACCGGAGTCGCCTTTGAACTCTCCTCCTG
CGCCTCCGCCCTCAGATCTTCCTCCCCGTCCGGCTCCGACGCCTTCTCCTTCTCCTTCTCCTTCTCCGTCCCCGACTCCTGCTCCTTCGCCTAACAGAGACAGCGATGTC
AGTAAGAGCGTCGCCAGTGGCGGTGAAGTGGAATCGGGAGCAGCCTCCAAAGGCGGGATGAACGGAGGCAAGAAGGCTGGAATTGCATTTGGAGTGATTGCCGCAGTGTG
TTTCGTCGGAATTGGAGGAATCGTGTACAAGAAGCGCCAAAACAACATTCGCCGATCTCAGTACGGGGACGCCGCTAGGTCTTCCTTCCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAATCCGTTGCATTCTGTTGCCTTCTTCTGTTCGTTTCGCTGTTGATCAATGTCACTGTCTCTTTGGAACCACTCGATGTGACTGCGCCGGCGGGTGGTTCGCC
TTCTCCGTCCCCTCAATCTGCCGCCGATTCTCCTCCACTGCCCTCCCCGGCCCCATTCCCCCACGCTCCTTCCGCCTCTCCACCGGAGTCGCCTTTGAACTCTCCTCCTG
CGCCTCCGCCCTCAGATCTTCCTCCCCGTCCGGCTCCGACGCCTTCTCCTTCTCCTTCTCCTTCTCCGTCCCCGACTCCTGCTCCTTCGCCTAACAGAGACAGCGATGTC
AGTAAGAGCGTCGCCAGTGGCGGTGAAGTGGAATCGGGAGCAGCCTCCAAAGGCGGGATGAACGGAGGCAAGAAGGCTGGAATTGCATTTGGAGTGATTGCCGCAGTGTG
TTTCGTCGGAATTGGAGGAATCGTGTACAAGAAGCGCCAAAACAACATTCGCCGATCTCAGTACGGGGACGCCGCTAGGTCTTCCTTCCTATGA
Protein sequenceShow/hide protein sequence
MAKSVAFCCLLLFVSLLINVTVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPSPTPAPSPNRDSDV
SKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL