; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005070 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005070
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionclassical arabinogalactan protein 1-like
Genome locationscaffold176:2445619..2446143
RNA-Seq ExpressionMS005070
SyntenyMS005070
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605547.1 hypothetical protein SDJN03_02864, partial [Cucurbita argyrosperma subsp. sororia]1.6e-4369.32Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA
        MAKSVAFCCLLL FV++ ++ A SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL   P+P+P+  PSPSP PA
Subjt:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA

Query:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022140708.1 classical arabinogalactan protein 1-like [Momordica charantia]4.9e-7798.31Show/hide
Query:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPT--PSPSPSPSPTP
        MAKSVAFCCLLLFVSLLINV VSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPT  PSPSPSPSPTP
Subjt:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPT--PSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022958106.1 lysine-rich arabinogalactan protein 18-like [Cucurbita moschata]2.3e-4268.75Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA
        MAKSVAFCCLLL FV++ ++ A SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPP SDL   P P+P+  PSPSP PA
Subjt:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA

Query:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_022995398.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima]4.2e-4470.45Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA
        MAKSVAFCCLLL FV++ ++ A SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL   P+P+P+  PSPSP PA
Subjt:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA

Query:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PS   DSD   S+A+GG  E+  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

XP_023534499.1 alpha carbonic anhydrase 8-like [Cucurbita pepo subsp. pepo]3.0e-4268.75Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA
        MAKSVAF CLLL FV++ ++ A SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL   P+P+P+  PSPSP PA
Subjt:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA

Query:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

TrEMBL top hitse value%identityAlignment
A0A6J1CIM1 classical arabinogalactan protein 1-like2.4e-7798.31Show/hide
Query:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPT--PSPSPSPSPTP
        MAKSVAFCCLLLFVSLLINV VSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPT  PSPSPSPSPTP
Subjt:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPT--PSPSPSPSPTP

Query:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
Subjt:  APSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1FYT3 alpha carbonic anhydrase 8-like1.3e-3866.29Show/hide
Query:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAP
        MAK +AFCC L    LL+NVA SLE  +        PSP P+SAA  PPL SP PFPHAP++SP ESPL+SPPAPPPSDL   P+P+P+  PSPS  PAP
Subjt:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAP

Query:  SPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        SP  D D   S+++ G  ES  +SKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  SPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1H149 lysine-rich arabinogalactan protein 18-like1.1e-4268.75Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA
        MAKSVAFCCLLL FV++ ++ A SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPP SDL   P P+P+  PSPSP PA
Subjt:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA

Query:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PS   DSD   S+A+GG  E+   SKGGMNGGKKAGIA GVIAA CFVG+GGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1JA87 proline-rich receptor-like protein kinase PERK121.8e-3765.14Show/hide
Query:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAP
        MAK +AFCC L    LL+NVA SLE  +        PS  P+SA   PPL SP PFPH P++SP ESPL+SPPAPPPSDL   P+P+P+  PSPSP  AP
Subjt:  MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAP

Query:  SPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        +P  DSD   S+++ G VES  +SKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQYG+AARSSFL
Subjt:  SPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

A0A6J1K7T9 lysine-rich arabinogalactan protein 18-like2.0e-4470.45Show/hide
Query:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA
        MAKSVAFCCLLL FV++ ++ A SLE      P    PSPSP+SAADSPP+PSP PFPHAP++SP ESPL SPPAPPPSDL   P+P+P+  PSPSP PA
Subjt:  MAKSVAFCCLLL-FVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPA

Query:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        PS   DSD   S+A+GG  E+  ASKGGMNGGKKAGIA GVIAA CFVGIGGIVYKKRQ+NIRRSQ+G+AARSSFL
Subjt:  PSPNRDSDVSKSVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28440.1 proline-rich family protein3.3e-0737.84Show/hide
Query:  DVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAPSPNRDSDVSKSVASGGEVESGAASKGG
        +  +P   + SP P+S ADSP  P P P P +PS+     P    P P PSD      P P     PSP P+P      D+  S A+G E+        G
Subjt:  DVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAPSPNRDSDVSKSVASGGEVESGAASKGG

Query:  MNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL
        M+G +KAGIA G I  V  + IG +VYKKR++N+ R++Y       FL
Subjt:  MNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL

AT3G45230.1 hydroxyproline-rich glycoprotein family protein1.4e-1351.35Show/hide
Query:  SPSPSPQSAADSPPLPSPAPFPHAPSASPPESPL--NSPPAPPPSDLP-PRPAPTPSPSP-------SPSPTPAPSPNRD-SDVSKSVASGGEVES-GAA
        SP+PSP   ADSP + +  P       SP ESP+  +SPP P     P P PA +PS SP       SPS + +PSP+ + SDV+ S  +G E E   + 
Subjt:  SPSPSPQSAADSPPLPSPAPFPHAPSASPPESPL--NSPPAPPPSDLP-PRPAPTPSPSP-------SPSPTPAPSPNRD-SDVSKSVASGGEVES-GAA

Query:  SKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAAR
        S GGM+GGKK G+AFG IAAVC VG+ G VYKKRQ NIRRS+YG AAR
Subjt:  SKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATCCGTTGCATTCTGTTGCCTTCTTCTGTTCGTTTCGCTGTTGATCAATGTCGCTGTCTCTTTGGAACCACTCGATGTGACTGCGCCGGCGGGTGGTTCGCC
TTCTCCGTCCCCTCAATCTGCCGCCGATTCTCCTCCACTGCCCTCCCCGGCCCCATTCCCCCACGCTCCTTCCGCCTCTCCACCGGAGTCGCCTTTGAACTCTCCTCCTG
CGCCTCCGCCCTCAGATCTTCCTCCCCGTCCGGCTCCGACGCCTTCTCCTTCTCCTTCTCCGTCCCCGACTCCTGCTCCTTCGCCTAACAGAGACAGCGATGTCAGTAAG
AGCGTCGCCAGTGGCGGTGAAGTGGAATCGGGAGCAGCCTCCAAAGGCGGGATGAACGGAGGCAAGAAGGCTGGAATTGCATTTGGAGTGATTGCCGCAGTGTGTTTCGT
CGGAATTGGAGGAATCGTGTACAAGAAGCGCCAAAACAACATTCGCCGATCTCAGTACGGGGACGCCGCTAGGTCTTCCTTCCTA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAATCCGTTGCATTCTGTTGCCTTCTTCTGTTCGTTTCGCTGTTGATCAATGTCGCTGTCTCTTTGGAACCACTCGATGTGACTGCGCCGGCGGGTGGTTCGCC
TTCTCCGTCCCCTCAATCTGCCGCCGATTCTCCTCCACTGCCCTCCCCGGCCCCATTCCCCCACGCTCCTTCCGCCTCTCCACCGGAGTCGCCTTTGAACTCTCCTCCTG
CGCCTCCGCCCTCAGATCTTCCTCCCCGTCCGGCTCCGACGCCTTCTCCTTCTCCTTCTCCGTCCCCGACTCCTGCTCCTTCGCCTAACAGAGACAGCGATGTCAGTAAG
AGCGTCGCCAGTGGCGGTGAAGTGGAATCGGGAGCAGCCTCCAAAGGCGGGATGAACGGAGGCAAGAAGGCTGGAATTGCATTTGGAGTGATTGCCGCAGTGTGTTTCGT
CGGAATTGGAGGAATCGTGTACAAGAAGCGCCAAAACAACATTCGCCGATCTCAGTACGGGGACGCCGCTAGGTCTTCCTTCCTA
Protein sequenceShow/hide protein sequence
MAKSVAFCCLLLFVSLLINVAVSLEPLDVTAPAGGSPSPSPQSAADSPPLPSPAPFPHAPSASPPESPLNSPPAPPPSDLPPRPAPTPSPSPSPSPTPAPSPNRDSDVSK
SVASGGEVESGAASKGGMNGGKKAGIAFGVIAAVCFVGIGGIVYKKRQNNIRRSQYGDAARSSFL