CuGenDBv2

MS000233 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS000233
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: classical arabinogalactan protein 4
Genome location: scaffold44:49752..50303
RNA-Seq Expression: MS000233
Synteny: MS000233
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576849.1 hypothetical protein SDJN03_24423, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.5e-39; identity: 66.15%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP
        MASF VLNV+TAALLL+SAAANSPLPSPAPSP+SPPW+WTP TESPSSPP  E P PS +PP LSPVPSS  PT PP ANP           PTLSPA  
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP

Query:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA
        PPA K+P HAPSPSK K  APAP+     P +APKSS  P +SPP+P G + PP P+ APT P ADNGGA NRF I G +VA GLMAAAL+A
Subjt:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA

KAG7014869.1 hypothetical protein SDJN02_22499, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.9e-39; identity: 66.15%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP
        MASF VLNV+TAALLL+SAAANSPLPSPAPSP+SPPW+WTP TESPSSPP  E P PS +PP LSPVPSS  PT PP ANP           PTLSPA  
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP

Query:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA
        PPA K+P HAPSPSK K  APAP+     P +APKSS  P +SPP+P G + PP P+ APT P ADNGGA NRF I G +VA GLMAAAL+A
Subjt:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA

XP_022141102.1 vegetative cell wall protein gp1-like [Momordica charantia]; e value: 5.0e-80; identity: 96.74%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATPP
        MASFTVLNVLTAALLLVSA ANSPLPSPAPSPDSPPWQW PGTESPSSPPTAEAPPSENPP LSPVPSSH PTPP AANPPEVSPVPTSHSPTLSPATPP
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATPP

Query:  PAAKTPGHAPSPSKMKPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA
        PAAKTPGHAPSPSK KPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA
Subjt:  PAAKTPGHAPSPSKMKPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA

XP_022922559.1 extensin-like [Cucurbita moschata]; e value: 2.5e-39; identity: 65.62%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP
        MASF VLNV+TAALLL+SA+ANSPLPSPAPSP+SPPW+WTP T+SPSSPP  E P PS +PP LSPVPSS  PT PP ANP           PTLSPA  
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP

Query:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA
        PPA K+P HAPSPSK K  APAP+     P +APKSS AP +SPP+P G + PP P+ APT P A+NGGA NRF I G TVA GLMAAAL+A
Subjt:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA

XP_022985113.1 extensin-like [Cucurbita maxima]; e value: 6.8e-37; identity: 65.1%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP
        MASF VLNV+TAALLL+SA+ANSPLPS APSP+S PW+WTP TESPSSPP  E P PS +PP LSPVPSS  PT P  ANP           PTLSPA  
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP

Query:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAPA-SPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA
        PPA K+P HAPSPSK K  APAP+     P +APKSS APA SPP+P G + PP P+ APT P ADNGGA NRF IS  +VA GLMAAAL+A
Subjt:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAPA-SPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CHW9 classical arabinogalactan protein 4; e value: 5.4e-32; identity: 55.67%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP------------PSENPPALSPVPSSHAPTPPPAANPPEVSPVPT
        MAS T+LN+LT A LL+SAAANSP PSPAPS +SP W+WTP  + PSSPPTAE P            P   PP LSPVPSS++PT PP ANP        
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP------------PSENPPALSPVPSSHAPTPPPAANPPEVSPVPT

Query:  SHSPTLSPATPPPAAKTPGHAPSPSKMKPAAPA-----PAPTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPTPPAD-NGGAINRFTISGCTVAVGLMAAA
           PTLSP  PPPA  +P HAPSP+K K  APA     PAP KAPKSSKAP +SPPSP G + PP P+ AP P  + NG   NRF I G ++A G M AA
Subjt:  SHSPTLSPATPPPAAKTPGHAPSPSKMKPAAPA-----PAPTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPTPPAD-NGGAINRFTISGCTVAVGLMAAA

Query:  LMA
         +A
Subjt:  LMA

A0A5A7V3Q7 Classical arabinogalactan protein 4; e value: 5.4e-32; identity: 55.67%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP------------PSENPPALSPVPSSHAPTPPPAANPPEVSPVPT
        MAS T+LN+LT A LL+SAAANSP PSPAPS +SP W+WTP  + PSSPPTAE P            P   PP LSPVPSS++PT PP ANP        
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP------------PSENPPALSPVPSSHAPTPPPAANPPEVSPVPT

Query:  SHSPTLSPATPPPAAKTPGHAPSPSKMKPAAPA-----PAPTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPTPPAD-NGGAINRFTISGCTVAVGLMAAA
           PTLSP  PPPA  +P HAPSP+K K  APA     PAP KAPKSSKAP +SPPSP G + PP P+ AP P  + NG   NRF I G ++A G M AA
Subjt:  SHSPTLSPATPPPAAKTPGHAPSPSKMKPAAPA-----PAPTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPTPPAD-NGGAINRFTISGCTVAVGLMAAA

Query:  LMA
         +A
Subjt:  LMA

A0A6J1CIX2 vegetative cell wall protein gp1-like; e value: 2.4e-80; identity: 96.74%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATPP
        MASFTVLNVLTAALLLVSA ANSPLPSPAPSPDSPPWQW PGTESPSSPPTAEAPPSENPP LSPVPSSH PTPP AANPPEVSPVPTSHSPTLSPATPP
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATPP

Query:  PAAKTPGHAPSPSKMKPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA
        PAAKTPGHAPSPSK KPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA
Subjt:  PAAKTPGHAPSPSKMKPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA

A0A6J1E3R0 extensin-like; e value: 1.2e-39; identity: 65.62%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP
        MASF VLNV+TAALLL+SA+ANSPLPSPAPSP+SPPW+WTP T+SPSSPP  E P PS +PP LSPVPSS  PT PP ANP           PTLSPA  
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP

Query:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA
        PPA K+P HAPSPSK K  APAP+     P +APKSS AP +SPP+P G + PP P+ APT P A+NGGA NRF I G TVA GLMAAAL+A
Subjt:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAP-ASPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA

A0A6J1JCM0 extensin-like; e value: 3.3e-37; identity: 65.1%
Query:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP
        MASF VLNV+TAALLL+SA+ANSPLPS APSP+S PW+WTP TESPSSPP  E P PS +PP LSPVPSS  PT P  ANP           PTLSPA  
Subjt:  MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAP-PSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATP

Query:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAPA-SPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA
        PPA K+P HAPSPSK K  APAP+     P +APKSS APA SPP+P G + PP P+ APT P ADNGGA NRF IS  +VA GLMAAAL+A
Subjt:  PPAAKTPGHAPSPSKMKPAAPAPA-----PTKAPKSSKAPA-SPPSPYGGLMPPAPAPAPT-PPADNGGAINRFTISGCTVAVGLMAAALMA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G68725.1 arabinogalactan protein 19; e value: 1.6e-04; identity: 40.83%
Query:  VSAA---ANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAAN---PPEVSPVPTSHSPTLSPATPPPA--AKTPGH
        VSAA   A+   P PA +P SPP        SP++PP     P ++PPA +P  S    +PPPA     P   SP P   SP  +PA+PPPA  +  P  
Subjt:  VSAA---ANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAAN---PPEVSPVPTSHSPTLSPATPPPA--AKTPGH

Query:  APSPSKMKPAAPAPAPTK------------APKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINR
        APSP  + P APAPAPTK            AP  +  P SPPSP     P   APAP+P  + G A+N+
Subjt:  APSPSKMKPAAPAPAPTK------------APKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINR


Sequences
CDS sequence
ATGGCTTCTTTCACCGTTCTGAACGTTTTGACGGCGGCGCTTCTGCTCGTTTCCGCCGCCGCCAACTCCCCGCTGCCGTCTCCCGCTCCGAGCCCCGACTCGCCGCCGTG
GCAATGGACTCCCGGTACCGAGTCGCCTTCCTCTCCACCAACAGCGGAGGCACCGCCGTCGGAAAACCCACCGGCGCTCAGCCCCGTTCCGTCATCCCATGCGCCGACTC
CGCCGCCGGCAGCCAACCCACCGGAAGTAAGCCCAGTTCCGACGTCCCATTCACCGACTTTGTCTCCGGCCACTCCGCCACCGGCGGCGAAGACTCCCGGTCACGCACCT
TCGCCGTCGAAAATGAAACCAGCAGCTCCGGCTCCGGCGCCAACGAAGGCTCCAAAATCCTCTAAAGCTCCGGCGAGTCCTCCGTCGCCCTATGGAGGGTTAATGCCCCC
AGCGCCAGCGCCAGCGCCGACGCCACCGGCAGACAACGGCGGCGCTATAAACAGATTCACAATTTCTGGGTGTACTGTTGCAGTTGGATTAATGGCGGCGGCTTTAATGG
CC
mRNA sequence
ATGGCTTCTTTCACCGTTCTGAACGTTTTGACGGCGGCGCTTCTGCTCGTTTCCGCCGCCGCCAACTCCCCGCTGCCGTCTCCCGCTCCGAGCCCCGACTCGCCGCCGTG
GCAATGGACTCCCGGTACCGAGTCGCCTTCCTCTCCACCAACAGCGGAGGCACCGCCGTCGGAAAACCCACCGGCGCTCAGCCCCGTTCCGTCATCCCATGCGCCGACTC
CGCCGCCGGCAGCCAACCCACCGGAAGTAAGCCCAGTTCCGACGTCCCATTCACCGACTTTGTCTCCGGCCACTCCGCCACCGGCGGCGAAGACTCCCGGTCACGCACCT
TCGCCGTCGAAAATGAAACCAGCAGCTCCGGCTCCGGCGCCAACGAAGGCTCCAAAATCCTCTAAAGCTCCGGCGAGTCCTCCGTCGCCCTATGGAGGGTTAATGCCCCC
AGCGCCAGCGCCAGCGCCGACGCCACCGGCAGACAACGGCGGCGCTATAAACAGATTCACAATTTCTGGGTGTACTGTTGCAGTTGGATTAATGGCGGCGGCTTTAATGG
CC
Protein sequence
MASFTVLNVLTAALLLVSAAANSPLPSPAPSPDSPPWQWTPGTESPSSPPTAEAPPSENPPALSPVPSSHAPTPPPAANPPEVSPVPTSHSPTLSPATPPPAAKTPGHAP
SPSKMKPAAPAPAPTKAPKSSKAPASPPSPYGGLMPPAPAPAPTPPADNGGAINRFTISGCTVAVGLMAAALMA