CuGenDBv2

MS004863 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004863
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF1218)
Genome location: scaffold176:1077539..1078175
RNA-Seq Expression: MS004863
Synteny: MS004863
Gene Ontology terms:
GO:0009855 - determination of bilateral symmetry (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
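
The genome location above follows a seqid:start..end pattern. A minimal Python sketch for splitting such a string is given here; parse_location is a hypothetical helper name, and treating the coordinates as 1-based and inclusive is an assumption, since the record does not state its convention.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


seqid, start, end = parse_location("scaffold176:1077539..1078175")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 637 positions on scaffold176.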


Homology
GenBank top hits    e value    %identity    Alignment
XP_004151872.1 uncharacterized protein LOC101212188 [Cucumis sativus]    1.2e-74    90.45
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        K+VGVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+E++KSPPNRQ+S+ACLIFTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIG + NNKSRASCGFTHHHFLSIGGILCFVH LFCVAYYV+ATA E
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

XP_008455827.1 PREDICTED: uncharacterized protein LOC103495927 [Cucumis melo]    1.6e-74    89.81
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        K+VGVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+E++KSPPNRQ+S+ACL+FTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIG + NNKSRASCGFTHHHFLSIGGILCFVH LFCVAYYV+ATA E
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

XP_022140601.1 uncharacterized protein LOC111011214 isoform X1 [Momordica charantia]    5.5e-80    100
Query:  VVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWI
        VVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWI
Subjt:  VVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWI

Query:  ILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        ILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
Subjt:  ILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

XP_022140603.1 uncharacterized protein LOC111011214 isoform X2 [Momordica charantia]    1.1e-80    100
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

XP_038902419.1 uncharacterized protein LOC120089064 [Benincasa hispida]    1.2e-74    90.45
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        ++VGVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEE++KSPPN+QLS+ACLIFTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIG L NNKSRA+CGFTHHHFLSIGGILCFVH LFCVAYYV+ATA E
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

TrEMBL top hits    e value    %identity    Alignment
A0A0A0LR43 Uncharacterized protein    5.8e-75    90.45
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        K+VGVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+E++KSPPNRQ+S+ACLIFTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIG + NNKSRASCGFTHHHFLSIGGILCFVH LFCVAYYV+ATA E
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

A0A1S3C1S7 uncharacterized protein LOC103495927    7.5e-75    89.81
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        K+VGVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+E++KSPPNRQ+S+ACL+FTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIG + NNKSRASCGFTHHHFLSIGGILCFVH LFCVAYYV+ATA E
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

A0A6J1CGJ3 uncharacterized protein LOC111011214 isoform X1    2.7e-80    100
Query:  VVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWI
        VVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWI
Subjt:  VVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWI

Query:  ILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        ILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
Subjt:  ILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

A0A6J1CID0 uncharacterized protein LOC111011214 isoform X2    5.4e-81    100
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

A0A6J1FUW3 uncharacterized protein LOC111449022    5.1e-71    84.71
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        K+VGVL+CLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PS QA++LGL AAG+LGLAH+IANLLGGCNCICSQE ++KSPPN+Q+S+ACL+FTW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
        IILAV MSMLVIG L NNKSRASCGFTHHHFLSIGGILCFVH LFCVAYYV+ATA E
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT1G05291.1 Protein of unknown function (DUF1218)    4.3e-22    38.56
Query:  VLICL-LVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNR---QLSLACLIFTW
        +++C+ L V +DIVAG +G++A+ AQ  VK  +L   EC+ PS+ AF LG+ A   L  AH+ AN++ GC+     + +   P N+     ++ACL   W
Subjt:  VLICL-LVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNR---QLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSA
        ++   G  +L  G  SN +SR  C FT++H  SIGG +CF+HA+    YY+S+
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSA

AT1G11500.1 Protein of unknown function (DUF1218)    3.2e-25    40.62
Query:  VGVLICLLVVAMDIVAGLLGIEAEIAQNKV------KQLRLWIFEC-REPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLAC
        +G L+ ++++  DI A +LGIEAEIAQ+K       +  R     C R PS+ AF  G+ A  LL + H++AN+LGGC  I S+++ +++  N+ L++A 
Subjt:  VGVLICLLVVAMDIVAGLLGIEAEIAQNKV------KQLRLWIFEC-REPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLAC

Query:  LIFTWIILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATA
        L+ +WI   V  S L+IGTL+N+++   C   H  F  IGGI C  H +   AYYVSA A
Subjt:  LIFTWIILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATA

AT2G32280.1 Protein of unknown function (DUF1218)    1.9e-62    67.74
Query:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW
        K+ G+L+CL++V +D+ A +LGI+AE+AQN+VK +RLW+FECREPS+ AF+LGLGAA +L +AH++ NL+GGC CICSQ+E Q+S   RQ+S+ACL+ TW
Subjt:  KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTW

Query:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATA
        I+ AVG   +VIGT+SN+KSR+SCGFTHHHFLSIGGILCF+HALFCVAYYVSATA
Subjt:  IILAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATA

AT4G21310.1 Protein of unknown function (DUF1218)    1.1e-54    64.71
Query:  VGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWII
        VG  IC+L++AMD+ AG+LGIEAEIAQNKVK L++WIFECR+PS  AFK GL A  LL LAH+ AN LGGC C+ S+++++KS  N+QL++A LIFTWII
Subjt:  VGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWII

Query:  LAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATA
        LA+  SML++GT++N++SR +CG +HH  LSIGGILCFVH LF VAYY+SATA
Subjt:  LAVGMSMLVIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATA
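
The %identity values listed with the hits can be related to the alignment midlines, where a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The Python sketch below counts midline letters over the aligned columns of one block; percent_identity is a hypothetical helper, and dividing by the full alignment length (gap columns included) is an assumption about how the page computes the figure. A hand count of the first GenBank hit gives 142 letters over 157 columns, which rounds to the reported 90.45.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment block, from its query and midline rows."""
    # Trailing spaces of a midline are easily lost when text is copied, so pad it
    # back to the width of the query row before counting.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    # Letters in the midline mark identical positions; '+' and ' ' do not count.
    identities = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identities / len(query), 2)


# Toy block for illustration: 5 identical positions out of 7 columns.
print(percent_identity("KVVGVLI", "K+VGV I"))  # 71.43
```

For a hit split across several blocks, the query rows and the midlines would be concatenated (with the leading "Query:" label and indentation stripped) before calling the function.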


Sequences
CDS sequence
AAGGTTGTTGGAGTGTTGATTTGCCTGCTGGTGGTGGCCATGGATATTGTGGCCGGCCTACTCGGCATCGAAGCCGAAATTGCACAGAACAAGGTTAAGCAGCTGCGGCT
GTGGATATTCGAATGCAGAGAGCCAAGTGAGCAGGCTTTCAAGCTGGGATTAGGGGCAGCGGGACTGCTGGGATTGGCCCACATAATTGCTAATCTGCTGGGCGGCTGCA
ACTGCATTTGCTCTCAAGAAGAGATCCAAAAGTCTCCCCCTAACAGGCAACTCTCCCTCGCATGCCTCATCTTCACATGGATAATTCTAGCGGTGGGGATGTCGATGCTG
GTGATTGGGACATTGTCGAACAACAAATCCAGAGCATCCTGTGGATTCACACACCATCACTTTCTGTCAATCGGAGGGATTTTGTGCTTTGTTCATGCCTTGTTTTGTGT
TGCTTATTATGTTTCTGCCACTGCTGATGAG
mRNA sequence
AAGGTTGTTGGAGTGTTGATTTGCCTGCTGGTGGTGGCCATGGATATTGTGGCCGGCCTACTCGGCATCGAAGCCGAAATTGCACAGAACAAGGTTAAGCAGCTGCGGCT
GTGGATATTCGAATGCAGAGAGCCAAGTGAGCAGGCTTTCAAGCTGGGATTAGGGGCAGCGGGACTGCTGGGATTGGCCCACATAATTGCTAATCTGCTGGGCGGCTGCA
ACTGCATTTGCTCTCAAGAAGAGATCCAAAAGTCTCCCCCTAACAGGCAACTCTCCCTCGCATGCCTCATCTTCACATGGATAATTCTAGCGGTGGGGATGTCGATGCTG
GTGATTGGGACATTGTCGAACAACAAATCCAGAGCATCCTGTGGATTCACACACCATCACTTTCTGTCAATCGGAGGGATTTTGTGCTTTGTTCATGCCTTGTTTTGTGT
TGCTTATTATGTTTCTGCCACTGCTGATGAG
Protein sequence
KVVGVLICLLVVAMDIVAGLLGIEAEIAQNKVKQLRLWIFECREPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEEIQKSPPNRQLSLACLIFTWIILAVGMSML
VIGTLSNNKSRASCGFTHHHFLSIGGILCFVHALFCVAYYVSATADE
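
The protein sequence corresponds to a codon-by-codon reading of the CDS. As a rough check, the Python sketch below translates a nucleotide string with the standard genetic code; translate is a hypothetical helper, and the use of the standard (nuclear) code is an assumption. Only the first eight and last three codons of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; line breaks and other whitespace are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("AAGGTTGTTGGAGTGTTGATTTGC"))  # first eight codons of the CDS: KVVGVLIC
print(translate("GCTGATGAG"))                 # last three codons of the CDS: ADE
```

Both fragments agree with the start and end of the listed protein. The CDS as listed begins with AAG rather than ATG and does not end in a stop codon, which may indicate a partial gene model.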