; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS025976 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS025976
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationscaffold35:2092002..2092767
RNA-Seq ExpressionMS025976
SyntenyMS025976
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647674.1 hypothetical protein Csa_003691 [Cucumis sativus]2.0e-8185Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M+KTYG+VVCIL+I++DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC  VRSA +YKGL+ANKQ AV SL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIAL++GFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCF+HGLF VAYY++ TAG RE+TKPPP  QGNPAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

XP_004134744.2 uncharacterized protein LOC101214369 [Cucumis sativus]2.0e-8185Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M+KTYG+VVCIL+I++DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC  VRSA +YKGL+ANKQ AV SL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIAL++GFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCF+HGLF VAYY++ TAG RE+TKPPP  QGNPAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

XP_008439942.1 PREDICTED: uncharacterized protein LOC103484578 [Cucumis melo]6.3e-8083.89Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M+KTYG+VVCIL+I++DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC  VRSA +YKGL+ANKQ A  SL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIAL++GFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCF+HGLF VAYY++ TAG RE+TKPPP  QGNPA ATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

XP_022142361.1 uncharacterized protein LOC111012499 [Momordica charantia]1.8e-95100Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

XP_022950216.1 uncharacterized protein LOC111453374 [Cucurbita moschata]6.3e-8085.56Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        MQK  G VVCILVIVMDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANL GGC  VRSA EYKGLT+NKQ A  SLVF
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIA+++GFSLLISGAMYNTRSRKSCGLAHN LLSIGGI CF+HGLF VAYY++ATAG REET+PPP  QG+PAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

TrEMBL top hitse value%identityAlignment
A0A0A0KLA9 Uncharacterized protein9.5e-8285Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M+KTYG+VVCIL+I++DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC  VRSA +YKGL+ANKQ AV SL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIAL++GFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCF+HGLF VAYY++ TAG RE+TKPPP  QGNPAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

A0A1S3AZY5 uncharacterized protein LOC1034845783.1e-8083.89Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M+KTYG+VVCIL+I++DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC  VRSA +YKGL+ANKQ A  SL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIAL++GFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCF+HGLF VAYY++ TAG RE+TKPPP  QGNPA ATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

A0A5D3CMA3 Uncharacterized protein3.1e-8083.89Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M+KTYG+VVCIL+I++DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC  VRSA +YKGL+ANKQ A  SL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIAL++GFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCF+HGLF VAYY++ TAG RE+TKPPP  QGNPA ATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

A0A6J1CLY7 uncharacterized protein LOC1110124998.9e-96100Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

A0A6J1GE80 uncharacterized protein LOC1114533743.1e-8085.56Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        MQK  G VVCILVIVMDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANL GGC  VRSA EYKGLT+NKQ A  SLVF
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV
        AWIA+++GFSLLISGAMYNTRSRKSCGLAHN LLSIGGI CF+HGLF VAYY++ATAG REET+PPP  QG+PAGATGHV
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)7.9e-2035.47Show/hide
Query:  LVVCILVIV-MDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANK---QFAVASLVFAW
        ++VCI++ V +D+ AG +G+QA+ AQ  V H K+   ECK PS  AF LG+ A   L  AH  AN+  GC           L  NK    F +A L   W
Subjt:  LVVCILVIV-MDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANK---QFAVASLVFAW

Query:  IALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPA
        +  + G  +L +G   NT SR  C   +N + SIGG VCF+H +    YYI++            P +  P+
Subjt:  IALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPA

AT1G11500.1 Protein of unknown function (DUF1218)3.2e-2940.24Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFE------C-KDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQF
        M+   G +V +++I  D+TA +LGI+AEIAQ+K  H             C + PS  AF  G+AA +LL + H +AN+ GGC  +RS  ++K  TANK  
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFE------C-KDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQF

Query:  AVASLVFAWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREE
        AVA LV +WI  ++ +S L+ G + N+R+ + C L H     IGGI C  HG+   AYY++A A  +E+
Subjt:  AVASLVFAWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREE

AT2G32280.1 Protein of unknown function (DUF1218)7.5e-4750Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M K  G++VC++++ +DV A ILGIQAE+AQN+V H ++W+FEC++PS +AF+LGL AA +L +AH + NL GGC C+ S DE++  ++ +Q ++A LV 
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETK
         WI   +GF  ++ G M N++SR SCG  H+  LSIGGI+CF+H LF VAYY++ATA  ++E K
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETK

AT4G21310.1 Protein of unknown function (DUF1218)8.6e-5965.03Show/hide
Query:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF
        M +  G  +CIL++ MDV+AGILGI+AEIAQNKV H KMWIFEC+DPSY AFK GLAA ILL LAH  AN  GGC CV S  + +  +ANKQ AVASL+F
Subjt:  MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVF

Query:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREET
         WI L + FS+LI G M N+RSRK+CG++H+R+LSIGGI+CF+HGLF VAYYI+ATA  RE+T
Subjt:  AWIALLLGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAAACTTATGGTCTTGTAGTCTGCATCTTGGTTATTGTTATGGATGTTACAGCCGGAATACTTGGAATTCAGGCTGAGATAGCTCAAAACAAGGTGAACCATTT
CAAAATGTGGATATTTGAATGTAAAGACCCCAGCTACAATGCTTTCAAGTTGGGCTTGGCTGCAGCCATACTGCTTGGCCTTGCTCATGCCATTGCCAACTTGTTTGGTG
GATGCTTCTGCGTTCGATCGGCTGATGAATACAAAGGATTAACAGCTAACAAGCAATTTGCTGTAGCCTCGCTCGTCTTTGCATGGATTGCGCTTCTGCTCGGATTCTCA
TTGCTTATCTCAGGAGCAATGTATAACACAAGGTCGAGAAAATCGTGCGGTCTAGCTCACAATCGTCTTCTGTCTATAGGGGGGATTGTTTGCTTCATACACGGCTTGTT
TGTAGTTGCTTATTATATTACTGCCACGGCAGGACTCAGGGAGGAGACAAAGCCACCGCCGCCACAACAGGGAAACCCTGCTGGAGCTACAGGCCACGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAAACTTATGGTCTTGTAGTCTGCATCTTGGTTATTGTTATGGATGTTACAGCCGGAATACTTGGAATTCAGGCTGAGATAGCTCAAAACAAGGTGAACCATTT
CAAAATGTGGATATTTGAATGTAAAGACCCCAGCTACAATGCTTTCAAGTTGGGCTTGGCTGCAGCCATACTGCTTGGCCTTGCTCATGCCATTGCCAACTTGTTTGGTG
GATGCTTCTGCGTTCGATCGGCTGATGAATACAAAGGATTAACAGCTAACAAGCAATTTGCTGTAGCCTCGCTCGTCTTTGCATGGATTGCGCTTCTGCTCGGATTCTCA
TTGCTTATCTCAGGAGCAATGTATAACACAAGGTCGAGAAAATCGTGCGGTCTAGCTCACAATCGTCTTCTGTCTATAGGGGGGATTGTTTGCTTCATACACGGCTTGTT
TGTAGTTGCTTATTATATTACTGCCACGGCAGGACTCAGGGAGGAGACAAAGCCACCGCCGCCACAACAGGGAAACCCTGCTGGAGCTACAGGCCACGTTTAG
Protein sequenceShow/hide protein sequence
MQKTYGLVVCILVIVMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLGLAHAIANLFGGCFCVRSADEYKGLTANKQFAVASLVFAWIALLLGFS
LLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFIHGLFVVAYYITATAGLREETKPPPPQQGNPAGATGHV