; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016443 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016443
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationscaffold1486:240687..241511
RNA-Seq ExpressionMS016443
SyntenyMS016443
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139993.1 uncharacterized protein LOC111010766 [Momordica charantia]2.1e-9197.75Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MAQNYGFLVCILVMVMDAVAGILGIRAEK QNRVVLQSVSVWVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGL F
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
        MILSWITLAIGFSMLMAGTVDNSNWKNSCEISS GLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV

XP_022954638.1 uncharacterized protein LOC111456841 [Cucurbita moschata]1.1e-7684.27Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEK QN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISSHGLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S PHV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV

XP_022994470.1 uncharacterized protein LOC111490180 [Cucurbita maxima]7.9e-7583.52Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEK QN+V LQS S+W YECSRKPRDDAFSQGLAATILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISSHGLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S P
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP

XP_023542772.1 uncharacterized protein LOC111802580 [Cucurbita pepo subsp. pepo]3.2e-7685.23Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MAQNYGFLVCILVMVMDAVAG+L IRAEK QN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP
        MILSWITLAIGFSML+AGTVDNS  KNSCEISSHGLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S P
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP

XP_038894860.1 uncharacterized protein LOC120083260 [Benincasa hispida]4.5e-7885.23Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMV+DAVAGILGI+AEK QNRVVL+SVS+WV  CSRKPRDDAFSQGLAATILLG+AH IAKVLG CICIR+KQHFQES+ANKRLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP
        MILSWITLAIG S+L+AGTVDNS WKNSCEISSHGLFL GGIVCF HGLCTVAYYVSATAA REEQRK     S P
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ0 Uncharacterized protein1.8e-6981.07Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG--RCICIRSKQHFQESSANKRLGL
        M +NYGFLVCILV+V+DAVAG+LGI AEK QNRVVL+S+S+ + ECSRKPRDDAFS+GLAA+ILLGLAH IAKVLG  +CICIR+KQ+ QE SAN+ LG 
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG--RCICIRSKQHFQESSANKRLGL

Query:  LFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR
        LFMILSWITLAIGFS+LMA T+DNS WKNSCEISSHGLFL GGIVCFFHGLCTVAYYVSATAA REEQR
Subjt:  LFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR

A0A1S3B4I4 uncharacterized protein LOC1034857082.1e-6578.11Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG--RCICIRSKQHFQESSANKRLGL
        M +NYGFLVCILVMV+D VAG+LGI AEK QNRVVL+S+S+ V ECSRKPRDDAFS+GLAA ILLGLAH IA VLG  +CI I +KQ+ Q+ SAN+ LGL
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG--RCICIRSKQHFQESSANKRLGL

Query:  LFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR
         FMILSWITL IGFS+LMA T+DNS WKNSCEISSHGLFL GGIVCF HGLCTVAYYVSATAA REEQR
Subjt:  LFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR

A0A6J1CGZ6 uncharacterized protein LOC1110107661.0e-9197.75Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MAQNYGFLVCILVMVMDAVAGILGIRAEK QNRVVLQSVSVWVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGL F
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
        MILSWITLAIGFSMLMAGTVDNSNWKNSCEISS GLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV

A0A6J1GRM8 uncharacterized protein LOC1114568415.4e-7784.27Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEK QN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISSHGLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S PHV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV

A0A6J1K198 uncharacterized protein LOC1114901803.8e-7583.52Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEK QN+V LQS S+W YECSRKPRDDAFSQGLAATILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISSHGLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S P
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11500.1 Protein of unknown function (DUF1218)1.8e-3241.67Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVV----LQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRL
        M    GFLV ++++  D  A +LGI AE  Q++       Q        C R P D AF++G+AA +LL + H +A VLG C  IRSKQ F+ ++ANK L
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVV----LQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRL

Query:  GLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREE-QRKRNENVSS
         + F++LSWI   + +S LM GT+ NS     C +     FL GGI C  HG+ T AYYVSA AAK+E+ +  + EN+++
Subjt:  GLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREE-QRKRNENVSS

AT2G32280.1 Protein of unknown function (DUF1218)1.3e-3544.03Show/hide
Query:  GFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSW
        G LVC++++ +D  A ILGI+AE  QN+V  + + +W++EC R+P  DAF  GL A  +L +AH +  ++G C+CI S+  FQ SS+ +++ +  ++L+W
Subjt:  GFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSW

Query:  ITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKRE
        I  A+GF  ++ GT+ NS  ++SC  + H     GGI+CF H L  VAYYVSATAAK E
Subjt:  ITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKRE

AT4G21310.1 Protein of unknown function (DUF1218)2.9e-4352.73Show/hide
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+N GF +CIL++ MD  AGILGI AE  QN+V  + + +W++EC R P   AF  GLAA ILL LAH  A  LG C+C+ S+Q  ++SSANK+L +  
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREE
        +I +WI LAI FSML+ GT+ NS  + +C IS H +   GGI+CF HGL  VAYY+SATA+ RE+
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREE

AT5G17210.1 Protein of unknown function (DUF1218)1.5e-0724.55Show/hide
Query:  LVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRK---PRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILS
        ++C ++ ++  ++ +    AE T  R+    V+V V +   K   PR  AF+ G  + + L +A  I  V   C C R       S +N  + L+  ++S
Subjt:  LVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRK---PRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILS

Query:  WITLAIGFSMLMAGTVDNSNWKNS--------CEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKR
        W T  I F +L++G   N              C I   G+F  G ++        + YY+  T+ K+
Subjt:  WITLAIGFSMLMAGTVDNSNWKNS--------CEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKR

AT5G17210.2 Protein of unknown function (DUF1218)4.4e-0726.09Show/hide
Query:  VVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNS-----
        +V  +VS  + +C+  PR  AF+ G  + + L +A  I  V   C C R       S +N  + L+  ++SW T  I F +L++G   N           
Subjt:  VVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNS-----

Query:  ---CEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKR
           C I   G+F  G ++        + YY+  T+ K+
Subjt:  ---CEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCAAAACTATGGCTTTCTGGTCTGTATCTTGGTCATGGTAATGGACGCTGTTGCTGGGATACTTGGAATTCGAGCGGAAAAGACTCAGAATCGGGTGGTACTACA
ATCGGTGAGCGTATGGGTGTATGAATGCAGCAGAAAGCCGAGAGATGATGCTTTTAGCCAGGGACTGGCCGCTACAATTCTGCTTGGCCTTGCTCATGCCATTGCTAAAG
TACTTGGTAGGTGCATTTGCATTCGGTCTAAGCAACATTTCCAAGAATCATCTGCTAACAAGCGATTGGGATTGCTCTTCATGATTCTCTCATGGATTACATTGGCTATT
GGGTTCTCAATGTTGATGGCGGGGACGGTGGACAATTCCAACTGGAAGAACTCTTGCGAGATATCAAGCCATGGCCTATTTTTGGCAGGTGGGATTGTGTGTTTCTTTCA
TGGGCTCTGTACTGTCGCTTATTATGTTTCTGCAACAGCAGCTAAGAGAGAAGAACAGAGGAAACGCAATGAAAATGTTTCTTCTCCACATGTT
mRNA sequenceShow/hide mRNA sequence
ATGGCGCAAAACTATGGCTTTCTGGTCTGTATCTTGGTCATGGTAATGGACGCTGTTGCTGGGATACTTGGAATTCGAGCGGAAAAGACTCAGAATCGGGTGGTACTACA
ATCGGTGAGCGTATGGGTGTATGAATGCAGCAGAAAGCCGAGAGATGATGCTTTTAGCCAGGGACTGGCCGCTACAATTCTGCTTGGCCTTGCTCATGCCATTGCTAAAG
TACTTGGTAGGTGCATTTGCATTCGGTCTAAGCAACATTTCCAAGAATCATCTGCTAACAAGCGATTGGGATTGCTCTTCATGATTCTCTCATGGATTACATTGGCTATT
GGGTTCTCAATGTTGATGGCGGGGACGGTGGACAATTCCAACTGGAAGAACTCTTGCGAGATATCAAGCCATGGCCTATTTTTGGCAGGTGGGATTGTGTGTTTCTTTCA
TGGGCTCTGTACTGTCGCTTATTATGTTTCTGCAACAGCAGCTAAGAGAGAAGAACAGAGGAAACGCAATGAAAATGTTTCTTCTCCACATGTT
Protein sequenceShow/hide protein sequence
MAQNYGFLVCILVMVMDAVAGILGIRAEKTQNRVVLQSVSVWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAI
GFSMLMAGTVDNSNWKNSCEISSHGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV