CuGenDBv2

MC10g1464 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC10g1464
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: PIG-H domain-containing protein
Genome location: MC10:17876803..17879550
RNA-Seq Expression: MC10g1464
Synteny: MC10g1464
Gene Ontology terms:
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0000506 - glycosylphosphatidylinositol-N-acetylglucosaminyltransferase (GPI-GnT) complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0017176 - phosphatidylinositol N-acetylglucosaminyltransferase activity (molecular function)
InterPro domains:
IPR019328 - GPI-GlcNAc transferase complex, PIG-H component, conserved domain
IPR044215 - Phosphatidylinositol N-acetylglucosaminyltransferase subunit H
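The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the following Python splits that string into its parts and computes the span of the gene. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC10:17876803..17879550")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end - start + 1
print(chrom, start, end, span_bp)
```

Under that assumption the locus spans 2748 bp, which covers the full gene model (UTRs and any introns), not only the CDS.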


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022145301.1 uncharacterized protein LOC111014789, partial [Momordica charantia] | e-value: 2.45e-102 | identity: 99.33%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETH
        M EFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETH
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETH

Query:  YRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFK
        YRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFK
Subjt:  YRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFK

XP_022158291.1 uncharacterized protein LOC111024811 isoform X1 [Momordica charantia] | e-value: 3.44e-110 | identity: 90.56%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSITNWRYGYFRDSK PSEAVDIHHVV+R SKGGGKGFLLCIFAALAFCF LLKDQSIFVVFWCLVLNVFFAKKLFR  VEKESV VMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
        HYRSGKVI +FVPVDKLLK VLLECVTPVTC W LSLIIQGEDKLLLVFKELRPPLKM VPIWKALS ATGNNNNR+ACS
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS

XP_022158294.1 uncharacterized protein LOC111024811 isoform X2 [Momordica charantia] | e-value: 6.89e-103 | identity: 90.64%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSITNWRYGYFRDSK PSEAVDIHHVV+R SKGGGKGFLLCIFAALAFCF LLKDQSIFVVFWCLVLNVFFAKKLFR  VEKESV VMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATG
        HYRSGKVI +FVPVDKLLK VLLECVTPVTC W LSLIIQGEDKLLLVFKELRPPLKM VPIWKALS ATG
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATG

XP_022158295.1 uncharacterized protein LOC111024811 isoform X3 [Momordica charantia] | e-value: 3.91e-106 | identity: 88.89%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSITNWRYGYFRDSK PSEAVDIHHVV+R SKGGGKGFLLCIFAALAFCF LLKDQSIFVVFWCLVLNVFFAKKLFR  VEK    VMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
        HYRSGKVI +FVPVDKLLK VLLECVTPVTC W LSLIIQGEDKLLLVFKELRPPLKM VPIWKALS ATGNNNNR+ACS
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS

XP_038876673.1 uncharacterized protein LOC120069067 [Benincasa hispida] | e-value: 4.81e-101 | identity: 81.11%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVV-LRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSIT+WRYGYF D KWPSEAVDIHHVV LR++ G KGFL C FAALAFCF LLK QSIFV  WCLVLNVFF KKLF+ TVEKE+VMV+PNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVV-LRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
        H+RSGKVI RFVPVDK+LKPVLLEC+TPVTC WSLSLI+QGED+LLLVFKELRPP+KMLVPIWKAL TATG++ NR+ACS
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CVY4 uncharacterized protein LOC111014789 | e-value: 1.19e-102 | identity: 99.33%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETH
        M EFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETH
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETH

Query:  YRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFK
        YRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFK
Subjt:  YRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFK

A0A6J1DVF9 uncharacterized protein LOC111024811 isoform X3 | e-value: 1.89e-106 | identity: 88.89%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSITNWRYGYFRDSK PSEAVDIHHVV+R SKGGGKGFLLCIFAALAFCF LLKDQSIFVVFWCLVLNVFFAKKLFR  VEK    VMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
        HYRSGKVI +FVPVDKLLK VLLECVTPVTC W LSLIIQGEDKLLLVFKELRPPLKM VPIWKALS ATGNNNNR+ACS
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS

A0A6J1DVP3 uncharacterized protein LOC111024811 isoform X1 | e-value: 1.67e-110 | identity: 90.56%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSITNWRYGYFRDSK PSEAVDIHHVV+R SKGGGKGFLLCIFAALAFCF LLKDQSIFVVFWCLVLNVFFAKKLFR  VEKESV VMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
        HYRSGKVI +FVPVDKLLK VLLECVTPVTC W LSLIIQGEDKLLLVFKELRPPLKM VPIWKALS ATGNNNNR+ACS
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS

A0A6J1DYZ9 uncharacterized protein LOC111024811 isoform X2 | e-value: 3.34e-103 | identity: 90.64%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSITNWRYGYFRDSK PSEAVDIHHVV+R SKGGGKGFLLCIFAALAFCF LLKDQSIFVVFWCLVLNVFFAKKLFR  VEKESV VMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLR-SKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATG
        HYRSGKVI +FVPVDKLLK VLLECVTPVTC W LSLIIQGEDKLLLVFKELRPPLKM VPIWKALS ATG
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATG

A0A6J1G4Q1 uncharacterized protein LOC111450501 | e-value: 6.21e-96 | identity: 79.44%
Query:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVV-LRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET
        M +FSIT+WRYGYF DSKW SEAVDIHHVV LR++  GKGFL  I AALAFCF +LK QSIFVV WC +LNV FAKKLF+ TV+KESVMVMPNFGVQLET
Subjt:  MVEFSITNWRYGYFRDSKWPSEAVDIHHVV-LRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLET

Query:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
        HY SGKVICRFVPVDK+LKPVLLECVTPVTC WSLSLIIQGEDKLLLVFKELRPP+KMLVPIWKAL        N++ACS
Subjt:  HYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT4G35530.1 phosphatidylinositolglycan-related | e-value: 4.4e-47 | identity: 51.89%
Query:  MVEFSITNWRYGYFRD--SKWPSEAVDIHHVVLRSKGG-------GKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMP
        MV  S++N RY Y  +  SK   EA+DIHHV++    G       G GF L +F A +  FLL KD     + W  +L+ F      R+ V+KESV+++P
Subjt:  MVEFSITNWRYGYFRD--SKWPSEAVDIHHVVLRSKGG-------GKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMP

Query:  NFGVQLETHYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNRE
         FG+QLET Y SGK + RF+P+DK+LKPVL+ECVTP+TC WSLSL ++GE++L LVFKELRPPLKMLVPIWKAL  A G ++  E
Subjt:  NFGVQLETHYRSGKVICRFVPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNRE
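The % identity values listed with the hits can be related to the alignment blocks. In these blocks the middle line repeats a residue where query and subject are identical, shows `+` for a similar residue, and is blank for a mismatch or gap. The sketch below, with the hypothetical helper `percent_identity`, counts identical columns from a query line and its midline. It assumes identity is taken over all alignment columns, gaps included, and is an illustration rather than the database's own calculation.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue."""
    # Trailing spaces on the midline are often stripped; pad to the query length.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return round(100.0 * identical / len(query), 2)


# Short fragment taken from the start of the first GenBank alignment block.
print(percent_identity("MVEFSITNW", "M EFSITNW"))
```

For the first GenBank hit, the midline differs from the query at one of 149 aligned positions, which is consistent with the listed 99.33%.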


Sequences
CDS sequence
ATGGTCGAGTTTTCCATTACGAATTGGAGATACGGATACTTCCGCGACAGTAAATGGCCTTCTGAAGCAGTTGATATCCACCACGTTGTTCTTAGGAGTAAGGGTGGTGG
AAAGGGTTTCCTCTTGTGTATTTTTGCTGCCCTTGCGTTCTGTTTTTTGCTACTCAAGGACCAATCAATCTTTGTCGTCTTTTGGTGCTTGGTCTTGAATGTATTCTTTG
CCAAGAAGTTGTTCAGGGAAACGGTTGAAAAAGAGTCTGTTATGGTTATGCCAAATTTTGGAGTCCAACTTGAAACTCACTATAGGAGTGGAAAAGTCATCTGTCGGTTT
GTTCCTGTCGATAAACTTCTAAAACCAGTTCTCCTTGAATGCGTGACTCCAGTTACTTGTGACTGGAGCTTGTCTTTGATTATACAAGGGGAAGACAAGCTATTGTTGGT
TTTCAAGGAATTACGCCCACCACTGAAAATGTTGGTCCCTATCTGGAAGGCTTTATCTACTGCTACTGGTAATAATAATAACAGAGAAGCTTGTTCGTAA
mRNA sequence
GGGGTAATAGTAGTATAACTTTAGCTATTTTAGTGGGAGGGCATATGTGTCCACATTAAATTCTAGCGTTGAACTAAAACTCTCTATTTAATATAATTGTTCCTCTAGTT
GTCCCAATAAAAACACTTACCTAATAGACCAACCAAGACTAAAGTGAGCATAACTGACTGATAATTGATATATATCTCCAACCGAGAGGTTGTAAGTTCAATCCCCCAAT
CGTTATGAAAGAGAGAGAGCTGGAGTACGGGCTGATGTGTTCTAAAGCCCAAAGAGTGCGATGAAGTTGGGTGGGCTTGGAACTGGGCCTGCCGTATCGGCTCCGATGAC
GAAGAGGGCTTTCTCATAAGGCCAGCGGCGGAGGCGAAGGCGGAGGCCGAGCCATCACTATTGCGGTGAAATGCGACTCAATCTTCTCGAGGTCAGACGATCGTATATTG
AAGCTCAAATGGCGATCAATTCTGAAACGCCAATTGTTACACTGCACTGTGGAAGATTGTTTGTAGGTTATACTTATTAAATCCTTCAGGTACTGCTGATCCGAAACTTC
TCCCAGAGTAATGCTGATTATCTGAGAAAATTTAAGGGCACACGTTGGAGAATGCCAAAATGGTCGAGTTTTCCATTACGAATTGGAGATACGGATACTTCCGCGACAGT
AAATGGCCTTCTGAAGCAGTTGATATCCACCACGTTGTTCTTAGGAGTAAGGGTGGTGGAAAGGGTTTCCTCTTGTGTATTTTTGCTGCCCTTGCGTTCTGTTTTTTGCT
ACTCAAGGACCAATCAATCTTTGTCGTCTTTTGGTGCTTGGTCTTGAATGTATTCTTTGCCAAGAAGTTGTTCAGGGAAACGGTTGAAAAAGAGTCTGTTATGGTTATGC
CAAATTTTGGAGTCCAACTTGAAACTCACTATAGGAGTGGAAAAGTCATCTGTCGGTTTGTTCCTGTCGATAAACTTCTAAAACCAGTTCTCCTTGAATGCGTGACTCCA
GTTACTTGTGACTGGAGCTTGTCTTTGATTATACAAGGGGAAGACAAGCTATTGTTGGTTTTCAAGGAATTACGCCCACCACTGAAAATGTTGGTCCCTATCTGGAAGGC
TTTATCTACTGCTACTGGTAATAATAATAACAGAGAAGCTTGTTCGTAAGGCCATCGGCAATTAACTCTAGTTGCTGTGTATGATCAATCTTATGATTTTTGCTGCCTCA
AATCCTACCCGCGGTTGTGTGTGTATATCAATGTTTAAATTTTAGTTGTAGCCATTCTCCTTTCTTTTTTTTTTCTTTTTCTTTTTCTCTCTTCCATTAATGTTTGTTTG
ATGCTATCTCACAGGTACACCCTATTGTTCTGGTCTTGTCTGAGAACATTTTTGTTTGAAAGTATGACCTATAGAGAAAAGATAATTGAAGTTTGGAACAAGCACACAAC
AAGAATTCAAGAACTTTTTATATCTGAT
Protein sequence
MVEFSITNWRYGYFRDSKWPSEAVDIHHVVLRSKGGGKGFLLCIFAALAFCFLLLKDQSIFVVFWCLVLNVFFAKKLFRETVEKESVMVMPNFGVQLETHYRSGKVICRF
VPVDKLLKPVLLECVTPVTCDWSLSLIIQGEDKLLLVFKELRPPLKMLVPIWKALSTATGNNNNREACS
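
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not state which table was used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGGTCGAGTTTTCCATTACGAATTGG"))  # MVEFSITNW
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence is a quick consistency check for the record.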