; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009521 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009521
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1677)
Genome locationscaffold813:1875243..1875824
RNA-Seq ExpressionMS009521
SyntenyMS009521
Gene Ontology termsNA
InterPro domainsIPR012876 - Protein of unknown function DUF1677, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051038.1 uncharacterized protein E6C27_scaffold481G00160 [Cucumis melo var. makuwa]2.6e-9591.75Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSS  LQRTISDISMELGKELAIASD KQG LPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+EEALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+RMDGR G RAKSLSPRDKAA QKKGGIARSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

KAE8646534.1 hypothetical protein Csa_016674 [Cucumis sativus]1.5e-9591.75Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSSN LQRTISDISMELGKELAI SD KQGVLPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+E ALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+RMDGR GFRAKSLSPRDKAA QKKGGI RSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

XP_011659684.1 uncharacterized protein LOC105436220 [Cucumis sativus]1.5e-9591.75Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSSN LQRTISDISMELGKELAI SD KQGVLPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+E ALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+RMDGR GFRAKSLSPRDKAA QKKGGI RSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

XP_022147640.1 uncharacterized protein LOC111016511 [Momordica charantia]7.5e-10398.97Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQG LPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIAR+SSCIPAITRSEDITNLKLKN
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

XP_038898439.1 uncharacterized protein LOC120086077 [Benincasa hispida]5.8e-9591.24Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSSN LQRTISDISMELGKELAI SD KQG LPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+E ALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+R DGR GFRAKSLSPRDKAA QKKGGIARSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

TrEMBL top hitse value%identityAlignment
A0A0A0K9S5 Uncharacterized protein7.3e-9691.75Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSSN LQRTISDISMELGKELAI SD KQGVLPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+E ALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+RMDGR GFRAKSLSPRDKAA QKKGGI RSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

A0A1S3CGX8 uncharacterized protein LOC1035007864.8e-9591.24Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSS  LQRTISDISMELGKELA ASD KQG LPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+EEALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+RMDGR G RAKSLSPRDKAA QKKGGIARSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

A0A5D3BVT1 Uncharacterized protein1.3e-9591.75Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSS  LQRTISDISMELGKELAIASD KQG LPPISE+EDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+EEALNAHMSAC RFNKLGRAYPVLFQAEAMREMLKK+RMDGR G RAKSLSPRDKAA QKKGGIARSSSCIPAI RSED+ NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

A0A6J1D0Q4 uncharacterized protein LOC1110165113.6e-10398.97Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQG LPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIAR+SSCIPAITRSEDITNLKLKN
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

A0A6J1KMI4 uncharacterized protein LOC1114946563.1e-9491.75Show/hide
Query:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV
        MAPHGEA+SRSNNFSKPPRLSSNALQRTISDISMEL KEL I SD  QG LPPISE+EDARCECCGMCEEYTQEYID+MRDKFLGKWICGLCAEAVEGEV
Subjt:  MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEV

Query:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN
        EK GGS+E ALNAHMSACARFNKLGRAYPVLFQAEAMREMLKK+RMDGRGGFRAKSLSPRDKAAA KKGGIARSSSCIPAITRSEDI+NLKL+N
Subjt:  EKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72510.1 Protein of unknown function (DUF1677)2.2e-1535.82Show/hide
Query:  EVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGG--SREEALNAHMSACARFNKLG-RAYPVLFQAEAMREMLKKNRMDGRGGF
        E +   C+CCG+ EE TQ YI+ +R++++GKWICGLC+EAV+ EV +     + EEA+  HM+ C +F        P      AMR++L+K+ +D     
Subjt:  EVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGG--SREEALNAHMSACARFNKLG-RAYPVLFQAEAMREMLKKNRMDGRGGF

Query:  RAKSLSP----RDKAAAQKKGGIARSSSCIPAIT
        R+   SP    +D         ++RS SC  ++T
Subjt:  RAKSLSP----RDKAAAQKKGGIARSSSCIPAIT

AT1G79770.1 Protein of unknown function (DUF1677)5.3e-3848.95Show/hide
Query:  MAPHGEAISRSNNFSK--PPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEG
        MA +G   S S +  K  PP++   +L R++SDIS++  +E       ++  L  I EVE A+CECCGM EE T EYI+R+R+KF GKWICGLC+EAV+ 
Subjt:  MAPHGEAISRSNNFSK--PPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEG

Query:  EVEKLGGSREE-----ALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITR
        E +K    REE     AL  HMSAC RFNKLGR YP LFQA+AMR+ML+++        R +S+ P        K  I+R+SSCIPAITR
Subjt:  EVEKLGGSREE-----ALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITR

AT3G22540.1 Protein of unknown function (DUF1677)6.7e-1743.82Show/hide
Query:  EVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGSR-EEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKK
        E+E  RCECCG+ E+ TQ+YI  ++  F  KW+CGLC+EAV  EV +   +  +EA+ AH+S C +F K     P +  A+ MR+ML++
Subjt:  EVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGSR-EEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKK

AT4G14819.1 Protein of unknown function (DUF1677)8.5e-2043.81Show/hide
Query:  EVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAK
        E+E   CECCG+ E+ TQ YI +++  F GKW+CGLC+EAV  E  +   + EEA+NAHMS C +FN    A P    A+ MR+ML++         R+ 
Subjt:  EVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGSREEALNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAK

Query:  SLSPR
         LSP+
Subjt:  SLSPR

AT5G25840.1 Protein of unknown function (DUF1677)1.6e-3952.02Show/hide
Query:  LQRTISDISMELGKELAIASDAKQGV-------LPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGS-------REEA
        LQRTISDIS ++  +  +  +            L  ISEVEDA+CECCGM EE T EYI R+R KF GK ICGLC +AVEGE+EK+  S       REEA
Subjt:  LQRTISDISMELGKELAIASDAKQGV-------LPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGS-------REEA

Query:  LNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITR
        +  HMSAC+RFN+LGR+YPVL+QAEA++EMLKK         R+K +     A   +KGG+ARSSSC+PA+ +
Subjt:  LNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACCACATGGAGAGGCCATTTCAAGGAGCAACAACTTCTCCAAGCCACCAAGGCTGTCATCCAATGCCCTCCAAAGAACAATCTCAGACATCTCCATGGAGTTGGG
CAAGGAGTTGGCGATAGCCTCGGACGCCAAACAAGGAGTGCTTCCACCGATATCCGAGGTCGAGGACGCGCGGTGCGAGTGCTGCGGCATGTGCGAGGAGTACACGCAGG
AGTACATCGATCGGATGCGGGACAAGTTCCTCGGGAAGTGGATATGCGGGCTCTGCGCCGAGGCCGTGGAAGGGGAGGTGGAGAAGCTTGGAGGGAGTAGAGAGGAAGCG
TTGAATGCACACATGAGCGCGTGCGCGAGGTTTAACAAGCTGGGAAGGGCTTATCCAGTGTTGTTTCAGGCTGAGGCAATGAGGGAGATGCTGAAGAAGAATAGGATGGA
TGGCAGAGGAGGCTTTAGAGCCAAGTCTTTGAGTCCTAGAGACAAAGCTGCAGCTCAAAAGAAAGGTGGAATTGCAAGGAGTTCAAGTTGCATTCCAGCCATCACAAGAT
CAGAGGATATTACCAATCTAAAACTCAAAAAC
mRNA sequenceShow/hide mRNA sequence
ATGGCACCACATGGAGAGGCCATTTCAAGGAGCAACAACTTCTCCAAGCCACCAAGGCTGTCATCCAATGCCCTCCAAAGAACAATCTCAGACATCTCCATGGAGTTGGG
CAAGGAGTTGGCGATAGCCTCGGACGCCAAACAAGGAGTGCTTCCACCGATATCCGAGGTCGAGGACGCGCGGTGCGAGTGCTGCGGCATGTGCGAGGAGTACACGCAGG
AGTACATCGATCGGATGCGGGACAAGTTCCTCGGGAAGTGGATATGCGGGCTCTGCGCCGAGGCCGTGGAAGGGGAGGTGGAGAAGCTTGGAGGGAGTAGAGAGGAAGCG
TTGAATGCACACATGAGCGCGTGCGCGAGGTTTAACAAGCTGGGAAGGGCTTATCCAGTGTTGTTTCAGGCTGAGGCAATGAGGGAGATGCTGAAGAAGAATAGGATGGA
TGGCAGAGGAGGCTTTAGAGCCAAGTCTTTGAGTCCTAGAGACAAAGCTGCAGCTCAAAAGAAAGGTGGAATTGCAAGGAGTTCAAGTTGCATTCCAGCCATCACAAGAT
CAGAGGATATTACCAATCTAAAACTCAAAAAC
Protein sequenceShow/hide protein sequence
MAPHGEAISRSNNFSKPPRLSSNALQRTISDISMELGKELAIASDAKQGVLPPISEVEDARCECCGMCEEYTQEYIDRMRDKFLGKWICGLCAEAVEGEVEKLGGSREEA
LNAHMSACARFNKLGRAYPVLFQAEAMREMLKKNRMDGRGGFRAKSLSPRDKAAAQKKGGIARSSSCIPAITRSEDITNLKLKN