; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004901 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004901
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationscaffold176:1357133..1359113
RNA-Seq ExpressionMS004901
SyntenyMS004901
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455918.1 PREDICTED: uncharacterized protein LOC103495987 isoform X2 [Cucumis melo]5.4e-7181.71Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MA K VG  SPG TKVGL FM + +AAYI+ PPLYWHF E LAA SSSS STCPPCFCDCSS TDFA +EDCVKHD GMN ETEK+F ELLSEELKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSR
        EALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLT LWETRA QRGWR DIV SR
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSR

XP_022140315.1 uncharacterized protein LOC111011015 [Momordica charantia]5.0e-9395.77Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLS
        MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE        DCVKHDPGMNRETEKSFVELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
        EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT

XP_022947174.1 uncharacterized protein LOC111451120 isoform X2 [Cucurbita moschata]2.9e-7280.11Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSSSSSTCPPCFCDCSS TDFA ++DCVKHD GMN ETE++F ELLSEELKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA
        EA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA

XP_022971113.1 uncharacterized protein LOC111469880 isoform X2 [Cucurbita maxima]7.1e-7178.98Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSS SSTCPPCFCDCSS TDFA ++DCVKHD GMN ETE+SF ELLSE+LKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA
        +A+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA

XP_023534558.1 uncharacterized protein LOC111796098 isoform X2 [Cucurbita pepo subsp. pepo]2.9e-7280.68Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSSSSSTCPPCFCDCSS TDFA ++DCVKHD GMN ETE+SF ELLSEELKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA
        EA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein9.9e-7176.72Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLS
        MAVK VG CSPG TKVGL  M L +AAYI+ PPLYWHF+E L A SSSS STCPPCFCDCSS TDFA +E        DCVKHD GMN ETEK+F ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
        EELKLREAEALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLTALWETRA QRGWR +IV SR   +G+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT

A0A1S3C2Q4 uncharacterized protein LOC103495987 isoform X22.6e-7181.71Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MA K VG  SPG TKVGL FM + +AAYI+ PPLYWHF E LAA SSSS STCPPCFCDCSS TDFA +EDCVKHD GMN ETEK+F ELLSEELKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSR
        EALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLT LWETRA QRGWR DIV SR
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSR

A0A6J1CER0 uncharacterized protein LOC1110110152.4e-9395.77Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLS
        MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE        DCVKHDPGMNRETEKSFVELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
        EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT

A0A6J1G609 uncharacterized protein LOC111451120 isoform X21.4e-7280.11Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSSSSSTCPPCFCDCSS TDFA ++DCVKHD GMN ETE++F ELLSEELKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA
        EA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA

A0A6J1I5W5 uncharacterized protein LOC111469880 isoform X23.4e-7178.98Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSS SSTCPPCFCDCSS TDFA ++DCVKHD GMN ETE+SF ELLSE+LKLREA
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREA

Query:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA
        +A+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A
Subjt:  EALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)1.1e-5061.59Show/hide
Query:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLSEELKLREAEALE
        A K+GL  +GL +A YI+ PPLYWH  E LAAVS+SS   CP C C+CS+Y+   I +        DC KHDP +N +TEK++ ELL+EELKLREAE+LE
Subjt:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLSEELKLREAEALE

Query:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWR
         H+RAD+ LLEAKK+TS YQKEADKCNSGMETCE AREKAE +L  Q++LT+ WE RA Q+GWR
Subjt:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWR

AT2G32580.1 Protein of unknown function (DUF1068)2.3e-4353.53Show/hide
Query:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAIS--------EDCVKHDPGMNRETEKSFVELLSEELKLREAEALE
        A KVGL  + L +  YI+ PPLYWH  E LA     S+++C  C CDCSS     I          DC K DP +N +TEK++ ELL+EELK REA ++E
Subjt:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAIS--------EDCVKHDPGMNRETEKSFVELLSEELKLREAEALE

Query:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS
         H+R D  LLEAKK+TS YQKEADKCNSGMETCE AREKAE +LV Q++LT++WE RA Q+G++    +S
Subjt:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS

AT2G32580.2 Protein of unknown function (DUF1068)2.0e-3161.54Show/hide
Query:  DCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD
        +C K DP +N +TEK++ ELL+EELK REA ++E H+R D  LLEAKK+TS YQKEADKCNSGMETCE AREKAE +LV Q++LT++WE RA Q+G++  
Subjt:  DCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD

Query:  IVRS
          +S
Subjt:  IVRS

AT4G04360.1 Protein of unknown function (DUF1068)1.7e-3853.57Show/hide
Query:  KVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLSEELKLREAEALESH
        KV    MGL + AYI  P LYWH  E +A    S  S+CPPC CDCSS    +I +        DC++H+ G + E+E SF E+++EELKLREA+A E  
Subjt:  KVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISE--------DCVKHDPGMNRETEKSFVELLSEELKLREAEALESH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS
         RAD  LL+AKK  SQYQKEADKC+ GMETCE AREKAEA+L  Q+RL+ +WE RA Q GW+   V S
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS

AT4G30996.1 Protein of unknown function (DUF1068)9.9e-3145.81Show/hide
Query:  AAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDC----------SSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLE
        A  +  P LYW F +     S+ ++S CPPC CDC              + +I+ DC   DP + +E EK FV+LL+EELKL+EA A E  R  +++L E
Subjt:  AAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDC----------SSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLE

Query:  AKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD
        AK++ SQYQKEA+KCN+  E CE+ARE+AEA L+ ++++T+LWE RA Q GW G+
Subjt:  AKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTGAAGGGTGTGGGCTGGTGCTCTCCAGGAGCGACGAAGGTGGGATTGGGTTTTATGGGTCTTTTTGTAGCAGCTTATATTGTTGCTCCCCCTCTCTACTGGCA
CTTCATCGAGTGCTTGGCCGCCGTCTCTTCTTCCTCTTCCTCCACTTGCCCTCCTTGTTTCTGTGACTGTTCTTCTTACACTGACTTCGCCATTTCTGAAGATTGTGTGA
AACATGACCCTGGTATGAATCGGGAAACAGAAAAGAGTTTTGTGGAGTTGTTGTCGGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAAAGTCACCGTCGCGCTGAC
ATATCTCTGCTGGAAGCGAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGTATGGAAACATGTGAAGCAGCAAGGGAAAAAGCTGAAGCTTC
ATTAGTTTCACAGCAGAGGCTAACAGCATTATGGGAGACAAGGGCTCATCAAAGAGGATGGAGAGGCGACATTGTCAGATCCCGTGCTCTGGCTCGTGGTACA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTGAAGGGTGTGGGCTGGTGCTCTCCAGGAGCGACGAAGGTGGGATTGGGTTTTATGGGTCTTTTTGTAGCAGCTTATATTGTTGCTCCCCCTCTCTACTGGCA
CTTCATCGAGTGCTTGGCCGCCGTCTCTTCTTCCTCTTCCTCCACTTGCCCTCCTTGTTTCTGTGACTGTTCTTCTTACACTGACTTCGCCATTTCTGAAGATTGTGTGA
AACATGACCCTGGTATGAATCGGGAAACAGAAAAGAGTTTTGTGGAGTTGTTGTCGGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAAAGTCACCGTCGCGCTGAC
ATATCTCTGCTGGAAGCGAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGTATGGAAACATGTGAAGCAGCAAGGGAAAAAGCTGAAGCTTC
ATTAGTTTCACAGCAGAGGCTAACAGCATTATGGGAGACAAGGGCTCATCAAAGAGGATGGAGAGGCGACATTGTCAGATCCCGTGCTCTGGCTCGTGGTACA
Protein sequenceShow/hide protein sequence
MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEDCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRAD
ISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT