; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0482 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0482
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationMC04:4036179..4036883
RNA-Seq ExpressionMC04g0482
SyntenyMC04g0482
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011694.1 hypothetical protein SDJN02_26600, partial [Cucurbita argyrosperma subsp. argyrosperma]3.40e-7967.09Show/hide
Query:  IRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEER
        I PPY WST R AVVHTL +L SN+I+TI+GEV+C++C+R+YE+EYD+VSKF EI  FVE   +SF DRAP EWM PNYP C+FCG E G +PV+P E  
Subjt:  IRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEER

Query:  RINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
        +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFS
Subjt:  RINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]4.02e-9364.26Show/hide
Query:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN
        LSLRPP  +  S   PSPP         H  RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L+S+
Subjt:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN

Query:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF
        +I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHLKYF
Subjt:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF

Query:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        C  T+NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]1.33e-9062.13Show/hide
Query:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN
        LSLRPP  + SS  + +P          H  R N++T+ R++R+ G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ LKSN
Subjt:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN

Query:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF
        +I+ I+G+V+CR+CQ  Y IEYD+ SKFEEIASFVE+NK+SF DRAP  WMNPNYP C+FCG ENGARPV+P + R+INWLFLLLG+MLGVL+LNHLKYF
Subjt:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF

Query:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        C  T NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]1.00e-8067.9Show/hide
Query:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP
        +T  I PPY WST R AVVHTL +L SN+I+TI+GEV+C++C+R+YEIEYD+VSKF EI SFVE N +SF DRAP EWM PNYP C+FCG E G +PV+P
Subjt:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP

Query:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
         E  +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFS
Subjt:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

XP_022972401.1 uncharacterized protein LOC111470968 [Cucurbita maxima]7.01e-7965.64Show/hide
Query:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP
        +T  I PPY WST R AVVHTL +L  N+I+TI+G+V+C++C+R+YEIEY++VSKF EI SFVE N +SF DRAP +WM PNYP C+FCG E G +PV+P
Subjt:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP

Query:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
         E  +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFSR
Subjt:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein6.44e-9162.13Show/hide
Query:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN
        LSLRPP  + SS  + +P          H  R N++T+ R++R+ G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ LKSN
Subjt:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN

Query:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF
        +I+ I+G+V+CR+CQ  Y IEYD+ SKFEEIASFVE+NK+SF DRAP  WMNPNYP C+FCG ENGARPV+P + R+INWLFLLLG+MLGVL+LNHLKYF
Subjt:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF

Query:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        C  T NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

A0A1S3BHR1 uncharacterized protein LOC1034897701.95e-9364.26Show/hide
Query:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN
        LSLRPP  +  S   PSPP         H  RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L+S+
Subjt:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN

Query:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF
        +I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHLKYF
Subjt:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF

Query:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        C  T+NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

A0A5A7T547 Uncharacterized protein8.17e-7963.01Show/hide
Query:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN
        LSLRPP  +  S   PSPP         H  RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L+S+
Subjt:  LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSN

Query:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF
        +I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHLKYF
Subjt:  RIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF

Query:  CGCTDNHRTGAKNRLLYLT
        C  T+NHRTGAKNRLLYLT
Subjt:  CGCTDNHRTGAKNRLLYLT

A0A6J1GLD4 uncharacterized protein LOC1114553884.86e-8167.9Show/hide
Query:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP
        +T  I PPY WST R AVVHTL +L SN+I+TI+GEV+C++C+R+YEIEYD+VSKF EI SFVE N +SF DRAP EWM PNYP C+FCG E G +PV+P
Subjt:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP

Query:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
         E  +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFS
Subjt:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

A0A6J1I5V9 uncharacterized protein LOC1114709683.40e-7965.64Show/hide
Query:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP
        +T  I PPY WST R AVVHTL +L  N+I+TI+G+V+C++C+R+YEIEY++VSKF EI SFVE N +SF DRAP +WM PNYP C+FCG E G +PV+P
Subjt:  STREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVP

Query:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
         E  +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFSR
Subjt:  TEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein6.8e-4036.71Show/hide
Query:  SSSSVAPSPPPP------------RRSPGR----SHIQRANSITSHRISRNPGIIRRSSPAGGRC-NSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVH
        ++ S+ P PPPP            +++P      +H    + +     +  P  ++R      R   SRS       +   +  I PP+ W+T RR  + 
Subjt:  SSSSVAPSPPPP------------RRSPGR----SHIQRANSITSHRISRNPGIIRRSSPAGGRC-NSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVH

Query:  TLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVL
        +L +L+SN+I TI+GEV+CR C++VY++ Y+L  +F E+  F    K    DRA  +W  P   RC+ CG E   +PV+   + +INWLFLLLGQ LG  
Subjt:  TLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVL

Query:  SLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDP
        +L  LK FC  + NHRTGAK+R+LYLTY+ LC  + P
Subjt:  SLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.6e-4142.25Show/hide
Query:  RRSSPAGGRCNSRS-AAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP
        RR  P GG+    S   V  +      REI PPY W+T +   + +   L SN I  ISG+V C+ C R   +EY+L  KF E+  +++ NK+    RAP
Subjt:  RRSSPAGGRCNSRS-AAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP

Query:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
          W  P    C+ C  E   +PV+   +  INWLFLLLGQMLG  +L+ L+YFC     HRTG+K+R++Y+TYL+LC Q+DP G F+
Subjt:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

AT2G16190.2 FUNCTIONS IN: molecular_function unknown4.3e-2639.33Show/hide
Query:  RRSSPAGGRCNSRS-AAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP
        RR  P GG+    S   V  +      REI PPY W+T +   + +   L SN I  ISG+V C+ C R   +EY+L  KF E+  +++ NK+    RAP
Subjt:  RRSSPAGGRCNSRS-AAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP

Query:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
          W  P    C+ C  E   +PV+   +  INWLFLLLGQMLG  +L+ L
Subjt:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTCTCGCTCCGTCCGCCGGTACAAAACAGTAGTAGTAGTGTGGCGCCGTCTCCGCCGCCGCCTCGGCGCTCCCCCGGCCGCAGCCACATCCAGCGGGCAAATTCAATAAC
CTCACATAGAATTAGCCGGAATCCTGGAATAATTCGCCGATCTTCTCCGGCCGGCGGACGCTGCAATTCGCGATCCGCGGCGGTGCAGGCGATCGGTATGGAGGCGAGTA
CGAGGGAGATCCGGCCACCGTACCTGTGGTCGACGACGCGCCGAGCTGTGGTCCACACTCTGAGCCACCTAAAATCGAACCGGATAGTCACGATCAGTGGCGAGGTCCGG
TGCCGGCGGTGCCAGAGAGTGTACGAGATCGAGTATGACCTGGTTTCGAAATTCGAAGAGATCGCGAGTTTCGTAGAGAAAAACAAGGATTCGTTTCACGACAGAGCACC
GAGCGAGTGGATGAACCCAAATTATCCACGGTGCAAATTCTGCGGCCTGGAGAACGGAGCGCGGCCGGTGGTTCCGACGGAGGAGCGGCGGATCAATTGGCTGTTCTTGC
TTTTGGGACAAATGCTCGGAGTTTTGAGTCTGAATCATCTGAAATATTTCTGCGGATGCACAGATAATCATCGAACAGGTGCCAAGAATCGTCTCCTCTATCTCACTTAC
CTCACTCTCTGCAACCAAGTTGATCCTTCTGGCCGATTCAGTCGC
mRNA sequenceShow/hide mRNA sequence
CTCTCGCTCCGTCCGCCGGTACAAAACAGTAGTAGTAGTGTGGCGCCGTCTCCGCCGCCGCCTCGGCGCTCCCCCGGCCGCAGCCACATCCAGCGGGCAAATTCAATAAC
CTCACATAGAATTAGCCGGAATCCTGGAATAATTCGCCGATCTTCTCCGGCCGGCGGACGCTGCAATTCGCGATCCGCGGCGGTGCAGGCGATCGGTATGGAGGCGAGTA
CGAGGGAGATCCGGCCACCGTACCTGTGGTCGACGACGCGCCGAGCTGTGGTCCACACTCTGAGCCACCTAAAATCGAACCGGATAGTCACGATCAGTGGCGAGGTCCGG
TGCCGGCGGTGCCAGAGAGTGTACGAGATCGAGTATGACCTGGTTTCGAAATTCGAAGAGATCGCGAGTTTCGTAGAGAAAAACAAGGATTCGTTTCACGACAGAGCACC
GAGCGAGTGGATGAACCCAAATTATCCACGGTGCAAATTCTGCGGCCTGGAGAACGGAGCGCGGCCGGTGGTTCCGACGGAGGAGCGGCGGATCAATTGGCTGTTCTTGC
TTTTGGGACAAATGCTCGGAGTTTTGAGTCTGAATCATCTGAAATATTTCTGCGGATGCACAGATAATCATCGAACAGGTGCCAAGAATCGTCTCCTCTATCTCACTTAC
CTCACTCTCTGCAACCAAGTTGATCCTTCTGGCCGATTCAGTCGC
Protein sequenceShow/hide protein sequence
LSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVQAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVR
CRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTY
LTLCNQVDPSGRFSR