CuGenDBv2

MS022063 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS022063
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: scaffold47:434801..435361
RNA-Seq Expression: MS022063
Synteny: MS022063
Gene Ontology terms: GO:0016310 - phosphorylation (biological process)
                     GO:0016301 - kinase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.4e-58 | % identity: 61.58
Query:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        RRRRSR       I+PPYPWS E +A +H+L YL+ N I+TI GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 8.4e-59 | % identity: 61.58
Query:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        RRRRSR       I+PPYPWS E +A +H+L YL+ N I+TI GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia] | e value: 2.0e-105 | % identity: 98.4
Query:  PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES
        PSP RAREPASPVVR+RRSRVELKNTPIKPPYPWSTEHQA+VHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES
Subjt:  PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES

Query:  WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
Subjt:  WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia] | e value: 1.7e-72 | % identity: 73.71
Query:  RRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEEN
        RR R++ K+T I+PPYPWST ++A+VHDL YL++NQILTITGDV+C +C+KQY IEYDL+TKF+EIASFIEKNK TLHDRAP SWTNPN  +CK CG+E+
Subjt:  RRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEEN

Query:  CVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        C+RP IP    + D KNINWLFLLLGQMIG L L+HLKYFC YTNNHRT AK+RLVYLTYL+LCKQLQPS ELFH
Subjt:  CVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

XP_022953023.1 mucin-16-like [Cucurbita moschata] | e value: 1.4e-58 | % identity: 61.58
Query:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        RRRRSR       I+PPYPWS E +A +H+L YL+ N I+TI GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BHR1 uncharacterized protein LOC103489770 | e value: 5.8e-58 | % identity: 63.31
Query:  VRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCG
        +RR  SR       I+PPYPWST  +AMV  LN LR +QIL ITGDVRC +C+ +YTIEYD+++KFEEIASF+E+NK    DRAP SW NPN+  C+ CG
Subjt:  VRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCG

Query:  EENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPS
         EN  RP IP+ + + INWLFLLLG+M+G L L HLKYFC+YTNNHRTGAKNRL+YLTY+TLC Q+ PS
Subjt:  EENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPS

A0A6J1C462 uncharacterized protein LOC111007768 | e value: 9.8e-106 | % identity: 98.4
Query:  PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES
        PSP RAREPASPVVR+RRSRVELKNTPIKPPYPWSTEHQA+VHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES
Subjt:  PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES

Query:  WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
Subjt:  WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

A0A6J1C690 probable serine/threonine-protein kinase samkC | e value: 8.4e-73 | % identity: 73.71
Query:  RRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEEN
        RR R++ K+T I+PPYPWST ++A+VHDL YL++NQILTITGDV+C +C+KQY IEYDL+TKF+EIASFIEKNK TLHDRAP SWTNPN  +CK CG+E+
Subjt:  RRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEEN

Query:  CVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        C+RP IP    + D KNINWLFLLLGQMIG L L+HLKYFC YTNNHRT AK+RLVYLTYL+LCKQLQPS ELFH
Subjt:  CVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

A0A6J1GM83 mucin-16-like | e value: 6.9e-59 | % identity: 61.58
Query:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        RRRRSR       I+PPYPWS E +A +H+L YL+ N I+TI GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

A0A6J1I8I0 uncharacterized protein KIAA0754-like | e value: 2.0e-58 | % identity: 61.02
Query:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        RRRRSR       I+PPYPWS E +A +H+L YL+ N I+ I GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G49330.1 hydroxyproline-rich glycoprotein family protein | e value: 5.1e-46 | % identity: 46.77
Query:  PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES
        P P +     S  + R RS V  K+  I PP+PW+T  +  +  L YL  NQI TITG+V+C  CEK Y + Y+L  +F E+  F    K  + DRA + 
Subjt:  PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPES

Query:  WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELF
        W  P    C+LCG E  V+P I E  ++ INWLFLLLGQ +G   LE LK FC ++ NHRTGAK+R++YLTY+ LCK LQP  +LF
Subjt:  WTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1) | e value: 2.3e-38 | % identity: 41.08
Query:  SPGRAREPASPVVRRRRSRV-----ELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDR
        +P R R P     R  +  V      + +  I PPYPW+T+    +     L  N I  I+G V C  C++  T+EY+L  KF E+  +I+ NK  +  R
Subjt:  SPGRAREPASPVVRRRRSRV-----ELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDR

Query:  APESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQP
        AP SW+ P  + C+ C  E  ++P + E   + INWLFLLLGQM+G   L+ L+YFC   + HRTG+K+R+VY+TYL+LCKQL P
Subjt:  APESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown | e value: 3.2e-24 | % identity: 36.6
Query:  SPGRAREPASPVVRRRRSRV-----ELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDR
        +P R R P     R  +  V      + +  I PPYPW+T+    +     L  N I  I+G V C  C++  T+EY+L  KF E+  +I+ NK  +  R
Subjt:  SPGRAREPASPVVRRRRSRV-----ELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDR

Query:  APESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHL
        AP SW+ P  + C+ C  E  ++P + E   + INWLFLLLGQM+G   L+ L
Subjt:  APESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHL


Sequences
CDS sequence
CCTAGTCCTGGGCGTGCTCGAGAGCCTGCATCTCCAGTTGTGCGGCGGAGGCGATCGAGAGTCGAGCTGAAGAACACGCCGATCAAGCCACCCTATCCATGGTCCACGGA
GCACCAAGCCATGGTTCACGACCTCAACTACCTCCGCGAGAATCAAATCCTGACAATCACAGGTGACGTCAGATGCGATCGATGCGAGAAACAGTACACGATCGAGTATG
ACCTAATGACGAAATTTGAAGAGATTGCGAGTTTCATAGAGAAGAACAAGGCTACTTTGCACGACCGAGCTCCGGAGTCGTGGACGAACCCTAATTTTCTGGACTGCAAA
TTGTGTGGGGAAGAAAACTGCGTGAGGCCGACAATTCCGGAGGGCGATAACAAGAACATAAATTGGCTGTTCTTGCTTTTAGGGCAAATGATTGGACGTTTGAAACTTGA
ACATCTCAAATACTTCTGTGCTTACACCAATAATCATCGAACTGGTGCTAAGAATCGTCTTGTTTATCTCACTTATCTTACTCTTTGCAAACAACTTCAGCCCTCCATGG
AACTCTTCCAT
mRNA sequence
CCTAGTCCTGGGCGTGCTCGAGAGCCTGCATCTCCAGTTGTGCGGCGGAGGCGATCGAGAGTCGAGCTGAAGAACACGCCGATCAAGCCACCCTATCCATGGTCCACGGA
GCACCAAGCCATGGTTCACGACCTCAACTACCTCCGCGAGAATCAAATCCTGACAATCACAGGTGACGTCAGATGCGATCGATGCGAGAAACAGTACACGATCGAGTATG
ACCTAATGACGAAATTTGAAGAGATTGCGAGTTTCATAGAGAAGAACAAGGCTACTTTGCACGACCGAGCTCCGGAGTCGTGGACGAACCCTAATTTTCTGGACTGCAAA
TTGTGTGGGGAAGAAAACTGCGTGAGGCCGACAATTCCGGAGGGCGATAACAAGAACATAAATTGGCTGTTCTTGCTTTTAGGGCAAATGATTGGACGTTTGAAACTTGA
ACATCTCAAATACTTCTGTGCTTACACCAATAATCATCGAACTGGTGCTAAGAATCGTCTTGTTTATCTCACTTATCTTACTCTTTGCAAACAACTTCAGCCCTCCATGG
AACTCTTCCAT
Protein sequence
PSPGRAREPASPVVRRRRSRVELKNTPIKPPYPWSTEHQAMVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCK
LCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
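
Consistency check (illustrative sketch)
The genome location and the sequences in this record can be cross-checked with a few lines of Python. The sketch below is not part of CuGenDB; the helper names parse_location and span_length are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into scaffold name, start, and end."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Genome location copied from this record.
scaffold, start, end = parse_location("scaffold47:434801..435361")
length = span_length(start, end)

# A coding sequence made only of whole codons has a length divisible by 3.
codons, remainder = divmod(length, 3)
print(scaffold, length, codons, remainder)
```

Under the stated assumption the span is 561 bp, which is 187 whole codons with no remainder. That agrees with the 187-residue protein sequence listed in this record.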