; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0343 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0343
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationMC07:11260015..11260494
RNA-Seq ExpressionMC07g0343
SyntenyMC07g0343
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157692.1 uncharacterized protein LOC111024349 [Momordica charantia]9.55e-119100Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
        SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK

Query:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
Subjt:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

XP_031248679.1 uncharacterized protein LOC116106459 [Pistacia vera]2.48e-6156.25Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
        +E +P PYPWAT HRA + SL+ L  + I KIRGE++CK+C +  ++E++L EKFMEV SFIS NK  MH RAP  W  P    C  C       PVT K
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK

Query:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        K+ +NWLFLLLGQM+G   L  LKY CKHT+NHRTGAKDR++Y+ Y+ LCKQL P GP+D
Subjt:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

XP_031257817.1 uncharacterized protein LOC116115824 [Pistacia vera]2.48e-6156.25Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
        +E +P PYPWAT HRA + SL+ L  + I KIRGE++CK+C +  ++E++L EKFMEV SFIS NK  MH RAP  W  P    C  C       PVT K
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK

Query:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        K+ +NWLFL LGQM+G   L  LKY CKHT+NHRTGAKDR++Y+AY+ LCKQL P GP+D
Subjt:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

XP_039002770.1 uncharacterized protein LOC120129312 [Hibiscus syriacus]8.31e-6055.9Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG
        SETI PPYPWAT H+A +  L  L  NGI  I G+++CK+C    +MEF+L EKF E+  +I+ NK  MH RAP  W  P    C  C  E   +PV + 
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG

Query:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        KK+ +NWLFLLLGQMIG  +LEHLKY CKHT  HRT AKDR++Y+AY+CLCKQL PT P+D
Subjt:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

XP_039013264.1 uncharacterized protein LOC120142848 [Hibiscus syriacus]9.90e-6157.14Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG
        SETI PPYPWAT HRA++  L  L  NGI  I G+++CK+C    +ME++L EKF E+  +I+ NK  MH RAP  W  P    C  C  E   +PV + 
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG

Query:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        KKR +NWLFLLLGQMIG  +LEHLKY CKHT  HRT AKDR++Y+AY+CLCKQL PTGP+D
Subjt:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

TrEMBL top hitse value%identityAlignment
A0A2H5NYI9 Uncharacterized protein1.23e-5955.62Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
        +ETIP P+PWAT  RA + SL+ LT + + KI GE++CK+C    ++E++L  KFMEV SFIS NK  MH RAP  W  P   +C  C    C +P+ GK
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK

Query:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        K+ +NWLFLLLGQM+G   L  LKY CKHTRNHRTGAKDR++Y+ Y+ LCKQL P G YD
Subjt:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

A0A6A2ZN07 Uncharacterized protein4.79e-6157.14Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG
        SETI PPYPWAT HRA++  L  L  NGI  I G+++CK+C    +ME++L EKF E+  +I+ NK  MH RAP  W  P    C  C  E   +PV + 
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG

Query:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        KKR +NWLFLLLGQMIG  +LEHLKY CKHT  HRT AKDR++Y+AY+CLCKQL PTGP+D
Subjt:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

A0A6A3AEM8 Uncharacterized protein4.03e-6055.9Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG
        SETI PPYPWAT H+A +  L  L  NGI  I G+++CK+C    +MEF+L EKF E+  +I+ NK  MH RAP  W  P    C  C  E   +PV + 
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TG

Query:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        KK+ +NWLFLLLGQMIG  +LEHLKY CKHT  HRT AKDR++Y+AY+CLCKQL PT P+D
Subjt:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

A0A6J1DV57 uncharacterized protein LOC1110243494.63e-119100Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
        SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK

Query:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
Subjt:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

A0A6P6WVB4 uncharacterized protein LOC1137358265.16e-5954.09Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK
        SE IPPPYPWAT  RA + +LD L  NG+ +IRGE++CK+C    +MEF+L  KF E+ +FI+ NK  +H RAP  W  P   +C  C      +P+  K
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGK

Query:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPY
        KR +NWLFLLLGQM+G   L  LKY CKHT+NHRTGAKDR++Y+ Y+ LCKQL P GP+
Subjt:  KREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein7.9e-4048.08Show/hide
Query:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTG-
        S+TI PP+PWAT  R  I+SL+ L  N I  I GE++C+ C    ++ +NL E+F EV  F    K +M  RA + W  P ++ C  C  E   +PV   
Subjt:  SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTG-

Query:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHP
        +K ++NWLFLLLGQ +GF +LE LK  CKH++NHRTGAKDR++Y+ YM LCK L P
Subjt:  KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)9.4e-4148.73Show/hide
Query:  IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKR
        I PPYPWAT+    I+S  DL+ N I  I G++ CK C     +E+NL EKF E+  +I +NK EM  RAP  W  P    C  C  E   +PV + +K 
Subjt:  IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKR

Query:  EMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        E+NWLFLLLGQM+G  +L+ L+Y C+    HRTG+KDR+VYI Y+ LCKQL P GP++
Subjt:  EMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

AT2G16190.2 FUNCTIONS IN: molecular_function unknown5.5e-2546.28Show/hide
Query:  IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKR
        I PPYPWAT+    I+S  DL+ N I  I G++ CK C     +E+NL EKF E+  +I +NK EM  RAP  W  P    C  C  E   +PV + +K 
Subjt:  IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKR

Query:  EMNWLFLLLGQMIGFSSLEHL
        E+NWLFLLLGQM+G  +L+ L
Subjt:  EMNWLFLLLGQMIGFSSLEHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGCGAGACAATCCCGCCACCGTATCCATGGGCGACGGAGCACCGAGCAATCATACGCAGCCTGGACGATCTCACCCGAAACGGGATAGAGAAAATCAGAGGGGAAATGAA
GTGCAAGAAGTGCGGAGTAGATAGCAAGATGGAATTCAATCTGACAGAGAAGTTCATGGAAGTAGAGAGTTTCATATCGATGAACAAGTCGGAGATGCACCAGCGAGCCC
CGAGGGGTTGGGAGTGCCCTCCGCGGCAGGACTGCGGCTGTTGCAGCGGAGAGGGCTGCACGGAGCCGGTGACGGGGAAGAAGAGGGAGATGAATTGGCTGTTCTTGTTG
CTAGGGCAAATGATTGGATTCTCTAGCTTAGAACATCTGAAATACTTGTGTAAGCACACGAGGAACCACAGGACAGGCGCAAAAGACAGGCTTGTGTACATTGCCTACAT
GTGTTTGTGCAAACAACTTCATCCAACAGGACCTTATGAT
mRNA sequenceShow/hide mRNA sequence
AGCGAGACAATCCCGCCACCGTATCCATGGGCGACGGAGCACCGAGCAATCATACGCAGCCTGGACGATCTCACCCGAAACGGGATAGAGAAAATCAGAGGGGAAATGAA
GTGCAAGAAGTGCGGAGTAGATAGCAAGATGGAATTCAATCTGACAGAGAAGTTCATGGAAGTAGAGAGTTTCATATCGATGAACAAGTCGGAGATGCACCAGCGAGCCC
CGAGGGGTTGGGAGTGCCCTCCGCGGCAGGACTGCGGCTGTTGCAGCGGAGAGGGCTGCACGGAGCCGGTGACGGGGAAGAAGAGGGAGATGAATTGGCTGTTCTTGTTG
CTAGGGCAAATGATTGGATTCTCTAGCTTAGAACATCTGAAATACTTGTGTAAGCACACGAGGAACCACAGGACAGGCGCAAAAGACAGGCTTGTGTACATTGCCTACAT
GTGTTTGTGCAAACAACTTCATCCAACAGGACCTTATGAT
Protein sequenceShow/hide protein sequence
SETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLL
LGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD