; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004569 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004569
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationscaffold995:375723..376220
RNA-Seq ExpressionMS004569
SyntenyMS004569
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649619.1 hypothetical protein Csa_012837 [Cucumis sativus]3.7e-5073.05Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP
        MD DEFYR+PAAVPFKWEIKPGVPR HHRL  SP  SPP    QKLKPPP VSH   P      SLHSS RT+S+RWRF RS      QVS ++GCFPSP
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP

Query:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWA
         PNRKS KS++RK PEP+Y+++L+TLSRWSVSSRKSISPFR SVSSSPSSFSSYQSSPRPTSDTEWA
Subjt:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWA

XP_004142634.1 uncharacterized protein LOC101220757 [Cucumis sativus]2.1e-5373.84Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP
        MD DEFYR+PAAVPFKWEIKPGVPR HHRL  SP  SPP    QKLKPPP VSH   P      SLHSS RT+S+RWRF RS      QVS ++GCFPSP
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP

Query:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
         PNRKS KS++RK PEP+Y+++L+TLSRWSVSSRKSISPFR SVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
Subjt:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF

XP_008444194.1 PREDICTED: uncharacterized protein LOC103487607 [Cucumis melo]1.0e-5273.99Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP
        MD DEFYR+PAAVPFKWEIKPGVPR HHR   SP  SPP    QKLKPPP VSH   PS     SLHSS RTRSDRWRF RS      QVS ++GCFPSP
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP

Query:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSV-SSSPSSFSSYQSSPRPTSDTEWAGFGLF
         PNRKS K+++RK PEP+Y+++L+TLSRWSVSSRKSISPFR SV SSSPSSFSSYQSSPRPTSDTEWAGFGLF
Subjt:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSV-SSSPSSFSSYQSSPRPTSDTEWAGFGLF

XP_022131529.1 uncharacterized protein DKFZp434B061-like [Momordica charantia]5.5e-8699.4Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRK
        MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSH RRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRK
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRK

Query:  SGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
        SGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
Subjt:  SGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF

XP_038899347.1 uncharacterized protein LOC120086669 [Benincasa hispida]1.2e-5373.53Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP--QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPS
        MD DEFYR+PAAVPFKWEIKPGVP+ HHRL  SP  SPP   QKLKPPP VS+   PS     SLHSSSRTRSDRWRF     ++P QVS  +GCFPSP 
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP--QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPS

Query:  PNRKSGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
        PNRKS KS++R PEP+Y++ LE+LSRWSVSSRKSISPFR SVSSSPSS+SSY SSPRPTSDTEWAGFGLF
Subjt:  PNRKSGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF

TrEMBL top hitse value%identityAlignment
A0A1S3BAM4 uncharacterized protein LOC1034876075.0e-5373.99Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP
        MD DEFYR+PAAVPFKWEIKPGVPR HHR   SP  SPP    QKLKPPP VSH   PS     SLHSS RTRSDRWRF RS      QVS ++GCFPSP
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSP

Query:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSV-SSSPSSFSSYQSSPRPTSDTEWAGFGLF
         PNRKS K+++RK PEP+Y+++L+TLSRWSVSSRKSISPFR SV SSSPSSFSSYQSSPRPTSDTEWAGFGLF
Subjt:  SPNRKSGKSMNRK-PEPNYTTELETLSRWSVSSRKSISPFRDSV-SSSPSSFSSYQSSPRPTSDTEWAGFGLF

A0A6J1BQH3 uncharacterized protein DKFZp434B061-like2.6e-8699.4Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRK
        MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSH RRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRK
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRK

Query:  SGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
        SGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
Subjt:  SGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF

A0A6J1FHC7 uncharacterized protein LOC1114457752.9e-4868.05Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQ---KLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSP
        MD DEFYR+PAAVPFKWEIKPGVPR HHRL   P+  PQ   KLKPPP V+ ++    S+S       RTRSDRW   +S LAEP QVS   GCF SP P
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQ---KLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSP

Query:  NRKSGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
        NRK+ K +NRKPEP+Y +ELETL RWSVSS+KSISPFR+SVSS  SS SSYQSSPRPTSD+EWAGFGLF
Subjt:  NRKSGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF

A0A6J1ISY3 uncharacterized protein LOC1114803256.4e-4867.25Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQ---KLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSP
        MD DEFYR+PAAVPFKWEIKPGVPR HH L P P+  PQ   KLKPPP V+ ++    S+S       RTRSDRW  ++S LAEP QVS   GCF SP P
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQ---KLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSP

Query:  NRKSGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSS--SPSSFSSYQSSPRPTSDTEWAGFGLF
        NRK+ K +NRKPEP+  +ELETL RWS+SS+KSISPFR+SVSS  SPSS SSYQSSPRPTSD+EWAGFGLF
Subjt:  NRKSGKSMNRKPEPNYTTELETLSRWSVSSRKSISPFRDSVSS--SPSSFSSYQSSPRPTSDTEWAGFGLF

A0A7N2MK80 Uncharacterized protein2.3e-3451.27Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRA-----------------------HHRLCPSPSPP---PQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRF
        M  DE   +P A+PFKWEIKPGVP+                        H ++ P P+PP   PQKL+PPP  SH   P E  + S  SS RTRS+RWRF
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRA-----------------------HHRLCPSPSPP---PQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRF

Query:  AR-SSLAEPPQVSPATGCFPSPSPNRKSGKSMNRKP----EPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
         R +S   P  V+P  GCF S    RKS K+  +KP    EP+Y+++LE LSRWSVSSR+S+SPFR+S  S  SSFSSYQSSPRP SD EWAGFGLF
Subjt:  AR-SSLAEPPQVSPATGCFPSPSPNRKSGKSMNRKP----EPNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21695.1 hydroxyproline-rich glycoprotein family protein1.6e-0632.23Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAH-----HRLCPSPSP---PPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQV---SPA-
        +D D+ ++RP AVPFKWEI+PGVP+         L   P P   PP KLK  P    S  PS SSS S  S SR+R        S  A PP     SP+ 
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAH-----HRLCPSPSP---PPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQV---SPA-

Query:  --TGCFPSPSP-----NRKSGKSMNR----------------------KPEPNYTT-------ELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSP
          + C  SP+P     + ++G S+ R                      + EP+ TT       E  T    S       SP     SS  SSFSS + SP
Subjt:  --TGCFPSPSP-----NRKSGKSMNR----------------------KPEPNYTT-------ELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSP

Query:  RPTSDTEWAGF
           +D++ + +
Subjt:  RPTSDTEWAGF

AT1G77400.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF688 (InterPro:IPR007789)6.3e-1634.05Show/hide
Query:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSP--PPQKLKP-------------PPVVS--------------------HS-----------RRPS
        +D D+ ++RP  +PF WEI+PGVP+       + +P  PP+KL P             PP +S                    HS           R PS
Subjt:  MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSP--PPQKLKP-------------PPVVS--------------------HS-----------RRPS

Query:  E-----SSSCSLHSSSRTRSDRWRFARSSLAEP---PQVS-----PATGCFPSPS---PNRKSG----KSMNRKPEPNYTTELETLSRWSVSSRKSISPF
              S   S  SS R  S+RW+  R +   P   P+ S        GCFPSP       KSG    KS +R     Y +++ET+S W+VSSR+S+SP 
Subjt:  E-----SSSCSLHSSSRTRSDRWRFARSSLAEP---PQVS-----PATGCFPSPS---PNRKSG----KSMNRKPEPNYTTELETLSRWSVSSRKSISPF

Query:  RDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
             S  SSFSS + SPR  ++ EW GFGLF
Subjt:  RDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCCGACGAATTTTACCGGCGACCGGCTGCTGTTCCCTTCAAATGGGAGATCAAACCCGGCGTCCCCAGGGCTCACCACCGGCTCTGCCCGTCCCCAAGTCCGCC
GCCACAAAAGTTGAAACCTCCTCCCGTTGTATCCCACTCCCGCCGTCCTTCCGAATCCTCCTCCTGCTCCCTCCACTCGTCCTCGCGAACCCGGTCTGATCGCTGGCGGT
TCGCTCGGTCCAGTCTCGCCGAACCTCCGCAGGTCTCGCCTGCAACTGGATGCTTCCCCTCGCCTTCGCCGAACCGGAAATCGGGCAAGAGTATGAACCGGAAACCCGAA
CCGAATTATACCACTGAATTGGAGACTTTGTCCCGGTGGTCCGTTTCCAGCAGGAAGTCGATTTCGCCGTTCAGAGATTCGGTTTCGTCGTCCCCTTCGTCGTTCTCGTC
GTACCAGTCTTCGCCCCGCCCAACGAGTGATACTGAGTGGGCCGGATTTGGGCTCTTT
mRNA sequenceShow/hide mRNA sequence
ATGGATGCCGACGAATTTTACCGGCGACCGGCTGCTGTTCCCTTCAAATGGGAGATCAAACCCGGCGTCCCCAGGGCTCACCACCGGCTCTGCCCGTCCCCAAGTCCGCC
GCCACAAAAGTTGAAACCTCCTCCCGTTGTATCCCACTCCCGCCGTCCTTCCGAATCCTCCTCCTGCTCCCTCCACTCGTCCTCGCGAACCCGGTCTGATCGCTGGCGGT
TCGCTCGGTCCAGTCTCGCCGAACCTCCGCAGGTCTCGCCTGCAACTGGATGCTTCCCCTCGCCTTCGCCGAACCGGAAATCGGGCAAGAGTATGAACCGGAAACCCGAA
CCGAATTATACCACTGAATTGGAGACTTTGTCCCGGTGGTCCGTTTCCAGCAGGAAGTCGATTTCGCCGTTCAGAGATTCGGTTTCGTCGTCCCCTTCGTCGTTCTCGTC
GTACCAGTCTTCGCCCCGCCCAACGAGTGATACTGAGTGGGCCGGATTTGGGCTCTTT
Protein sequenceShow/hide protein sequence
MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSPSPPPQKLKPPPVVSHSRRPSESSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRKSGKSMNRKPE
PNYTTELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF