; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0531 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0531
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionVascular-related unknown protein 1
Genome locationMC02:4321439..4322829
RNA-Seq ExpressionMC02g0531
SyntenyMC02g0531
Gene Ontology termsGO:0010089 - xylem development (biological process)
InterPro domainsIPR039280 - VASCULAR-RELATED UNKNOWN PROTEIN


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604852.1 Vascular-related unknown protein 1, partial [Cucurbita argyrosperma subsp. sororia]2.12e-4761.24Show/hide
Query:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS
        E SV SSIQ  PHPLPS          ++S WTAYL DDPSD DAYSL   TSSLLSDAASHAA          + HL IP KLSLKPK   +QFFVDTS
Subjt:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS

Query:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL
        LEDTASSP NSPKVAD G    NNYRRKSSLG GR MGEK+  ++ +          YTDLK+RGLCLVPL++ ANYL
Subjt:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL

KAG7034965.1 hypothetical protein SDJN02_01758, partial [Cucurbita argyrosperma subsp. argyrosperma]7.43e-4861.8Show/hide
Query:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS
        E SV SSIQ  PHPLPS          ++S WTAYL DDPSD DAYSL   TSSLLSDAASHAA          + HL IP KLSLKPK   +QFFVDTS
Subjt:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS

Query:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL
        LEDTASSP NSPKVAD G    NNYRRKSSLG GR MGEK+  ++ +          YTDLK+RGLCLVPLS+ ANYL
Subjt:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL

XP_022140677.1 uncharacterized protein LOC111011279 [Momordica charantia]3.62e-107100Show/hide
Query:  MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADL
        MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADL
Subjt:  MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADL

Query:  GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP
        GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP
Subjt:  GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP

XP_022947988.1 vascular-related unknown protein 1-like [Cucurbita moschata]3.00e-4761.24Show/hide
Query:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS
        E SV SSIQ  PHPLPS          ++S WTAYL DDPSD DAYSL   +SSLLSDAASHAA          + HL IP KLSLKPK   +QFFVDTS
Subjt:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS

Query:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL
        LEDTASSP NSPKVAD G    NNYRRKSSLG GR MGEK+  ++ +          YTDLK+RGLCLVPLS+ ANYL
Subjt:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL

XP_023534015.1 vascular-related unknown protein 1 [Cucurbita pepo subsp. pepo]2.19e-4761.24Show/hide
Query:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS
        E SV SSIQ  PHPLPS          ++S WTAYL DDPSD DAYSL   +SSLLSDAASHAA          + HL IP KLSLKPK   +QFFVDTS
Subjt:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS

Query:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL
        LEDTASSP NSPKVAD G    NNYRRKSSLG GR MGEK+  ++ +          YTDLK+RGLCLVPLS+ ANYL
Subjt:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL

TrEMBL top hitse value%identityAlignment
A0A1S3C201 uncharacterized protein LOC1034959483.04e-4259.77Show/hide
Query:  ENSVYSSIQ-IPHPLPS----ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAA---------HLNIPSKLSLKPKPKPSQFFVDTSLEDTAS
        E SVYSS Q  PHPLPS    A +SGWTAYL D+PSD+D  +    TSSLLSDAASHA AA         HL IP+KL LKPK     FFVDTSLEDTAS
Subjt:  ENSVYSSIQ-IPHPLPS----ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAA---------HLNIPSKLSLKPKPKPSQFFVDTSLEDTAS

Query:  SPHNSPKVAD-LGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAA-YTDLKQRGLCLVPLSIFANYL
        SP NSPKV D LG    N+YRRKSSLG G    EK+  ++   EI  +  A+ YTDLK+RGLCLVPLS+  NYL
Subjt:  SPHNSPKVAD-LGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAA-YTDLKQRGLCLVPLSIFANYL

A0A5D3BK94 Uncharacterized protein3.04e-4259.77Show/hide
Query:  ENSVYSSIQ-IPHPLPS----ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAA---------HLNIPSKLSLKPKPKPSQFFVDTSLEDTAS
        E SVYSS Q  PHPLPS    A +SGWTAYL D+PSD+D  +    TSSLLSDAASHA AA         HL IP+KL LKPK     FFVDTSLEDTAS
Subjt:  ENSVYSSIQ-IPHPLPS----ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAA---------HLNIPSKLSLKPKPKPSQFFVDTSLEDTAS

Query:  SPHNSPKVAD-LGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAA-YTDLKQRGLCLVPLSIFANYL
        SP NSPKV D LG    N+YRRKSSLG G    EK+  ++   EI  +  A+ YTDLK+RGLCLVPLS+  NYL
Subjt:  SPHNSPKVAD-LGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAA-YTDLKQRGLCLVPLSIFANYL

A0A6J1CHQ3 uncharacterized protein LOC1110112791.75e-107100Show/hide
Query:  MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADL
        MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADL
Subjt:  MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADL

Query:  GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP
        GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP
Subjt:  GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP

A0A6J1G847 Uncharacterized protein1.45e-4761.24Show/hide
Query:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS
        E SV SSIQ  PHPLPS          ++S WTAYL DDPSD DAYSL   +SSLLSDAASHAA          + HL IP KLSLKPK   +QFFVDTS
Subjt:  ENSVYSSIQ-IPHPLPS---------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTS

Query:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL
        LEDTASSP NSPKVAD G    NNYRRKSSLG GR MGEK+  ++ +          YTDLK+RGLCLVPLS+ ANYL
Subjt:  LEDTASSPHNSPKVADLGA--NNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL

A0A6J1I137 Uncharacterized protein1.94e-4761.93Show/hide
Query:  ENSVYSSIQ-IPHPLPS------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTSLED
        E SV SSIQ IPHPLPS      A++S WTAYL DDPSD DAYSL   TSSLLSDAASHAA          + HL IP KLSLKPK   +QFFVDTSLED
Subjt:  ENSVYSSIQ-IPHPLPS------ADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAA----------AAHLNIPSKLSLKPKPKPSQFFVDTSLED

Query:  TASSPHNSPKVADLG---ANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL
        TASSP NSPKVAD G    NN+ R+KSSLG GR MGEK+  ++ +          Y DLK+RGLCLVPLS+ ANYL
Subjt:  TASSPHNSPKVADLG---ANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYL

SwissProt top hitse value%identityAlignment
Q5BPG5 Vascular-related unknown protein 46.2e-0532.67Show/hide
Query:  DDSGWTAYLLD-DPSDEDAYSLGVGTSSLLSDAASHAAAAH-LNIP----SKLSLK-PKPKPSQFFVDTSLEDTASSPHNSPKVADLG--ANNNYRRKSS
        ++S WT Y  D   +      +G  +SS +SDAAS  A    LN+     S L +K  + +   F     LEDTASSP  SP V  +    +NN R    
Subjt:  DDSGWTAYLLD-DPSDEDAYSLGVGTSSLLSDAASHAAAAH-LNIP----SKLSLK-PKPKPSQFFVDTSLEDTASSPHNSPKVADLG--ANNNYRRKSS

Query:  LGKGRTMGEKFSINKSEEEINL--EMGAAYTDLKQRGLCLVPLSIFANYL
        +           +   E+ ++     G    DLK++GLCLVPLS+  N+L
Subjt:  LGKGRTMGEKFSINKSEEEINL--EMGAAYTDLKQRGLCLVPLSIFANYL

Q9LSZ3 Vascular-related unknown protein 11.7e-1033.53Show/hide
Query:  DDSGWTAYLLD-------------DPSDEDAYSLGVGTSSLLSDAASHAAAAH---LNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADLGANN
        ++SGWT YL D             D  D+ +YSL   ++SL+SDAA+HA +     +N P+KL    + +  +   D SLEDTASSP NSPKV+      
Subjt:  DDSGWTAYLLD-------------DPSDEDAYSLGVGTSSLLSDAASHAAAAH---LNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADLGANN

Query:  NYRRKSS--------LGKGRTMG-----------EKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANY
           RK          +G  R MG           +K ++ ++  + N        DL+ RGLC+VP+S+ AN+
Subjt:  NYRRKSS--------LGKGRTMG-----------EKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANY

Arabidopsis top hitse value%identityAlignment
AT1G50930.1 unknown protein5.4e-0428.65Show/hide
Query:  IQIPHPLPSADDSGWTAYLLD-DPSDEDAYSLGVGT------SSLLSDAASHAAAAHLN--IPSKL-SLKPKPKPSQFF----------------VDTSL
        I++ H     ++S W  Y  D D  DE   + G  T      SS++SDAAS      +N  +  K  ++   PK  +                   +   
Subjt:  IQIPHPLPSADDSGWTAYLLD-DPSDEDAYSLGVGT------SSLLSDAASHAAAAHLN--IPSKL-SLKPKPKPSQFF----------------VDTSL

Query:  EDTASSPHNSPKVADL--GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYT-DLKQRGLCLVPLSIFANYL
        EDTASSP N  K+  +   AN+N R   ++    T  E   I ++  +I   M   ++ +LK+RGLC+VPLS+ +N++
Subjt:  EDTASSPHNSPKVADL--GANNNYRRKSSLGKGRTMGEKFSINKSEEEINLEMGAAYT-DLKQRGLCLVPLSIFANYL

AT3G21710.1 unknown protein6.1e-0841.49Show/hide
Query:  DDSGWTAYLLD-------------DPSDEDAYSLGVGTSSLLSDAASHAAAAH---LNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVA
        ++SGWT YL D             D  D+ +YSL   ++SL+SDAA+HA +     +N P+KL    + +  +   D SLEDTASSP NSPKV+
Subjt:  DDSGWTAYLLD-------------DPSDEDAYSLGVGTSSLLSDAASHAAAAH---LNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVA

AT3G21710.2 unknown protein1.2e-1133.53Show/hide
Query:  DDSGWTAYLLD-------------DPSDEDAYSLGVGTSSLLSDAASHAAAAH---LNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADLGANN
        ++SGWT YL D             D  D+ +YSL   ++SL+SDAA+HA +     +N P+KL    + +  +   D SLEDTASSP NSPKV+      
Subjt:  DDSGWTAYLLD-------------DPSDEDAYSLGVGTSSLLSDAASHAAAAH---LNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADLGANN

Query:  NYRRKSS--------LGKGRTMG-----------EKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANY
           RK          +G  R MG           +K ++ ++  + N        DL+ RGLC+VP+S+ AN+
Subjt:  NYRRKSS--------LGKGRTMG-----------EKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANY

AT5G54790.1 unknown protein4.4e-0632.67Show/hide
Query:  DDSGWTAYLLD-DPSDEDAYSLGVGTSSLLSDAASHAAAAH-LNIP----SKLSLK-PKPKPSQFFVDTSLEDTASSPHNSPKVADLG--ANNNYRRKSS
        ++S WT Y  D   +      +G  +SS +SDAAS  A    LN+     S L +K  + +   F     LEDTASSP  SP V  +    +NN R    
Subjt:  DDSGWTAYLLD-DPSDEDAYSLGVGTSSLLSDAASHAAAAH-LNIP----SKLSLK-PKPKPSQFFVDTSLEDTASSPHNSPKVADLG--ANNNYRRKSS

Query:  LGKGRTMGEKFSINKSEEEINL--EMGAAYTDLKQRGLCLVPLSIFANYL
        +           +   E+ ++     G    DLK++GLCLVPLS+  N+L
Subjt:  LGKGRTMGEKFSINKSEEEINL--EMGAAYTDLKQRGLCLVPLSIFANYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGGAGAATTCGGTTTATTCATCCATCCAAATCCCGCACCCATTGCCTTCCGCCGACGACAGCGGCTGGACCGCCTATTTGCTCGACGACCCCTCCGACGAAGA
CGCCTACAGCCTCGGAGTCGGAACCTCCTCCCTGCTCTCCGACGCCGCTTCTCACGCCGCCGCCGCACACCTCAACATTCCCAGCAAATTGAGTCTCAAGCCAAAGCCAA
AGCCATCTCAATTTTTTGTTGATACTTCTCTCGAAGATACTGCCAGTTCCCCTCACAATAGCCCCAAGGTTGCTGATTTGGGTGCCAACAATAATTATCGAAGAAAGAGT
TCTCTGGGAAAGGGGAGGACGATGGGAGAGAAATTTAGTATTAATAAATCAGAAGAAGAAATCAATTTGGAGATGGGTGCTGCATATACAGACTTGAAACAAAGAGGCCT
GTGCTTGGTTCCTCTGTCCATCTTCGCCAACTACTTGCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATAGTGTTCGTCAGACACAATTCTAAAATTCGTAGGAAGAATTGAAATTAAAAAGCAGAAAGATCAACCACCGCCGTGCCATTGGATGATTACAAAGAATTCAAATTCAT
AAGACCCTGTGGTGTTGTGTGTGTGAGTGGTAGATGAATAAGTTGGATGGCCCACACCCCCAGATTCTGAAGGCGTCATGCTTTCCTCCAATTTCACTTATAAAATCTCT
CCATCCACTCCCAATTTTCTTTGCATCCAAATTAAAATTAGAACTCTCGCTCTCGCTCTCGCCCCAACTTGCAACTTCCAACTTCCAAGTCCAATTCAGAGAAAAAAAAA
AAAAATAGATGGAGAAGGAGAATTCGGTTTATTCATCCATCCAAATCCCGCACCCATTGCCTTCCGCCGACGACAGCGGCTGGACCGCCTATTTGCTCGACGACCCCTCC
GACGAAGACGCCTACAGCCTCGGAGTCGGAACCTCCTCCCTGCTCTCCGACGCCGCTTCTCACGCCGCCGCCGCACACCTCAACATTCCCAGCAAATTGAGTCTCAAGCC
AAAGCCAAAGCCATCTCAATTTTTTGTTGATACTTCTCTCGAAGATACTGCCAGTTCCCCTCACAATAGCCCCAAGGTTGCTGATTTGGGTGCCAACAATAATTATCGAA
GAAAGAGTTCTCTGGGAAAGGGGAGGACGATGGGAGAGAAATTTAGTATTAATAAATCAGAAGAAGAAATCAATTTGGAGATGGGTGCTGCATATACAGACTTGAAACAA
AGAGGCCTGTGCTTGGTTCCTCTGTCCATCTTCGCCAACTACTTGCCCTGATGTCCATACACTAATTAACTCTCTCTCTATATCATACTCAAAAATTAAAATTATACTAT
TTTCCTTGTAATTTACCACCTACACCTTGCCTTTTATGCTCCATCTTATATATATTTACTCTGTATGTGTATCTCAAATTACTAATTTACTAATTTCATAATAAACTTAC
CGTTTTTTTTTTTTGACCCTCATAACGCTACAATTTTGGACTTGAAAGTAAATTCTGCCCAATTTGAGGATTTAATTATGAAACGGGTAATTATAAGATTAAATGAGCAG
AATAGGCGCAATTATGGCCCTATTAAAATCCAAATTAGCGTGAAAGTAAGGTTGCAAATTTAATTTTCCAACAGAACGTGGGCCCCATCTCCACCACCAATCACACCGTT
GAC
Protein sequenceShow/hide protein sequence
MEKENSVYSSIQIPHPLPSADDSGWTAYLLDDPSDEDAYSLGVGTSSLLSDAASHAAAAHLNIPSKLSLKPKPKPSQFFVDTSLEDTASSPHNSPKVADLGANNNYRRKS
SLGKGRTMGEKFSINKSEEEINLEMGAAYTDLKQRGLCLVPLSIFANYLP