CuGenDBv2

CcUC02G020780 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G020780
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: glutelin type-D 1-like
Genome location: CicolChr02:3371674..3372392
RNA-Seq Expression: CcUC02G020780
Synteny: CcUC02G020780
Gene Ontology terms:
  GO:0000326 - protein storage vacuole (cellular component)
  GO:0005783 - endoplasmic reticulum (cellular component)
  GO:0045735 - nutrient reservoir activity (molecular function)
InterPro domains:
  IPR006045 - Cupin 1
  IPR011051 - RmlC-like cupin domain superfamily
  IPR014710 - RmlC-like jelly roll fold


Homology
GenBank top hits (e value, % identity, alignment)
KAG6592225.1 Glutelin type-D 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.8e-65; % identity: 89.93)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQL KK+YG DGGSYYSWSPK+LPML EGNIGA+KLAL+KNGFALP YSDSAKVAYVLQGNGV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

XP_004150394.1 glutelin type-D 1 [Cucumis sativus] (e value: 3.3e-66; % identity: 90.65)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQLPKK+YG DGGSYY+WSPK+LPML EGNIGASKLAL+KNGFALP YSDSAKVAYVLQGNGV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

XP_008461502.1 PREDICTED: glutelin type-B 5-like [Cucumis melo] (e value: 1.1e-66; % identity: 91.37)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQLPKK+YGGDGGSYYSWSPK+LPML EGNIGASKLAL+KNGFALP YSDSAKVAYVLQG+GV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

XP_023535755.1 glutelin type-D 1-like [Cucurbita pepo subsp. pepo] (e value: 2.8e-65; % identity: 89.93)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQL KK+YG DGGSYYSWSPK+LPML EGNIGA+KLAL+KNGFALP YSDSAKVAYVLQGNGV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

XP_038897477.1 glutelin type-D 1-like [Benincasa hispida] (e value: 2.3e-67; % identity: 91.37)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        MD+DLTPQLPKK+YGGDGGSYY+WSPK+LPML EGNIGASKLAL+KNGFALP YSDSAKVAYVLQGNGV GIILPE EEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEAIDLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K666 Uncharacterized protein (e value: 1.6e-66; % identity: 90.65)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQLPKK+YG DGGSYY+WSPK+LPML EGNIGASKLAL+KNGFALP YSDSAKVAYVLQGNGV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

A0A1S3CG59 glutelin type-B 5-like (e value: 5.5e-67; % identity: 91.37)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQLPKK+YGGDGGSYYSWSPK+LPML EGNIGASKLAL+KNGFALP YSDSAKVAYVLQG+GV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

A0A5A7UAB0 Glutelin type-B 5-like (e value: 5.5e-67; % identity: 91.37)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQLPKK+YGGDGGSYYSWSPK+LPML EGNIGASKLAL+KNGFALP YSDSAKVAYVLQG+GV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

A0A6J1IBI2 glutelin type-D 1-like (e value: 1.5e-64; % identity: 87.77)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        MDIDLTPQLPKK+YGGDGGSYYSWSP +LPML  GNIGA+KLAL+KNGFALP YSDSAKVAYVLQGNGV GI+LPE EEKVIAI+KGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+ FLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

A0A6J1IH21 glutelin type-D 1-like (e value: 3.0e-65; % identity: 89.93)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M+IDLTPQL KK+YG DGGSYYSWSPK+LPML EGNIGA+KLAL+KNGFALP YSDSAKVAYVLQGNGV GIILPESEEKVIAIKKGDAIALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FNKEA DLVVLFLGDTSKAH SGEFT+FFLTGANGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

SwissProt top hits (e value, % identity, alignment)
F5B8V6 Conglutin alpha 1 (e value: 2.7e-10; % identity: 26.36)
Query:  PKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP---------------------ESEEKVIAIKKGD
        P      + G+  +W+P +  + C G +  S+  +++NG   PFY+++ +  Y+ QG G+ G+I P                     +  +KV   ++GD
Subjt:  PKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP---------------------ESEEKVIAIKKGD

Query:  AIALPFGVVTWWFNKEAIDLVVLFLGDTS
         IA+P GV  W +N E   ++ + L DT+
Subjt:  AIALPFGVVTWWFNKEAIDLVVLFLGDTS

P04405 Glycinin G2 (e value: 2.7e-10; % identity: 28.46)
Query:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP----------------------ESEEKVIAIKKGDAIALPF
        +GG   +W+P + P  C G +  S+  L +N    P Y++  +  Y+ QGNG+ G+I P                      +  +KV   ++GD IA+P 
Subjt:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP----------------------ESEEKVIAIKKGDAIALPF

Query:  GVVTWWFNKEAIDLVVLFLGDTS
        GV  W +N E   +V + + DT+
Subjt:  GVVTWWFNKEAIDLVVLFLGDTS

P05190 Legumin type B (e value: 5.9e-10; % identity: 29.84)
Query:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIIL-----------------------PESEEKVIAIKKGDAIALP
        + G   +W+P    + C G +   +  +  NG  LP YS S ++ Y++QG GV+G+ L                       P+S +K+   +KGD IA+P
Subjt:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIIL-----------------------PESEEKVIAIKKGDAIALP

Query:  FGVVTWWFNKEAIDLVVLFLGDTS
         G+  W +N     LV + L DTS
Subjt:  FGVVTWWFNKEAIDLVVLFLGDTS

P11828 Glycinin G3 (e value: 7.7e-10; % identity: 27.5)
Query:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP-------------------ESEEKVIAIKKGDAIALPFGVV
        +GG   +W+P + P  C G +  S+  L +N    P Y+++ +  Y+ QG+G+ G+I P                   +  +K+   ++GD IA+P G  
Subjt:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP-------------------ESEEKVIAIKKGDAIALPFGVV

Query:  TWWFNKEAIDLVVLFLGDTS
         W +N E   +V + L DT+
Subjt:  TWWFNKEAIDLVVLFLGDTS

Q647H2 Arachin Ahy-3 (e value: 2.0e-10; % identity: 24.83)
Query:  LTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP-----------------------------
        L  Q P      +GG   +W+P +    C G +  S+  L++N    PFYS++ +  ++ QG+G  G+I P                             
Subjt:  LTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP-----------------------------

Query:  ----ESEEKVIAIKKGDAIALPFGVVTWWFNKEAIDLVVLFLGDTSKAH
            ++ +KV   ++GD IA+P GV  W +N +  D+V + +  T+  H
Subjt:  ----ESEEKVIAIKKGDAIALPFGVVTWWFNKEAIDLVVLFLGDTSKAH

Arabidopsis top hits (e value, % identity, alignment)
AT1G03880.1 cruciferin 2 (e value: 1.8e-06; % identity: 20.39)
Query:  PKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP-------------------------ESEEKVIAI
        P ++   +GG    W      + C G     +  ++  G  LP + ++ K+ +V+ G G++G ++P                         +  +KV  +
Subjt:  PKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP-------------------------ESEEKVIAI

Query:  KKGDAIALPFGVVTWWFNKEAIDLVVLFLGD--TSKAHISGEFTNFFLTGAN
        + GD IA P GV  W++N     L+++   D  +++  +      F + G N
Subjt:  KKGDAIALPFGVVTWWFNKEAIDLVVLFLGD--TSKAHISGEFTNFFLTGAN

AT1G03890.1 RmlC-like cupins superfamily protein (e value: 9.1e-06; % identity: 24.41)
Query:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGII---LPES-----------------------EEKVIAIKKGDAI
        + G    W      + C G +  +++ L+ N   LP +     +AYV+QG GV+G I    PE+                        +K+   ++GD  
Subjt:  DGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGII---LPES-----------------------EEKVIAIKKGDAI

Query:  ALPFGVVTWWFNKEAIDLVVLFLGDTS
        A   GV  WW+N+   D V++ + D +
Subjt:  ALPFGVVTWWFNKEAIDLVVLFLGDTS

AT1G07750.1 RmlC-like cupins superfamily protein (e value: 3.2e-59; % identity: 73.38)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M++DLTP+LPKKVYGGDGGSY +W P++LPML +GNIGA+KLAL+KNGFA+P YSDS+KVAYVLQG+G  GI+LPE EEKVIAIK+GD+IALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FN E  +LV+LFLG+T K H +G+FT F+LTG NGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

AT2G28680.1 RmlC-like cupins superfamily protein (e value: 7.1e-59; % identity: 74.1)
Query:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW
        M++DL+P+LPKKVYGGDGGSY++W P++LPML +GNIGASKLAL+K G ALP YSDS KVAYVLQG G  GI+LPE EEKVIAIKKGD+IALPFGVVTWW
Subjt:  MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWW

Query:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG
        FN E  +LVVLFLG+T K H +G+FT+F+LTG+NGIF+G
Subjt:  FNKEAIDLVVLFLGDTSKAHISGEFTNFFLTGANGIFSG

AT5G44120.3 RmlC-like cupins superfamily protein (e value: 6.9e-06; % identity: 22.22)
Query:  PKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP--------------------------ESEEKVIA
        P  V   + G    W      + C G +  ++  ++  G  LP + ++AK+++V +G G++G ++P                          +  +KV  
Subjt:  PKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILP--------------------------ESEEKVIA

Query:  IKKGDAIALPFGVVTWWFNKEAIDLVVLFLGD--TSKAHISGEFTNFFLTGAN
        I+ GD IA   GV  W++N     LV++ + D  + +  +      F+L G N
Subjt:  IKKGDAIALPFGVVTWWFNKEAIDLVVLFLGD--TSKAHISGEFTNFFLTGAN


Sequences
CDS sequence
ATGGACATTGATTTGACTCCTCAATTGCCCAAGAAAGTCTACGGTGGTGATGGAGGTTCCTATTATTCTTGGTCTCCCAAGGACCTTCCAATGCTTTGTGAAGGAAACAT
CGGCGCCTCCAAGCTTGCCTTGAAGAAGAATGGCTTTGCTCTCCCTTTCTACTCCGATTCCGCCAAGGTTGCTTACGTTCTTCAAGGCAATGGAGTAGTTGGAATCATTC
TACCAGAATCGGAGGAGAAAGTAATTGCAATCAAGAAAGGAGATGCGATTGCTCTTCCATTCGGCGTGGTGACATGGTGGTTTAACAAAGAAGCCATTGATCTGGTGGTT
CTGTTCTTAGGCGACACATCAAAGGCTCACATATCGGGCGAGTTCACCAACTTCTTCCTAACTGGTGCCAACGGAATCTTCTCTGGCGAGCTTGGGATATGGATGAGGTG
TTGGTGA
mRNA sequence
TATAAATAGTTGAGGCTCATAATTCATATTTCCCAAATCACTTTCAAACTCCATCTTCATATCAATCTACCAATACATTTAAGCTTTCTTTCTAATTCTTTCCTATAGTA
ATGGACATTGATTTGACTCCTCAATTGCCCAAGAAAGTCTACGGTGGTGATGGAGGTTCCTATTATTCTTGGTCTCCCAAGGACCTTCCAATGCTTTGTGAAGGAAACAT
CGGCGCCTCCAAGCTTGCCTTGAAGAAGAATGGCTTTGCTCTCCCTTTCTACTCCGATTCCGCCAAGGTTGCTTACGTTCTTCAAGGCAATGGAGTAGTTGGAATCATTC
TACCAGAATCGGAGGAGAAAGTAATTGCAATCAAGAAAGGAGATGCGATTGCTCTTCCATTCGGCGTGGTGACATGGTGGTTTAACAAAGAAGCCATTGATCTGGTGGTT
CTGTTCTTAGGCGACACATCAAAGGCTCACATATCGGGCGAGTTCACCAACTTCTTCCTAACTGGTGCCAACGGAATCTTCTCTGGCGAGCTTGGGATATGGATGAGGTG
TTGGTGAAATCTCTAGTAAAGAACCAAATCGGAACTGGAATTGTGAAGCTGAAGGAGGGAACAAAGATGCCGGAGGGGAA
Protein sequence
MDIDLTPQLPKKVYGGDGGSYYSWSPKDLPMLCEGNIGASKLALKKNGFALPFYSDSAKVAYVLQGNGVVGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEAIDLVV
LFLGDTSKAHISGEFTNFFLTGANGIFSGELGIWMRCW
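
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The following is a minimal sketch in Python, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. The two test strings are the first and last 30 nucleotides of the CDS listed above.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this nuclear gene uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop line breaks and other whitespace, then normalise case.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 nucleotides of the CDS from this record.
first_codons = "ATGGACATTGATTTGACTCCTCAATTGCCC"
last_codons = "GAGCTTGGGATATGGATGAGGTGTTGGTGA"
print(translate(first_codons))
print(translate(last_codons))
```

Passing the full CDS (with line breaks left in) to `translate` should give a string that can be compared directly with the protein sequence above. The two fragments here correspond to its N-terminal and C-terminal residues.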