; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G10795 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G10795
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCOPII coat assembly protein SEC16, putative
Genome locationClcChr04:24318579..24319067
RNA-Seq ExpressionClc04G10795
SyntenyClc04G10795
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK07441.1 putative COPII coat assembly protein SEC16 [Cucumis melo var. makuwa]1.6e-6183.95Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKSAAQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASA   AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFI  SIIPAIIR FRFCICL
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

XP_008462609.1 PREDICTED: uncharacterized protein LOC103500927 [Cucumis melo]1.6e-6183.95Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKSAAQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASA   AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFI  SIIPAIIR FRFCICL
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

XP_011657724.1 early nodulin-20 [Cucumis sativus]2.4e-6284.47Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKS+AQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASASA AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCIC
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFIR SI+PAIIR FRFCIC
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCIC

XP_022979075.1 uncharacterized protein LOC111478820 [Cucurbita maxima]6.6e-6082.32Show/hide
Query:  MAKSAAQITYPPPSPSPS--NLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRN
        MAKSAAQ TYPPPSPSPS  NL RRSL+PSS MPPQNG  SP PPPSPI+LRLLS KSSSQSYTSLKDILP    S++ AVNSPTAASPANSGYEI IRN
Subjt:  MAKSAAQITYPPPSPSPS--NLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRN

Query:  RLVKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        RLVKQAAWAYLQPMSASSYSAGP+ FHR WLR STGNPI  CLGFIRG IIP+IIRVFR CIC+
Subjt:  RLVKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

XP_038883302.1 uncharacterized protein LOC120074291 [Benincasa hispida]2.9e-6887.88Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGI---ASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIR
        MAKSAAQITYPPPSPS SNLR R+ +PSSPMPP NG     SPPPPPSPINLRLLSFKSSSQSYTSLKDILP    SASAAVNSPTAASPANSGYEISIR
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGI---ASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIR

Query:  NRLVKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        NRLVKQAAWAYLQPMSASSYSAGP+FFHR WLR ST NPIYGCLGFIRG+IIPAIIRVFRFCICL
Subjt:  NRLVKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

TrEMBL top hitse value%identityAlignment
A0A0A0KHJ9 Uncharacterized protein1.2e-6284.47Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKS+AQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASASA AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCIC
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFIR SI+PAIIR FRFCIC
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCIC

A0A1S3CHC6 uncharacterized protein LOC1035009277.6e-6283.95Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKSAAQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASA   AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFI  SIIPAIIR FRFCICL
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

A0A5A7SL81 Putative COPII coat assembly protein SEC167.6e-6283.95Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKSAAQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASA   AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFI  SIIPAIIR FRFCICL
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

A0A5D3C6C7 Putative COPII coat assembly protein SEC167.6e-6283.95Show/hide
Query:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL
        MAKSAAQ+TYPPPSPS   LRR + +PSSPMPP N   SPPPPPSPINL LLSFKSSSQSYTSLKDILPSASA   AAVNSPTAASPANS YEISIRNRL
Subjt:  MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        VKQAAWAYLQPMSASSYSAGP+FFHR  LR STGNPIY C GFI  SIIPAIIR FRFCICL
Subjt:  VKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

A0A6J1IS66 uncharacterized protein LOC1114788203.2e-6082.32Show/hide
Query:  MAKSAAQITYPPPSPSPS--NLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRN
        MAKSAAQ TYPPPSPSPS  NL RRSL+PSS MPPQNG  SP PPPSPI+LRLLS KSSSQSYTSLKDILP    S++ AVNSPTAASPANSGYEI IRN
Subjt:  MAKSAAQITYPPPSPSPS--NLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRN

Query:  RLVKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL
        RLVKQAAWAYLQPMSASSYSAGP+ FHR WLR STGNPI  CLGFIRG IIP+IIRVFR CIC+
Subjt:  RLVKQAAWAYLQPMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52520.1 unknown protein8.1e-0842.86Show/hide
Query:  LRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQS----YTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRLVKQAAWAYLQPMSAS
        +   +L+  S +  +NG  S       ++L L+S   SS+     YTSLKDILPS+S +  +   S   AS A SG  I+IRNRLVKQAA +YLQP S  
Subjt:  LRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQS----YTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRLVKQAAWAYLQPMSAS

Query:  SYSAGPSFFHRL
        + S+ PSF  R+
Subjt:  SYSAGPSFFHRL

AT5G06280.1 unknown protein7.8e-1141.98Show/hide
Query:  PPSPSPSNLRRRSLVPSSP---MPPQNGIASPPPPPSPINLRLLSFKSSS-QSYTSLKDIL--PSASASASAAVNSPTAASPANSGYEISIRNRLVKQAA
        PPS SP+   RR+   S+     PP+        PPS  +  L+S K SS  +YTSL+DI+  P  S+    ++N   +   + +  +ISIRNRLVKQAA
Subjt:  PPSPSPSNLRRRSLVPSSP---MPPQNGIASPPPPPSPINLRLLSFKSSS-QSYTSLKDIL--PSASASASAAVNSPTAASPANSGYEISIRNRLVKQAA

Query:  WAYLQP--MSASSYSAGPSFFHRLWLRLSTG
         +YLQP  +++S  SAG  FF R+WL LS G
Subjt:  WAYLQP--MSASSYSAGPSFFHRLWLRLSTG

AT5G06280.3 unknown protein7.8e-1141.98Show/hide
Query:  PPSPSPSNLRRRSLVPSSP---MPPQNGIASPPPPPSPINLRLLSFKSSS-QSYTSLKDIL--PSASASASAAVNSPTAASPANSGYEISIRNRLVKQAA
        PPS SP+   RR+   S+     PP+        PPS  +  L+S K SS  +YTSL+DI+  P  S+    ++N   +   + +  +ISIRNRLVKQAA
Subjt:  PPSPSPSNLRRRSLVPSSP---MPPQNGIASPPPPPSPINLRLLSFKSSS-QSYTSLKDIL--PSASASASAAVNSPTAASPANSGYEISIRNRLVKQAA

Query:  WAYLQP--MSASSYSAGPSFFHRLWLRLSTG
         +YLQP  +++S  SAG  FF R+WL LS G
Subjt:  WAYLQP--MSASSYSAGPSFFHRLWLRLSTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAATCCGCCGCCCAAATAACCTATCCACCGCCGTCGCCATCGCCCTCAAATCTCCGCCGCAGAAGCTTGGTTCCCTCCTCCCCCATGCCTCCGCAAAACGGCAT
CGCTTCCCCTCCTCCACCGCCATCTCCGATTAACTTGAGGCTCCTCTCCTTCAAGTCCTCTTCTCAATCCTACACCTCCCTCAAAGACATCCTTCCCTCCGCCTCCGCCT
CCGCCTCCGCTGCCGTCAACTCTCCCACTGCCGCTTCCCCCGCTAACTCTGGCTACGAAATCTCCATCCGCAATCGCCTCGTTAAGCAGGCCGCTTGGGCTTATCTCCAA
CCTATGTCCGCTTCTTCCTATTCCGCCGGCCCCAGTTTCTTCCACCGTCTCTGGCTCCGACTCTCCACTGGAAATCCGATCTACGGTTGTCTCGGATTCATCAGAGGCAG
TATCATTCCGGCCATAATCCGAGTCTTTCGATTCTGCATTTGCTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAATCCGCCGCCCAAATAACCTATCCACCGCCGTCGCCATCGCCCTCAAATCTCCGCCGCAGAAGCTTGGTTCCCTCCTCCCCCATGCCTCCGCAAAACGGCAT
CGCTTCCCCTCCTCCACCGCCATCTCCGATTAACTTGAGGCTCCTCTCCTTCAAGTCCTCTTCTCAATCCTACACCTCCCTCAAAGACATCCTTCCCTCCGCCTCCGCCT
CCGCCTCCGCTGCCGTCAACTCTCCCACTGCCGCTTCCCCCGCTAACTCTGGCTACGAAATCTCCATCCGCAATCGCCTCGTTAAGCAGGCCGCTTGGGCTTATCTCCAA
CCTATGTCCGCTTCTTCCTATTCCGCCGGCCCCAGTTTCTTCCACCGTCTCTGGCTCCGACTCTCCACTGGAAATCCGATCTACGGTTGTCTCGGATTCATCAGAGGCAG
TATCATTCCGGCCATAATCCGAGTCTTTCGATTCTGCATTTGCTTGTGA
Protein sequenceShow/hide protein sequence
MAKSAAQITYPPPSPSPSNLRRRSLVPSSPMPPQNGIASPPPPPSPINLRLLSFKSSSQSYTSLKDILPSASASASAAVNSPTAASPANSGYEISIRNRLVKQAAWAYLQ
PMSASSYSAGPSFFHRLWLRLSTGNPIYGCLGFIRGSIIPAIIRVFRFCICL