CuGenDBv2

Clc11G09900 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G09900
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Fibroin heavy chain
Genome location: ClcChr11:12777248..12782150
RNA-Seq Expression: Clc11G09900
Synteny: Clc11G09900
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
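
The genome location above is given as a chromosome name followed by a start..end range. A minimal sketch of splitting that string and computing the span of the locus follows. The helper name parse_location is hypothetical, and the span assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("ClcChr11:12777248..12782150")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 4903 bp, which is considerably longer than the 564-nt CDS listed further down the record.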


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0051071.1 fibroin heavy chain [Cucumis melo var. makuwa] | e-value: 4.8e-38 | identity: 75.4%
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF
        MKS +S+SPW    NNFHT IF       + KE P   LVLEAGGGAGVGCGLG GFGLVGGIGHGG SPWNHLHLVFGLG GCGVGLGLGIGQGFGYGF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF

Query:  SFQSLDSYFSDHKSNPKHK-PPLIQF
        SF+SLDSYFS   S+PKHK P LIQF
Subjt:  SFQSLDSYFSDHKSNPKHK-PPLIQF

KAF3972995.1 hypothetical protein CMV_003541 [Castanea mollissima] | e-value: 2.8e-22 | identity: 49.62%
Query:  TMKSNSNSPWN-WNTN--------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLG
        ++K++S+S WN WN N              +F  N F   +PK+   +VL  GGG G+GCG+GLGFG VGGIG+GG  PWNHL LV G+G+GCGVG+G G
Subjt:  TMKSNSNSPWN-WNTN--------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLG

Query:  IGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ
         GQG GYGFS + L+SYFS   S+   K  +I+
Subjt:  IGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ

KAG6578355.1 hypothetical protein SDJN03_22803, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 4.8e-46 | identity: 75.76%
Query:  MKSNSNSPWNWN---TNNFHT----------NIFD-----PKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI
        MKS+SNSPW WN   TNNFHT          NIFD      K+PPPLVL+AGGGAGVGCG+GLGFGLVGGIGHGG SPWNHLHLVFGLGLGCGVGLGLGI
Subjt:  MKSNSNSPWNWN---TNNFHT----------NIFD-----PKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI

Query:  GQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ
        G+G GYG SF SLDSYFSDHK NP+ KPPLIQ
Subjt:  GQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ

KGN63146.1 hypothetical protein Csa_022245 [Cucumis sativus] | e-value: 2.2e-35 | identity: 75%
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF
        MKS +SNSPW    NNF T IF       + KE P LVLEAGGGAGVGCGLG+GFGLVGGIGH G SPWNHLHLVFGLG GCGVGLGLGIGQGFGYG SF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF

Query:  QSLDSYFSDHKSNPKHK-PPLIQF
        QS+DSYFS   SNPK K P LIQF
Subjt:  QSLDSYFSDHKSNPKHK-PPLIQF

XP_023890268.1 protein TRIGALACTOSYLDIACYLGLYCEROL 5, chloroplastic [Quercus suber] | e-value: 2.2e-22 | identity: 50%
Query:  MKSNSNSPWN-WNTNN-------------------FHTNIFDPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGL
        +K+NS+S WN WN N+                    H N   PK+   +VL  GGG G+GCG+GLGFGLVGGIG+GG  PWNHL LV G+G+GCGVG+G 
Subjt:  MKSNSNSPWN-WNTNN-------------------FHTNIFDPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGL

Query:  GIGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ
        G GQG GYGFS +SL+SY S   S+   K  +I+
Subjt:  GIGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LSZ2 Uncharacterized protein | e-value: 1.1e-35 | identity: 75%
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF
        MKS +SNSPW    NNF T IF       + KE P LVLEAGGGAGVGCGLG+GFGLVGGIGH G SPWNHLHLVFGLG GCGVGLGLGIGQGFGYG SF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF

Query:  QSLDSYFSDHKSNPKHK-PPLIQF
        QS+DSYFS   SNPK K P LIQF
Subjt:  QSLDSYFSDHKSNPKHK-PPLIQF

A0A2N9FV55 Uncharacterized protein | e-value: 3.0e-22 | identity: 50.78%
Query:  MKSNSNSPWN-WNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGF
        +K++SNS WN WN NN +    + K+ P              +VL  GGG G+GCG+G+GFGLVGGIG+GG  PWNHL LVFG+G+GCGVG+G G GQG 
Subjt:  MKSNSNSPWN-WNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGF

Query:  GYGFSFQSLDSYFSDHKSNPKHKPPLIQ
        G GFS  SL+SY S   S+   K  +I+
Subjt:  GYGFSFQSLDSYFSDHKSNPKHKPPLIQ

A0A5A7UC28 Fibroin heavy chain | e-value: 2.3e-38 | identity: 75.4%
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF
        MKS +S+SPW    NNFHT IF       + KE P   LVLEAGGGAGVGCGLG GFGLVGGIGHGG SPWNHLHLVFGLG GCGVGLGLGIGQGFGYGF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF

Query:  SFQSLDSYFSDHKSNPKHK-PPLIQF
        SF+SLDSYFS   S+PKHK P LIQF
Subjt:  SFQSLDSYFSDHKSNPKHK-PPLIQF

A0A7N2KWH8 Uncharacterized protein | e-value: 2.3e-22 | identity: 49.62%
Query:  MKSNSNSPWN-WNTN-------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG
        +K++S+S WN WN N             +F  + F   +PK+   +VL  GGG G+GCG+GLGFGLVGG+G+GG  PWNHL LV G+G+GCGVG+G G G
Subjt:  MKSNSNSPWN-WNTN-------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG

Query:  QGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ
        QG GYGFS +SL+SY S   S+   K  +I+
Subjt:  QGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQ

W9S8Y9 Uncharacterized protein | e-value: 8.8e-22 | identity: 52.38%
Query:  KSNSNSPWNWNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGY
        K +SN+ WNWN +  +      K  P               VL  G GAG+GCG+GLGFGLVGG G GG   WNHL LV G+GLGCGVGLG+G GQGFGY
Subjt:  KSNSNSPWNWNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGY

Query:  GFSFQSLDSYFSDHKSNPKHKPPLIQ
        G+S +SL+S  SDH S+  +K  LIQ
Subjt:  GFSFQSLDSYFSDHKSNPKHKPPLIQ

SwissProt top hits (e-value, % identity, alignment)
Q42551 SUMO-conjugating enzyme SCE1 | e-value: 2.4e-08 | identity: 40.4%
Query:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP
        M+ GIA G L EERK WRKN     FVAKP+T  D  V LM+WHC I G A  D +           F  LTM  + + P       F    F P   P
Subjt:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP

Arabidopsis top hits (e-value, % identity, alignment)
AT1G27695.1 glycine-rich protein | e-value: 8.2e-04 | identity: 48.48%
Query:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQS
        +VL    G GVG G G+G G   G G GG+ P N L +  G G GCGVGLGLG G G  +G  ++S
Subjt:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQS

AT1G27695.2 glycine-rich protein | e-value: 8.2e-04 | identity: 48.48%
Query:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQS
        +VL    G GVG G G+G G   G G GG+ P N L +  G G GCGVGLGLG G G  +G  ++S
Subjt:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQS

AT1G66820.1 glycine-rich protein | e-value: 5.9e-10 | identity: 46.97%
Query:  GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLDSYF
        G G G+GCG G+G GL GG+G G     +H ++V G G+GCG+G G G G G G G+SF  +   F
Subjt:  GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLDSYF

AT3G57870.1 sumo conjugation enzyme 1 | e-value: 1.7e-09 | identity: 40.4%
Query:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP
        M+ GIA G L EERK WRKN     FVAKP+T  D  V LM+WHC I G A  D +           F  LTM  + + P       F    F P   P
Subjt:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP

AT4G10330.1 glycine-rich protein | e-value: 9.4e-08 | identity: 43.59%
Query:  GAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLI
        G GVGCG G G GL+GG+G G   P     L FGLG G G G+G+G G G G G ++    SY++  K +      LI
Subjt:  GAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLI


Sequences
CDS sequence
ATGTCCGAAGGTATTGCACATGGTCATCTTACGGAGGAGCGAAAGTTGTGGCGGAAGAATCTGACCGTATGGGATTTTGTAGCGAAACCAAAGACGATGTCAGATGCTAT
TGTGATTTTGATGATTTGGCATTGCATTATTATTGGAAATGCACTAGCTGATTCCAAATCAAACCAACAACGTTACAGATCTCTCTCTCTTTTTCATCATCTTACTATGA
AATCAAACTCCAATTCGCCATGGAACTGGAACACAAATAACTTCCACACCAACATTTTCGACCCAAAGGAACCGCCACCCCTAGTCTTGGAGGCCGGCGGAGGAGCTGGA
GTTGGCTGCGGCCTTGGACTCGGCTTCGGTCTCGTTGGCGGGATTGGCCACGGCGGCGTCTCGCCCTGGAATCACCTTCACCTCGTTTTCGGCCTCGGTCTCGGCTGTGG
CGTTGGCTTAGGACTTGGAATTGGCCAAGGCTTTGGCTATGGCTTCTCCTTCCAATCTCTCGATTCTTATTTCTCTGATCACAAATCCAACCCTAAACATAAGCCTCCTC
TGATTCAGTTTTGA
mRNA sequence
ATGTCCGAAGGTATTGCACATGGTCATCTTACGGAGGAGCGAAAGTTGTGGCGGAAGAATCTGACCGTATGGGATTTTGTAGCGAAACCAAAGACGATGTCAGATGCTAT
TGTGATTTTGATGATTTGGCATTGCATTATTATTGGAAATGCACTAGCTGATTCCAAATCAAACCAACAACGTTACAGATCTCTCTCTCTTTTTCATCATCTTACTATGA
AATCAAACTCCAATTCGCCATGGAACTGGAACACAAATAACTTCCACACCAACATTTTCGACCCAAAGGAACCGCCACCCCTAGTCTTGGAGGCCGGCGGAGGAGCTGGA
GTTGGCTGCGGCCTTGGACTCGGCTTCGGTCTCGTTGGCGGGATTGGCCACGGCGGCGTCTCGCCCTGGAATCACCTTCACCTCGTTTTCGGCCTCGGTCTCGGCTGTGG
CGTTGGCTTAGGACTTGGAATTGGCCAAGGCTTTGGCTATGGCTTCTCCTTCCAATCTCTCGATTCTTATTTCTCTGATCACAAATCCAACCCTAAACATAAGCCTCCTC
TGATTCAGTTTTGA
Protein sequence
MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPPPLVLEAGGGAG
VGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLDSYFSDHKSNPKHKPPLIQF
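
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of checking this on short excerpts follows, assuming the standard genetic code (the record does not say which translation table was used). The translate helper is hypothetical and written in plain Python rather than taken from any library; the two excerpts are the first 30 and last 15 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS: matches the start of the protein sequence.
print(translate("ATGTCCGAAGGTATTGCACATGGTCATCTT"))  # MSEGIAHGHL

# Last 15 nt of the CDS, including the TGA stop codon.
print(translate("CTGATTCAGTTTTGA"))  # LIQF
```

Passing the full 564-nt CDS (with its line breaks joined) to the same function would be expected to return the 187-residue protein, since 564 / 3 = 188 codons including the stop.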