CuGenDBv2

Cla97C11G215830 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C11G215830
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Fibroin heavy chain
Genome location: Cla97Chr11:12732317..12737219
RNA-Seq Expression: Cla97C11G215830
Synteny: Cla97C11G215830
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0051071.1 fibroin heavy chain [Cucumis melo var. makuwa] (e value: 5.3e-37; % identity: 74.6)
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF
        MKS +S+SPW    NNFHT IF       + KE P   LVLEAGGGAGVGCGLG GFGLVGGIGHGG SPWNHLHLVFGLG GCGVGLGLGIGQGFGYGF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF

Query:  SFQSLGSYFSDHKSNPKHK-PPLIQF
        SF+SL SYFS   S+PKHK P LIQF
Subjt:  SFQSLGSYFSDHKSNPKHK-PPLIQF

KAF3972995.1 hypothetical protein CMV_003541 [Castanea mollissima] (e value: 1.1e-21; % identity: 49.62)
Query:  TMKSNSNSPWN-WNTN--------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLG
        ++K++S+S WN WN N              +F  N F   +PK+   +VL  GGG G+GCG+GLGFG VGGIG+GG  PWNHL LV G+G+GCGVG+G G
Subjt:  TMKSNSNSPWN-WNTN--------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLG

Query:  IGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ
         GQG GYGFS + L SYFS   S+   K  +I+
Subjt:  IGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ

KAG6578355.1 hypothetical protein SDJN03_22803, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.0e-45; % identity: 75)
Query:  MKSNSNSPWNWN---TNNFHT----------NIFD-----PKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI
        MKS+SNSPW WN   TNNFHT          NIFD      K+PPPLVL+AGGGAGVGCG+GLGFGLVGGIGHGG SPWNHLHLVFGLGLGCGVGLGLGI
Subjt:  MKSNSNSPWNWN---TNNFHT----------NIFD-----PKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI

Query:  GQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ
        G+G GYG SF SL SYFSDHK NP+ KPPLIQ
Subjt:  GQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ

KGN63146.1 hypothetical protein Csa_022245 [Cucumis sativus] (e value: 1.9e-34; % identity: 74.19)
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF
        MKS +SNSPW    NNF T IF       + KE P LVLEAGGGAGVGCGLG+GFGLVGGIGH G SPWNHLHLVFGLG GCGVGLGLGIGQGFGYG SF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF

Query:  QSLGSYFSDHKSNPKHK-PPLIQF
        QS+ SYFS   SNPK K P LIQF
Subjt:  QSLGSYFSDHKSNPKHK-PPLIQF

XP_023890268.1 protein TRIGALACTOSYLDIACYLGLYCEROL 5, chloroplastic [Quercus suber] (e value: 1.1e-21; % identity: 50)
Query:  MKSNSNSPWN-WNTNN-------------------FHTNIFDPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGL
        +K+NS+S WN WN N+                    H N   PK+   +VL  GGG G+GCG+GLGFGLVGGIG+GG  PWNHL LV G+G+GCGVG+G 
Subjt:  MKSNSNSPWN-WNTNN-------------------FHTNIFDPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGL

Query:  GIGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ
        G GQG GYGFS +SL SY S   S+   K  +I+
Subjt:  GIGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LSZ2 Uncharacterized protein (e value: 9.1e-35; % identity: 74.19)
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF
        MKS +SNSPW    NNF T IF       + KE P LVLEAGGGAGVGCGLG+GFGLVGGIGH G SPWNHLHLVFGLG GCGVGLGLGIGQGFGYG SF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSF

Query:  QSLGSYFSDHKSNPKHK-PPLIQF
        QS+ SYFS   SNPK K P LIQF
Subjt:  QSLGSYFSDHKSNPKHK-PPLIQF

A0A2N9FV55 Uncharacterized protein (e value: 1.2e-21; % identity: 50.78)
Query:  MKSNSNSPWN-WNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGF
        +K++SNS WN WN NN +    + K+ P              +VL  GGG G+GCG+G+GFGLVGGIG+GG  PWNHL LVFG+G+GCGVG+G G GQG 
Subjt:  MKSNSNSPWN-WNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGF

Query:  GYGFSFQSLGSYFSDHKSNPKHKPPLIQ
        G GFS  SL SY S   S+   K  +I+
Subjt:  GYGFSFQSLGSYFSDHKSNPKHKPPLIQ

A0A5A7UC28 Fibroin heavy chain (e value: 2.6e-37; % identity: 74.6)
Query:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF
        MKS +S+SPW    NNFHT IF       + KE P   LVLEAGGGAGVGCGLG GFGLVGGIGHGG SPWNHLHLVFGLG GCGVGLGLGIGQGFGYGF
Subjt:  MKS-NSNSPWNWNTNNFHTNIF-------DPKEPP--PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGF

Query:  SFQSLGSYFSDHKSNPKHK-PPLIQF
        SF+SL SYFS   S+PKHK P LIQF
Subjt:  SFQSLGSYFSDHKSNPKHK-PPLIQF

A0A7N2KWH8 Uncharacterized protein (e value: 8.8e-22; % identity: 49.62)
Query:  MKSNSNSPWN-WNTN-------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG
        +K++S+S WN WN N             +F  + F   +PK+   +VL  GGG G+GCG+GLGFGLVGG+G+GG  PWNHL LV G+G+GCGVG+G G G
Subjt:  MKSNSNSPWN-WNTN-------------NFHTNIF---DPKEPPPLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG

Query:  QGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ
        QG GYGFS +SL SY S   S+   K  +I+
Subjt:  QGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQ

W9S8Y9 Uncharacterized protein (e value: 4.4e-21; % identity: 52.38)
Query:  KSNSNSPWNWNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGY
        K +SN+ WNWN +  +      K  P               VL  G GAG+GCG+GLGFGLVGG G GG   WNHL LV G+GLGCGVGLG+G GQGFGY
Subjt:  KSNSNSPWNWNTNNFHTNIFDPKEPP-------------PLVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGY

Query:  GFSFQSLGSYFSDHKSNPKHKPPLIQ
        G+S +SL S  SDH S+  +K  LIQ
Subjt:  GFSFQSLGSYFSDHKSNPKHKPPLIQ

SwissProt top hits (e value, % identity, alignment)
Q42551 SUMO-conjugating enzyme SCE1 (e value: 2.4e-08; % identity: 40.4)
Query:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP
        M+ GIA G L EERK WRKN     FVAKP+T  D  V LM+WHC I G A  D +           F  LTM  + + P       F    F P   P
Subjt:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP

Arabidopsis top hits (e value, % identity, alignment)
AT1G66820.1 glycine-rich protein (e value: 1.3e-09; % identity: 46.97)
Query:  GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLGSYF
        G G G+GCG G+G GL GG+G G     +H ++V G G+GCG+G G G G G G G+SF  +   F
Subjt:  GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLGSYF

AT3G57870.1 sumo conjugation enzyme 1 (e value: 1.7e-09; % identity: 40.4)
Query:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP
        M+ GIA G L EERK WRKN     FVAKP+T  D  V LM+WHC I G A  D +           F  LTM  + + P       F    F P   P
Subjt:  MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPP

AT4G10330.1 glycine-rich protein (e value: 1.6e-07; % identity: 43.59)
Query:  GAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLI
        G GVGCG G G GL+GG+G G   P     L FGLG G G G+G+G G G G G ++    SY++  K +      LI
Subjt:  GAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLI


Sequences
CDS sequence
ATGTCCGAAGGTATTGCACATGGTCATCTTACGGAGGAGCGAAAGTTGTGGCGGAAGAATCTGACCGTATGGGATTTTGTAGCGAAACCAAAGACGATGTCAGAT
GCTATTGTGATTTTGATGATTTGGCATTGCATTATTATTGGAAATGCACTAGCTGATTCCAAATCAAACCAACAACGTTACAGATCTCTCTCTCTTTTTCATCAT
CTTACTATGAAATCAAACTCCAATTCGCCATGGAACTGGAACACAAATAACTTCCACACCAACATTTTCGACCCAAAGGAACCGCCACCCCTAGTCTTGGAGGCC
GGCGGAGGAGCTGGAGTTGGCTGCGGCCTTGGACTCGGCTTCGGTCTCGTTGGCGGGATTGGCCACGGCGGCGTCTCGCCCTGGAATCACCTTCACCTCGTTTTC
GGCCTCGGTCTCGGCTGTGGCGTTGGCTTAGGACTTGGAATTGGCCAAGGCTTTGGCTATGGCTTCTCCTTCCAATCTCTTGGTTCTTATTTCTCTGATCACAAA
TCCAACCCTAAACATAAGCCTCCTCTGATTCAGTTTTGA
mRNA sequence
ATGTCCGAAGGTATTGCACATGGTCATCTTACGGAGGAGCGAAAGTTGTGGCGGAAGAATCTGACCGTATGGGATTTTGTAGCGAAACCAAAGACGATGTCAGAT
GCTATTGTGATTTTGATGATTTGGCATTGCATTATTATTGGAAATGCACTAGCTGATTCCAAATCAAACCAACAACGTTACAGATCTCTCTCTCTTTTTCATCAT
CTTACTATGAAATCAAACTCCAATTCGCCATGGAACTGGAACACAAATAACTTCCACACCAACATTTTCGACCCAAAGGAACCGCCACCCCTAGTCTTGGAGGCC
GGCGGAGGAGCTGGAGTTGGCTGCGGCCTTGGACTCGGCTTCGGTCTCGTTGGCGGGATTGGCCACGGCGGCGTCTCGCCCTGGAATCACCTTCACCTCGTTTTC
GGCCTCGGTCTCGGCTGTGGCGTTGGCTTAGGACTTGGAATTGGCCAAGGCTTTGGCTATGGCTTCTCCTTCCAATCTCTTGGTTCTTATTTCTCTGATCACAAA
TCCAACCCTAAACATAAGCCTCCTCTGATTCAGTTTTGA
Protein sequence
MSEGIAHGHLTEERKLWRKNLTVWDFVAKPKTMSDAIVILMIWHCIIIGNALADSKSNQQRYRSLSLFHHLTMKSNSNSPWNWNTNNFHTNIFDPKEPPPLVLEA
GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGQGFGYGFSFQSLGSYFSDHKSNPKHKPPLIQF
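
The protein sequence above corresponds to the CDS read in frame from its first base. As a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1), the following translates a CDS fragment codon by codon. The `translate` helper and the variable names are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and the final line of the CDS shown above.
first_codons = "ATGTCCGAAGGTATTGCACATGGTCATCTT"
last_line = "TCCAACCCTAAACATAAGCCTCCTCTGATTCAGTTTTGA"
print(translate(first_codons))
print(translate(last_line))
```

The two fragments translate to MSEGIAHGHL and SNPKHKPPLIQF, matching the start and end of the protein sequence listed above. Passing the full CDS, with line breaks left in, should reproduce the whole protein.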