CuGenDBv2

Clc04G08950 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G08950
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: N-(5-amino-5-carboxypentanoyl)-L-cysteinyl-D-valine synthase
Genome location: ClcChr04:22633540..22636717
RNA-Seq Expression: Clc04G08950
Synteny: Clc04G08950
Gene Ontology terms: NA
InterPro domains: NA
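
The genome location field can be split into its parts to get the length of the gene locus. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str):
    """Split 'chrom:start..end' into (chrom, start, end, span_bp).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# Location string copied from the record above.
chrom, start, end, span = parse_location("ClcChr04:22633540..22636717")
print(chrom, start, end, span)
```

Under that assumption the locus spans 3178 bp on ClcChr04.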


Homology
GenBank top hits (hit | e value | % identity | alignment)
KAA0047320.1 uncharacterized protein E6C27_scaffold908G001380 [Cucumis melo var. makuwa] | e value: 1.0e-67 | % identity: 51.82
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVPTLPPRLDDVLLSCVILGE----------------
        MSNL LLRKALGSNFISR A +N  LPLFFRQSP FFSTE EQP  ES  + F D S T GM   PT    LDDVL SCVI  E                
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVPTLPPRLDDVLLSCVILGE----------------

Query:  -----------------------------------------------AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEH
                                                       A++HAF  I RKG L+R+ER DRSQW+LLSPYNGKTV LQGIPRNA++EDVE 
Subjt:  -----------------------------------------------AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEH

Query:  FLSGCDYDGTSINIFIRTSLV----GSLGESSPLERLAATFLLVILSRSHQNGNGAVPFTNTSNACISYKEQRLLSEQPNFDAGSPITSNSYHL------
        FLSGCDYD +SINI+   +L     G L     +  L  +F   IL R+H+NGNGAVPFTNTSNACIS+KEQRLLSEQPNF A SPITSNSYH+      
Subjt:  FLSGCDYDGTSINIFIRTSLV----GSLGESSPLERLAATFLLVILSRSHQNGNGAVPFTNTSNACISYKEQRLLSEQPNFDAGSPITSNSYHL------

Query:  -CW
         CW
Subjt:  -CW

XP_022963065.1 uncharacterized protein LOC111463377 [Cucurbita moschata] | e value: 4.5e-44 | % identity: 55.38
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------
        MS L LLRKAL S+F++ SA AN  LP+FFRQSP FFSTEGEQP  ESPAD FLDTSKT G+ +         TL   + ++L  C +            
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------

Query:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS
                     +A+++AF +I RKGRLYR+ER DRSQWD+LSPY+GKTVLLQGIPRNA++EDVE FL GCDYD TSIN+F R S
Subjt:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS

XP_022972751.1 uncharacterized protein LOC111471264 [Cucurbita maxima] | e value: 1.4e-45 | % identity: 56.45
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------
        MS L LLRKALGS+F++ SA AN  LP+FFRQSP FFSTEGEQP  E PADSFLDTSKT G+ +         TL   + ++L  C +            
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------

Query:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS
                     +A+++AF +I RKGRLYR+ER DRSQWD+LSPYNGKTVLLQGIPRNA++EDVE FL GCDYD TSIN+F R S
Subjt:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS

XP_023518188.1 uncharacterized protein LOC111781730 [Cucurbita pepo subsp. pepo] | e value: 2.7e-44 | % identity: 55.38
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------
        MS L LLRKAL S+F++ SA AN  LP+FFRQSP FFSTEGEQP +E PAD FLDTSKT G+ +         TL   + ++L  C +            
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------

Query:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS
                     +A+++AF +I RKGRLYR+ER DRSQWD+LSPYNGKTVLLQGIPRNA++EDVE FL GCDYD TSIN+F R S
Subjt:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS

XP_023544682.1 uncharacterized protein LOC111804194 [Cucurbita pepo subsp. pepo] | e value: 2.9e-43 | % identity: 55.08
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKF------------------VPTLPPRLDDV-------
        M+NL+LLRKALGS+FISRSA  N  LP+FFRQS  FFSTEGEQ   ES ADSFLDTS  TG+ +                  +      LDDV       
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKF------------------VPTLPPRLDDV-------

Query:  -----LLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSL
             ++      +A+++AF +I R+GR+YR+ER DRSQWDLLSPYNGKTVLLQGIPRNA ++DVE FLSGCDYD TSIN+F R S+
Subjt:  -----LLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSL

TrEMBL top hits (hit | e value | % identity | alignment)
A0A5A7TVY8 Uncharacterized protein | e value: 4.8e-68 | % identity: 51.82
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVPTLPPRLDDVLLSCVILGE----------------
        MSNL LLRKALGSNFISR A +N  LPLFFRQSP FFSTE EQP  ES  + F D S T GM   PT    LDDVL SCVI  E                
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVPTLPPRLDDVLLSCVILGE----------------

Query:  -----------------------------------------------AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEH
                                                       A++HAF  I RKG L+R+ER DRSQW+LLSPYNGKTV LQGIPRNA++EDVE 
Subjt:  -----------------------------------------------AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEH

Query:  FLSGCDYDGTSINIFIRTSLV----GSLGESSPLERLAATFLLVILSRSHQNGNGAVPFTNTSNACISYKEQRLLSEQPNFDAGSPITSNSYHL------
        FLSGCDYD +SINI+   +L     G L     +  L  +F   IL R+H+NGNGAVPFTNTSNACIS+KEQRLLSEQPNF A SPITSNSYH+      
Subjt:  FLSGCDYDGTSINIFIRTSLV----GSLGESSPLERLAATFLLVILSRSHQNGNGAVPFTNTSNACISYKEQRLLSEQPNFDAGSPITSNSYHL------

Query:  -CW
         CW
Subjt:  -CW

A0A6J1GDZ1 uncharacterized protein LOC111453126 | e value: 6.0e-42 | % identity: 52.94
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKF------------------VPTLPPRLDDV-------
        M+NL+LLRKALGS+F SRSA  N  LP+FFRQS  FFSTEGEQ   ES A+SFLDTS+T G+ +                  +      LDDV       
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKF------------------VPTLPPRLDDV-------

Query:  -----LLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSL
             ++      +A+++AF +I R+GR+YR+ER DRSQWDLLSPYNGK +LLQGIPRNA ++DVE FLSGCDYD TSIN+F R S+
Subjt:  -----LLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSL

A0A6J1HGY5 uncharacterized protein LOC111463377 | e value: 2.2e-44 | % identity: 55.38
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------
        MS L LLRKAL S+F++ SA AN  LP+FFRQSP FFSTEGEQP  ESPAD FLDTSKT G+ +         TL   + ++L  C +            
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------

Query:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS
                     +A+++AF +I RKGRLYR+ER DRSQWD+LSPY+GKTVLLQGIPRNA++EDVE FL GCDYD TSIN+F R S
Subjt:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS

A0A6J1I6U3 uncharacterized protein LOC111471264 | e value: 6.8e-46 | % identity: 56.45
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------
        MS L LLRKALGS+F++ SA AN  LP+FFRQSP FFSTEGEQP  E PADSFLDTSKT G+ +         TL   + ++L  C +            
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVP-------TLPPRLDDVLLSCVI------------

Query:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS
                     +A+++AF +I RKGRLYR+ER DRSQWD+LSPYNGKTVLLQGIPRNA++EDVE FL GCDYD TSIN+F R S
Subjt:  -----------LGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTS

A0A6J1INA4 uncharacterized protein LOC111477976 | e value: 2.5e-40 | % identity: 51.87
Query:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKF------------------VPTLPPRLDDV-------
        M+NL+LLRKALGS+FIS SA  N  LP+FFRQS  FFS EGEQ   ES A+SFLDT +T G+ +                  +      LDDV       
Subjt:  MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKF------------------VPTLPPRLDDV-------

Query:  -----LLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSL
             ++      +++++AF +I R+GR+YR+ER DRSQWDLLSPYNGKTVLLQGIPRNA ++DVE FLSGCDYD T IN+F R S+
Subjt:  -----LLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSL

SwissProt top hits (hit | e value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e value | % identity | alignment)
AT5G02740.1 Ribosomal protein S24e family protein | e value: 8.9e-14 | % identity: 50.77
Query:  AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSI
        A++ A   I +KG+LYR+E+  R+QWD + PY GK V L GIP NA+ +D++ FLSGC Y   SI
Subjt:  AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSI

AT5G02740.2 Ribosomal protein S24e family protein | e value: 8.9e-14 | % identity: 50.77
Query:  AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSI
        A++ A   I +KG+LYR+E+  R+QWD + PY GK V L GIP NA+ +D++ FLSGC Y   SI
Subjt:  AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSI
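
The % identity reported for each hit can be recomputed from the alignment's middle line. The sketch below assumes the usual BLAST convention that a letter on the middle line marks an identical residue and `+` marks a conservative substitution, and that identity is the number of identical columns divided by the alignment length. The function name `percent_identity` is hypothetical. Only the query and middle line are used, because the Subjt lines on this page repeat the query.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the midline shows an identical residue.

    Assumptions: BLAST-style midline (letter = identity, '+' = similar,
    space = mismatch or gap); denominator is the full alignment length.
    """
    # Pad in case trailing spaces of the midline were trimmed during copying.
    midline = midline.ljust(len(query))
    matches = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return round(100.0 * matches / len(query), 2)


# Query and middle line of the AT5G02740.1 alignment above.
query = "AFEHAFVMIRRKGRLYRMERVDRSQWDLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSI"
midline = "A++ A   I +KG+LYR+E+  R+QWD + PY GK V L GIP NA+ +D++ FLSGC Y   SI"
print(percent_identity(query, midline))
```

For this alignment, 33 of 65 columns are identical, which agrees with the 50.77 listed for the Arabidopsis hits.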


Sequences
CDS sequence
ATGTCGAACCTCAGTCTGCTCCGTAAGGCACTTGGATCCAACTTCATTTCTCGATCGGCGGCGGCAAATGATCGCCTCCCTCTGTTTTTCCGCCAATCTCCAAGTTTCTT
CTCCACGGAAGGGGAACAACCTCGAGCTGAATCGCCCGCCGATTCTTTTCTCGATACATCAAAAACAACAGGTATGAAGTTCGTTCCTACGTTGCCGCCGCGTCTGGACG
ATGTCCTTCTGTCGTGTGTGATATTGGGAGAGGCCTTTGAGCATGCTTTTGTAATGATTAGAAGAAAAGGTCGCTTGTACAGAATGGAACGGGTTGATCGTTCGCAGTGG
GACCTTCTTTCACCTTACAATGGAAAAACTGTTCTGCTGCAAGGAATACCTCGAAATGCAATGATAGAAGACGTGGAACACTTCTTATCTGGCTGTGACTATGATGGAAC
CTCAATCAATATATTTATCAGGACTTCTTTAGTTGGGAGTTTGGGAGAGTCTAGTCCTCTCGAAAGGCTAGCGGCTACCTTTTTACTTGTCATCCTTTCCAGATCCCATC
AAAATGGCAACGGTGCTGTTCCCTTCACCAACACAAGCAATGCATGCATTTCTTACAAAGAACAGAGGCTTTTGTCTGAACAACCGAATTTCGATGCAGGTTCTCCAATA
ACTAGTAACTCTTATCATCTATGTTGGGCACATTCTTTTGTTGGCTCGTTGGCTGGACGTTTTGTAGTAGTCTTTATTGAGTTATGA
mRNA sequence
ATGTCGAACCTCAGTCTGCTCCGTAAGGCACTTGGATCCAACTTCATTTCTCGATCGGCGGCGGCAAATGATCGCCTCCCTCTGTTTTTCCGCCAATCTCCAAGTTTCTT
CTCCACGGAAGGGGAACAACCTCGAGCTGAATCGCCCGCCGATTCTTTTCTCGATACATCAAAAACAACAGGTATGAAGTTCGTTCCTACGTTGCCGCCGCGTCTGGACG
ATGTCCTTCTGTCGTGTGTGATATTGGGAGAGGCCTTTGAGCATGCTTTTGTAATGATTAGAAGAAAAGGTCGCTTGTACAGAATGGAACGGGTTGATCGTTCGCAGTGG
GACCTTCTTTCACCTTACAATGGAAAAACTGTTCTGCTGCAAGGAATACCTCGAAATGCAATGATAGAAGACGTGGAACACTTCTTATCTGGCTGTGACTATGATGGAAC
CTCAATCAATATATTTATCAGGACTTCTTTAGTTGGGAGTTTGGGAGAGTCTAGTCCTCTCGAAAGGCTAGCGGCTACCTTTTTACTTGTCATCCTTTCCAGATCCCATC
AAAATGGCAACGGTGCTGTTCCCTTCACCAACACAAGCAATGCATGCATTTCTTACAAAGAACAGAGGCTTTTGTCTGAACAACCGAATTTCGATGCAGGTTCTCCAATA
ACTAGTAACTCTTATCATCTATGTTGGGCACATTCTTTTGTTGGCTCGTTGGCTGGACGTTTTGTAGTAGTCTTTATTGAGTTATGACTTGCATCTTCATATCTCAAGCC
TCCAGTAGGTTAGCAATCACATGCACATTCAAAGTGACATGAAATCTTTCTTTCTTTTCACATATTAGCAGTTTGTTTGTGCTAAAGAATCTGCATGTGCAATGAAGGAT
ATTTGGATTTTGAATTTAGGCCATGTTTACTATTCCGTAATCAAAATTGCAG
Protein sequence
MSNLSLLRKALGSNFISRSAAANDRLPLFFRQSPSFFSTEGEQPRAESPADSFLDTSKTTGMKFVPTLPPRLDDVLLSCVILGEAFEHAFVMIRRKGRLYRMERVDRSQW
DLLSPYNGKTVLLQGIPRNAMIEDVEHFLSGCDYDGTSINIFIRTSLVGSLGESSPLERLAATFLLVILSRSHQNGNGAVPFTNTSNACISYKEQRLLSEQPNFDAGSPI
TSNSYHLCWAHSFVGSLAGRFVVVFIEL
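
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base; the helper name `translate` is hypothetical. Only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 15 nucleotides of the CDS sequence above.
print(translate("ATGTCGAACCTCAGTCTGCTCCGTAAG"))
print(translate("TTTATTGAGTTATGA"))
```

The two fragments translate to MSNLSLLRK and FIEL, the first and last residues of the protein sequence listed above. Passing the full CDS, with its wrapped lines joined, should reproduce the whole protein if the assumptions hold.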