CuGenDBv2

Clc00G02530 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc00G02530
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Photosystem I assembly protein ycf3
Genome location: ClcCtg023:34117..34851
RNA-Seq Expression: Clc00G02530
Synteny: Clc00G02530
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0015979 - photosynthesis (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domains: NA
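The genome location field can be split into a contig name and coordinates. The following is a minimal sketch; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


contig, start, end = parse_location("ClcCtg023:34117..34851")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 735 bp on the contig.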


Homology
GenBank top hits (e value, % identity, alignment)
ERN02875.1 hypothetical protein AMTR_s00334p00015220 [Amborella trichopoda] (e value: 4.6e-28; identity: 94.03%)
Query:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        ETLKYGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQA+ALTPGNYIEAQNWLKITRRF+
Subjt:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

KAF3433879.1 hypothetical protein FNV43_RR24982 [Rhamnella rubrinervis] (e value: 5.0e-27; identity: 92.54%)
Query:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        ETL YGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKIT+RF+
Subjt:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

KAG6573954.1 Photosystem I assembly protein Ycf3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.0e-27; identity: 83.54%)
Query:  IVECLCFI-LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        I + LC   ++KETLKYGSKGRNL AYSDRGEQAIRQGDSEIAE WFNQAAEYWKQAIAL PGN+IEA NWLKITRRFK
Subjt:  IVECLCFI-LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

KAG6735386.1 hypothetical protein POTOM_062012 [Populus tomentosa] (e value: 9.2e-29; identity: 92.86%)
Query:  LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        +L+ETLKYGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

TKS04885.1 ribosomal protein S4 [Populus alba] (e value: 9.2e-29; identity: 92.86%)
Query:  LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        +L+ETLKYGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

TrEMBL top hits (e value, % identity, alignment)
A0A4U5Q996 30S ribosomal protein S4, chloroplastic (e value: 4.5e-29; identity: 92.86%)
Query:  LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        +L+ETLKYGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  LLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

A0A7J6HQR9 Uncharacterized protein (e value: 2.2e-28; identity: 94.03%)
Query:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        ETLKYGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQA+ALTPGNYIEAQNWLKITRRF+
Subjt:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

A0A7N2N7U5 Uncharacterized protein (e value: 4.9e-28; identity: 92.54%)
Query:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        ETLKYGSKGRN PAYSDRGEQAIRQG+SEIAEAWF+QAAEYWKQAI LTPGNYIEAQNWLKITRRF+
Subjt:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

G7KJV5 Photosystem I assembly protein ycf3 (e value: 7.1e-27; identity: 89.71%)
Query:  KETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        +ETLKYGSKGRNL  YSDRGEQAIRQGDSEIAE+WF+QAAEYWKQAIALTPGNYIEAQNWLKIT RF+
Subjt:  KETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

W1P7C5 Uncharacterized protein (e value: 2.2e-28; identity: 94.03%)
Query:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        ETLKYGSKGRNL AYSDRGEQAIRQGDSEIAEAWF+QAAEYWKQA+ALTPGNYIEAQNWLKITRRF+
Subjt:  ETLKYGSKGRNLPAYSDRGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

SwissProt top hits (e value, % identity, alignment)
A0ZZ36 Photosystem I assembly protein Ycf3 (e value: 1.5e-21; identity: 96%)
Query:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        RGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

Q2L906 Photosystem I assembly protein Ycf3 (e value: 1.5e-21; identity: 96%)
Query:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        RGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

Q49KZ7 Photosystem I assembly protein Ycf3 (e value: 1.5e-21; identity: 96%)
Query:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        RGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

Q4VZH4 Photosystem I assembly protein Ycf3 (e value: 1.3e-22; identity: 100%)
Query:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
Subjt:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

Q9MTP0 Photosystem I assembly protein Ycf3 (e value: 1.5e-21; identity: 96%)
Query:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        RGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+
Subjt:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK

Arabidopsis top hits (e value, % identity, alignment)
ATCG00360.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.2e-20; identity: 88%)
Query:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
        RGEQAI+QGDSE+AEAWF QAAEYWKQAI LTPGNYIEAQNWL ITRRF+
Subjt:  RGEQAIRQGDSEIAEAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
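
The % identity values reported for the hits can be checked against the alignment midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch assuming identity is counted over the full alignment length; the helper name `percent_identity` is hypothetical and is not part of the database.

```python
def percent_identity(midline: str) -> float:
    """Percentage of alignment columns whose midline character is a letter.

    Assumption: every column of the alignment is represented in the midline,
    so its length is the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the SwissProt hit A0ZZ36 (50 columns, two '+' positions).
a0zz36 = "RGEQAIRQGDSEIAEAWF+QAAEYWKQAIALTPGNYIEAQNWLKITRRF+"
# Midline of the Arabidopsis hit ATCG00360.1 (50 columns, six non-identical).
atcg00360 = "RGEQAI+QGDSE+AEAWF QAAEYWKQAI LTPGNYIEAQNWL ITRRF+"

print(percent_identity(a0zz36))     # 96.0
print(percent_identity(atcg00360))  # 88.0
```

Both results agree with the identity values listed for these hits.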


Sequences
CDS sequence
ATGCCTAGATACTTTATTCTATGGATAAAGGATCTAATTGATAGAAGAAGCACCGGTGGATCCCCTACGATATGGAGCAGCGGTGTAGCATCAGATCCCAAAGACAGTAA
GTCTTTTCTTTCTTATCAAGGAAAGTCTTTTTCAAGGATTCTATATAAATTTCTAGATGAAACCGAGATAGTTATCTTTCAGAAAATTGTAACGATAGTGGAATGCCTAT
GCTTTATTCTTCTGAAGGAAACTCTCAAGTACGGTTCTAAGGGAAGGAATTTACCTGCCTATTCCGACCGGGGAGAACAGGCCATTCGACAGGGAGATTCTGAAATTGCG
GAGGCTTGGTTCAATCAAGCCGCTGAGTATTGGAAACAAGCCATAGCACTTACTCCCGGTAATTATATTGAAGCCCAAAATTGGTTGAAGATCACAAGGCGTTTCAAATA
A
mRNA sequence
ATGCCTAGATACTTTATTCTATGGATAAAGGATCTAATTGATAGAAGAAGCACCGGTGGATCCCCTACGATATGGAGCAGCGGTGTAGCATCAGATCCCAAAGACAGTAA
GTCTTTTCTTTCTTATCAAGGAAAGTCTTTTTCAAGGATTCTATATAAATTTCTAGATGAAACCGAGATAGTTATCTTTCAGAAAATTGTAACGATAGTGGAATGCCTAT
GCTTTATTCTTCTGAAGGAAACTCTCAAGTACGGTTCTAAGGGAAGGAATTTACCTGCCTATTCCGACCGGGGAGAACAGGCCATTCGACAGGGAGATTCTGAAATTGCG
GAGGCTTGGTTCAATCAAGCCGCTGAGTATTGGAAACAAGCCATAGCACTTACTCCCGGTAATTATATTGAAGCCCAAAATTGGTTGAAGATCACAAGGCGTTTCAAATA
A
Protein sequence
MPRYFILWIKDLIDRRSTGGSPTIWSSGVASDPKDSKSFLSYQGKSFSRILYKFLDETEIVIFQKIVTIVECLCFILLKETLKYGSKGRNLPAYSDRGEQAIRQGDSEIA
EAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK
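
The relationship between the CDS and the protein sequence can be checked by translating the CDS. The following is a minimal sketch that assumes the standard codon table applies to this gene; the `translate` helper is hypothetical, and only the last 111 nucleotides of the CDS (which start on a codon boundary) are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last 111 nt of the CDS above (final full line plus the trailing "A").
cds_tail = (
    "GAGGCTTGGTTCAATCAAGCCGCTGAGTATTGGAAACAAGCCATAGCACTTACTCCCGGT"
    "AATTATATTGAAGCCCAAAATTGGTTGAAGATCACAAGGCGTTTCAAATA" "A"
)
print(translate(cds_tail))
```

The output corresponds to the final line of the protein sequence, EAWFNQAAEYWKQAIALTPGNYIEAQNWLKITRRFK, with the terminal TAA read as the stop codon.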