CuGenDBv2

CmUC11G211560 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC11G211560
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Photosystem I P700 chlorophyll a apoprotein A1
Genome location: CmU531Chr11:7822688..7826672
RNA-Seq Expression: CmUC11G211560
Synteny: CmUC11G211560
Gene Ontology terms:
    GO:0009853 - photorespiration (biological process)
    GO:0018298 - protein-chromophore linkage (biological process)
    GO:0019253 - reductive pentose-phosphate cycle (biological process)
    GO:0005739 - mitochondrion (cellular component)
    GO:0009507 - chloroplast (cellular component)
    GO:0009522 - photosystem I (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0016984 - ribulose-bisphosphate carboxylase activity (molecular function)
    GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
    GO:0000287 - magnesium ion binding (molecular function)
    GO:0016168 - chlorophyll binding (molecular function)
    GO:0004497 - monooxygenase activity (molecular function)
InterPro domains:
    IPR036408 - Photosystem I PsaA/PsaB superfamily
    IPR036376 - Ribulose bisphosphate carboxylase, large subunit, C-terminal domain superfamily
    IPR006243 - Photosystem I PsaA
    IPR001280 - Photosystem I PsaA/PsaB
    IPR000685 - Ribulose bisphosphate carboxylase, large subunit, C-terminal
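
The genome location in this record is written as a sequence name, a colon, and a start..end pair. As a minimal sketch of how such a string might be split into its parts, the Python below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("CmU531Chr11:7822688..7826672")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3985 bases on CmU531Chr11.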


Homology
GenBank top hits (e value, % identity, alignment)
KAG8499239.1 hypothetical protein CXB51_005773 [Gossypium anomalum] (e value: 1.5e-23; identity: 67.31%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRYGRRSNLNLRQW------------ILCNP--RRFRSGIEPSITTS
        H  ALTEIFGDDSVLQFGGGTLGH WGNAPGAVANRVALEACVQARNEGRDLA R G        +W            IL  P    F S IEPSITTS
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRYGRRSNLNLRQW------------ILCNP--RRFRSGIEPSITTS

Query:  STHP
        STHP
Subjt:  STHP

QHO47946.1 Ribulose bisphosphate carboxylase large chain [Arachis hypogaea] (e value: 9.7e-23; identity: 66.3%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL--------------------ALRYGRRSNLNLRQWILCNP
        H  ALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL                    A    +  NLN +QWILCNP
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL--------------------ALRYGRRSNLNLRQWILCNP

RYR79961.1 hypothetical protein Ahy_A01g004755 [Arachis hypogaea] (e value: 9.7e-23; identity: 66.3%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL--------------------ALRYGRRSNLNLRQWILCNP
        H  ALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL                    A    +  NLN +QWILCNP
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL--------------------ALRYGRRSNLNLRQWILCNP

THG10007.1 hypothetical protein TEA_024123 [Camellia sinensis var. sinensis] (e value: 2.2e-22; identity: 58.97%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRYGRRSNLNLRQW-----ILCNPRRFRSGIEPSITTSSTHPGQESS
        H  ALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLA R G        +W       C   +      P++ T     GQE+ 
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRYGRRSNLNLRQW-----ILCNPRRFRSGIEPSITTSSTHPGQESS

Query:  MEKRWLNSMLSNGEVEY
        MEK   NS+LS  E+E+
Subjt:  MEKRWLNSMLSNGEVEY

YP_004927583.1 orf170 [Brassica carinata] (e value: 6.3e-22; identity: 78.38%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA        RTW+P + R
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR

TrEMBL top hits (e value, % identity, alignment)
A0A1D6WKI3 Orf170 (e value: 3.0e-22; identity: 78.38%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA        RTW+P + R
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR

A0A445EX07 Ribulose bisphosphate carboxylase large chain (e value: 4.7e-23; identity: 66.3%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL--------------------ALRYGRRSNLNLRQWILCNP
        H  ALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL                    A    +  NLN +QWILCNP
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDL--------------------ALRYGRRSNLNLRQWILCNP

A0A4V3WMU6 Photosystem I assembly protein Ycf4 (e value: 1.0e-22; identity: 58.97%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRYGRRSNLNLRQW-----ILCNPRRFRSGIEPSITTSSTHPGQESS
        H  ALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLA R G        +W       C   +      P++ T     GQE+ 
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRYGRRSNLNLRQW-----ILCNPRRFRSGIEPSITTSSTHPGQESS

Query:  MEKRWLNSMLSNGEVEY
        MEK   NS+LS  E+E+
Subjt:  MEKRWLNSMLSNGEVEY

G4XYP4 Orf170 (e value: 3.0e-22; identity: 78.38%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA        RTW+P + R
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR

R9QTB6 Uncharacterized protein (e value: 3.0e-22; identity: 78.38%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA        RTW+P + R
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYR

SwissProt top hits (e value, % identity, alignment)
A4QJB5 Photosystem I P700 chlorophyll a apoprotein A1 (e value: 2.0e-23; identity: 92.98%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA

A4QJJ9 Photosystem I P700 chlorophyll a apoprotein A1 (e value: 2.0e-23; identity: 92.98%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA

A4QK18 Photosystem I P700 chlorophyll a apoprotein A1 (e value: 2.0e-23; identity: 92.98%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA

Q3V534 Photosystem I P700 chlorophyll a apoprotein A1 (e value: 2.0e-23; identity: 92.98%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA

Q70Y03 Photosystem I P700 chlorophyll a apoprotein A1 (e value: 2.0e-23; identity: 92.98%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA

Arabidopsis top hits (e value, % identity, alignment)
ATCG00340.1 Photosystem I, PsaA/PsaB protein (e value: 6.7e-06; identity: 41.07%)
Query:  GITSELQLYCTAIGALVIAALMLFAGWFHYH-KAAPKLAWFQDVESMLNHHLADAF
        G+ +   LY  A+  L ++AL L  GW H   K  P+++WF++ ES LNHHL+  F
Subjt:  GITSELQLYCTAIGALVIAALMLFAGWFHYH-KAAPKLAWFQDVESMLNHHLADAF

ATCG00350.1 Photosystem I, PsaA/PsaB protein (e value: 1.4e-24; identity: 92.98%)
Query:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
        I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA
Subjt:  IQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA

ATCG00490.1 ribulose-bisphosphate carboxylases (e value: 4.2e-24; identity: 94.44%)
Query:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLAL
        H  ALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLA+
Subjt:  HASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLAL
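
The % identity values in the hit lists above can be related to the middle line of each alignment, which shows a letter where query and subject agree, a `+` for a conservative substitution, and a space for a mismatch or gap. The Python sketch below counts the letters on that middle line and divides by the alignment length. The function name `percent_identity` is hypothetical, and the sketch assumes a single-block alignment whose leading indentation has already been stripped; it is an illustration of the arithmetic, not the database's own calculation.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style alignment middle line.

    Letters mark identical positions; '+' (similar residue) and ' '
    (mismatch or gap) count toward the length but not the identities.
    """
    if not midline:
        raise ValueError("empty alignment middle line")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / len(midline), 2)


# Middle line of the 57-column photosystem I alignments listed above,
# with the leading indentation removed.
psaA_midline = "I R  GITSELQLYCTAIGALV AALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLA"
print(len(psaA_midline), percent_identity(psaA_midline))  # 57 92.98
```

Here 53 of 57 columns are identical, which reproduces the 92.98 reported for those hits.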


Sequences
CDS sequence
ATGCCACTATTTGCTATAGTTGAATTATATATACACATGAGCAACTTCACTTTCGGTGTTCGGCGCTCCCTTCGAACTGATATCCAAAGATTCCCTGGAATAACT
AGTGAATTACAACTCTATTGTACCGCAATTGGTGCATTAGTCATTGCAGCCTTAATGCTTTTTGCTGGTTGGTTTCATTATCACAAGGCTGCTCCTAAATTGGCT
TGGTTCCAAGATGTCGAATCCATGTTGAATCACCATTTAGCGGATGCTTTCAGCAGTTATCCGCTCCGCACTTGGCTACCCAGCGTTTACCGTGGGCACGATAAC
TGGTACACCAGAGGTTATCGAGATTCTCGTTCCACTCGAAAGATTCTTGTTATTGCTTCAACCCATATTCCCCAAAAAGCCAGCCCATTCGACGAAGCAATCTTG
GACCCTCTTGGCTGCGCTGTGCTTGGATGCACTTGTATTCCACACGCCAGCGCTCTAACCGAGATTTTTGGAGATGATTCTGTACTACAATTCGGTGGAGGAACT
TTGGGGCACCCTTGGGGTAATGCACCTGGTGCCGTAGCTAACCGAGTAGCTCTAGAAGCATGTGTACAAGCTCGTAATGAGGGACGTGATCTTGCTCTGAGGTAT
GGAAGGAGATCAAATTTGAATTTGAGGCAATGGATACTTTGTAATCCACGTCGTTTCAGATCCGGGATCGAGCCAAGTATCACAACTTCTTCTACCCATCCTGGG
CAAGAAAGCTCTATGGAAAAAAGATGGTTAAATTCGATGTTGTCTAACGGCGAGGTAGAATACAGCTTTGTTCCAATCGAGTCTAAGGCAAAGCAAAAGTGGTTC
GTATACCTATTTTTTGACCTACTTGACGAAAGGGAACGCACTTTGTTTCGTTTCCCAGTTCAAATCCGGGTGTCGCCTCATCAACAAACAAAAGAATCGAAATAC
CTTCTTCTGTTTTGCTGA
mRNA sequence
ATGCCACTATTTGCTATAGTTGAATTATATATACACATGAGCAACTTCACTTTCGGTGTTCGGCGCTCCCTTCGAACTGATATCCAAAGATTCCCTGGAATAACT
AGTGAATTACAACTCTATTGTACCGCAATTGGTGCATTAGTCATTGCAGCCTTAATGCTTTTTGCTGGTTGGTTTCATTATCACAAGGCTGCTCCTAAATTGGCT
TGGTTCCAAGATGTCGAATCCATGTTGAATCACCATTTAGCGGATGCTTTCAGCAGTTATCCGCTCCGCACTTGGCTACCCAGCGTTTACCGTGGGCACGATAAC
TGGTACACCAGAGGTTATCGAGATTCTCGTTCCACTCGAAAGATTCTTGTTATTGCTTCAACCCATATTCCCCAAAAAGCCAGCCCATTCGACGAAGCAATCTTG
GACCCTCTTGGCTGCGCTGTGCTTGGATGCACTTGTATTCCACACGCCAGCGCTCTAACCGAGATTTTTGGAGATGATTCTGTACTACAATTCGGTGGAGGAACT
TTGGGGCACCCTTGGGGTAATGCACCTGGTGCCGTAGCTAACCGAGTAGCTCTAGAAGCATGTGTACAAGCTCGTAATGAGGGACGTGATCTTGCTCTGAGGTAT
GGAAGGAGATCAAATTTGAATTTGAGGCAATGGATACTTTGTAATCCACGTCGTTTCAGATCCGGGATCGAGCCAAGTATCACAACTTCTTCTACCCATCCTGGG
CAAGAAAGCTCTATGGAAAAAAGATGGTTAAATTCGATGTTGTCTAACGGCGAGGTAGAATACAGCTTTGTTCCAATCGAGTCTAAGGCAAAGCAAAAGTGGTTC
GTATACCTATTTTTTGACCTACTTGACGAAAGGGAACGCACTTTGTTTCGTTTCCCAGTTCAAATCCGGGTGTCGCCTCATCAACAAACAAAAGAATCGAAATAC
CTTCTTCTGTTTTGCTGA
Protein sequence
MPLFAIVELYIHMSNFTFGVRRSLRTDIQRFPGITSELQLYCTAIGALVIAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLADAFSSYPLRTWLPSVYRGHDN
WYTRGYRDSRSTRKILVIASTHIPQKASPFDEAILDPLGCAVLGCTCIPHASALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVQARNEGRDLALRY
GRRSNLNLRQWILCNPRRFRSGIEPSITTSSTHPGQESSMEKRWLNSMLSNGEVEYSFVPIESKAKQKWFVYLFFDLLDERERTLFRFPVQIRVSPHQQTKESKY
LLLFC
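
The protein sequence corresponds to a codon-by-codon reading of the CDS sequence. As a rough sketch of that relationship, the Python below translates a nucleotide string three bases at a time and stops at the first stop codon. It assumes the standard genetic code; the record does not say which translation table was used, and `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code (ASSUMED), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCCACTATTTGCT"))     # first five codons of the CDS: MPLFA
print(translate("CTTCTTCTGTTTTGCTGA"))  # last line of the CDS: LLLFC
```

Both fragments agree with the start and end of the protein sequence listed in the record; pasting the full CDS into `translate` is one way to check the remaining residues.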