; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G063530 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G063530
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCicolChr04:19322989..19358485
RNA-Seq ExpressionCcUC04G063530
SyntenyCcUC04G063530
Gene Ontology termsGO:0009853 - photorespiration (biological process)
GO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0019253 - reductive pentose-phosphate cycle (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0045261 - proton-transporting ATP synthase complex, catalytic core F(1) (cellular component)
GO:0000287 - magnesium ion binding (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016984 - ribulose-bisphosphate carboxylase activity (molecular function)
GO:0046933 - proton-transporting ATP synthase activity, rotational mechanism (molecular function)
InterPro domainsIPR001280 - Photosystem I PsaA/PsaB


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD4981938.1 hypothetical protein E3N88_18609 [Mikania micrantha]4.5e-1172.55Show/hide
Query:  RGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        + RS PSLR CTHAS+AT LAT PGA PQGCP++PP NC+TESSPKIS  A
Subjt:  RGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

KAE8652954.1 hypothetical protein Csa_017771 [Cucumis sativus]5.9e-1130.81Show/hide
Query:  SSTSEKSSEKALMNSPFSLNPVNQISITKLTENNYLAWRFQVLNVIQGHRLEEHIDEDSKIPEKLL--------------------NRRPDV--------
        SST    +  +  N    +NP + +++ KLT+NNYL W+ Q+LN I GH LE HI  DSK PEK+                       R D+        
Subjt:  SSTSEKSSEKALMNSPFSLNPVNQISITKLTENNYLAWRFQVLNVIQGHRLEEHIDEDSKIPEKLL--------------------NRRPDV--------

Query:  --------------------------CETTKELWEALSKTHASQNTAKIMQYKTQVQTLKKGDMKMNGIIWLAKVKRFLNRIIDY
                                  C T +E+W  L +T++S NTAKIMQ K Q+Q LKKG+  +    + AKVK  ++ + ++
Subjt:  --------------------------CETTKELWEALSKTHASQNTAKIMQYKTQVQTLKKGDMKMNGIIWLAKVKRFLNRIIDY

KAF6994832.1 hypothetical protein CFC21_011433 [Triticum aestivum]5.5e-1768.75Show/hide
Query:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        S L  FH  Q  ASS LHL A+ IISLPSR RS PSLR CT  SKAT LA  PGA PQGCP++PP NCNTESSPKIS  A
Subjt:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

RDY14675.1 hypothetical protein CR513_00201, partial [Mucuna pruriens]1.1e-1770Show/hide
Query:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        S L  FH  Q  A+S LHL ASRIISLPSR RS PS R CTHAS+AT LATAPGA PQGCP++PP N +TESSPKIS  A
Subjt:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

RYR42398.1 hypothetical protein Ahy_A08g038871 [Arachis hypogaea]3.2e-1759.78Show/hide
Query:  IIQYKMIPVYPLSILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        ++  +++ +   S L  FH  Q  A+S LHLLASR+ISLPSR RS PS R CTHAS+AT LATAP A PQGCP++PP N +TESSPKIS  A
Subjt:  IIQYKMIPVYPLSILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

TrEMBL top hitse value%identityAlignment
A0A371II75 Uncharacterized protein (Fragment)5.4e-1870Show/hide
Query:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        S L  FH  Q  A+S LHL ASRIISLPSR RS PS R CTHAS+AT LATAPGA PQGCP++PP N +TESSPKIS  A
Subjt:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

A0A445BUW6 Proton_antipo_M domain-containing protein1.6e-1759.78Show/hide
Query:  IIQYKMIPVYPLSILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        ++  +++ +   S L  FH  Q  A+S LHLLASR+ISLPSR RS PS R CTHAS+AT LATAP A PQGCP++PP N +TESSPKIS  A
Subjt:  IIQYKMIPVYPLSILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

T1L6T2 Uncharacterized protein4.0e-2176.25Show/hide
Query:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        S L  FH  Q  ASS LHLLASRIISLPSR RS PSLR CTHASKAT LATAPGA PQGCP++PP NC+TESSPKIS  A
Subjt:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

T1M4H7 Uncharacterized protein2.4e-1873.68Show/hide
Query:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKI
        S L  FH  Q  ASS LHL A+RIISLPSR RS PSLR CT ASKAT LA APGA PQGCP++PP NCNTESSPKI
Subjt:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKI

T1M9C2 Uncharacterized protein3.7e-1972.5Show/hide
Query:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA
        S L  FH  Q  ASS LHL A+RIISLPSR RS PSLR CT ASKAT LA APGA PQGCP++PP NCNTESSPKIS  A
Subjt:  SILRPFHLYQ-TASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYA

SwissProt top hitse value%identityAlignment
A4QKT1 Photosystem I P700 chlorophyll a apoprotein A19.5e-0446.55Show/hide
Query:  PISINFALIRVENFH------PLLTKPNGDILVSRSSRLILDKVNIGFHLPCDGLGRG
        PI +  A   V + H      P+L    G +L +RSSRLI DK N+GF  PCDG GRG
Subjt:  PISINFALIRVENFH------PLLTKPNGDILVSRSSRLILDKVNIGFHLPCDGLGRG

P51284 Photosystem I P700 chlorophyll a apoprotein A17.3e-0447.37Show/hide
Query:  PISINFALIRVENFHPLLTKPNGDILV-----SRSSRLILDKVNIGFHLPCDGLGRG
        PIS+  A   V + H         ILV     SR+SRLI DK N+GF  PCDG GRG
Subjt:  PISINFALIRVENFHPLLTKPNGDILV-----SRSSRLILDKVNIGFHLPCDGLGRG

Q19V91 Photosystem I P700 chlorophyll a apoprotein A15.6e-0445.61Show/hide
Query:  PISINFALIRVENFHPLLTKPN-----GDILVSRSSRLILDKVNIGFHLPCDGLGRG
        PIS+  A   V + H              +L +RSSRLI DKVN+GF  PCDG GRG
Subjt:  PISINFALIRVENFHPLLTKPN-----GDILVSRSSRLILDKVNIGFHLPCDGLGRG

Q1XDK4 Photosystem I P700 chlorophyll a apoprotein A19.5e-0447.37Show/hide
Query:  PISINFALIRVENFHPLLTKPNGDILV-----SRSSRLILDKVNIGFHLPCDGLGRG
        PIS+  A   V + H         ILV     SR+SRLI DK N+GF  PCDG GRG
Subjt:  PISINFALIRVENFHPLLTKPNGDILV-----SRSSRLILDKVNIGFHLPCDGLGRG

Arabidopsis top hitse value%identityAlignment
ATCG00350.1 Photosystem I, PsaA/PsaB protein3.4e-0467.86Show/hide
Query:  ILVSRSSRLILDKVNIGFHLPCDGLGRG
        +L +RSSRLI DK N+GF  PCDG GRG
Subjt:  ILVSRSSRLILDKVNIGFHLPCDGLGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGGGGACCGAGGGCGTGGAGAGTACTCACATTCTTGAATCGATCAATTAATTGGTTTGTTGTTCTATTTTATCTCTTGTCACGACGGGCACATGAGTTTTGTAA
TAGCCCTCGTCAGAACATGAACCGAAGGCGTGTTAGAGATAATATCTATTTAGAAGATGGTTCAAACCAAGAAAGCTTCAGAGCAGTGACGTTTCAAGCCAAGAAAGCTC
GATCCATGGGTTCTTCAACGAGTGAGAAATCTTCAGAGAAAGCTCTCATGAATTCCCCTTTTTCTCTTAATCCTGTAAATCAAATCTCGATTACCAAGCTTACAGAAAAT
AATTACCTGGCCTGGAGATTTCAAGTCCTCAATGTTATACAAGGTCATCGACTTGAAGAACACATTGATGAAGATTCAAAGATTCCGGAAAAATTATTAAATCGTCGTCC
AGATGTTTGCGAAACAACGAAAGAACTATGGGAGGCTCTTTCAAAAACTCATGCTTCACAAAATACTGCCAAGATTATGCAGTATAAGACACAAGTTCAGACACTTAAGA
AAGGAGATATGAAAATGAACGGGATTATCTGGCTGGCTAAAGTCAAAAGATTTTTGAACAGGATCATTGATTACGACCAAAAAATAAAACAAAACCTCTTCAAGAAATCT
ACTCGTTGTTGCTTTGCAGTTAAGCAAGAACTGAAAAGAATTATTCAGTACAAAATGATTCCAGTCTACCCTCTGTCAATCTTGCGACCTTTCCATCTGTATCAAACAGC
TAGTTCAAGACTCCATTTACTAGCCTCACGGATAATTTCATTGCCCTCACGAGGAAGATCACATCCCTCATTACGAACTTGTACACATGCTTCTAAAGCTACTCTGTTAG
CTACTGCACCAGGTGCATTACCCCAAGGGTGTCCCGAAATTCCTCCACTGAATTGTAATACAGAATCATCTCCAAAAATCTCGGCATATGCTAAACGTGAATACCACTGG
GAGCCACTAGTAGAACACCTGAACGCTCGCTTACTCCCATCTCCTCTTACTTACACATTTCCAAGAAAGTTAGATCCCATCTCGATCAATTTCGCATTGATCAGGGTGGA
AAACTTTCATCCTCTTCTTACTAAGCCAAATGGCGACATTCTAGTTTCTCGTAGCTCGCGTTTGATACTAGATAAAGTAAACATTGGTTTTCATTTACCTTGTGATGGAC
TTGGAAGAGGGAATAAACGGGCAATAGTTAAGAGAGGTGCGCCAGCGGCCTTGAAGTGCATGCCAAGGGTTTCCTCTCTCTCCTTTGATCTTCTCATCGTTCTTCAGCCG
CCGAGCACTTCATCAAAGCTGATTCAACCTCACCGTCGTCAAACTGTGAATGATGATATTGTGAATGATGATTTGAAGAATGTTGTCTTGTCTGATATTGATCCTGAAAT
TATCAATTTCTCTCCTCCAAGAAAACTAAGGTTTTTTCAAGCAAATGTTTCATCAATGGACACCACCACCATTGAGAGCCAACAGAGCTTTGGCTGGCTGAAGCAACGCT
ACCGCTCCCCGCTCGGAGCTCGAGTACTTTCGGGGTTTTCTTCAGCGTGCATCGAGCCGCATAGACTGCCGAGGCGGCAATCATGGACGGGCAATACATCATTGCCGTAT
TGTAATGCATTATACCAAGTTCAGCCAGAAAATAAACCAGATTTTCCATCTACAATACAAGAAAGTTCAATGTTTCATATAATCAGTTCTAAGTCGTTATATATTATCAA
GAAAGGGAAACCCCCCCACCCAAGTAGTACCGATTTCTCTCACCTCGTGATTGGAGTCCTTGGACGCCTTGATGAATCGAGCGAGGAAAACGTAGGGTGTAGGAACAGTC
AAGGTCCATTCCAACTTGCCAAGCGCCCAAATTTCTTCATATTTGGAAGCTATAAGCATGGCTCCAATGCCCACCAATTGCAATTCCCTTCTTGGAACTATTTTTGTCGC
AAGGAATCGATCGATTATATTGATCGTGAGGTAGAAAGTTTCGGGCGAAAGTTCAAATTTATTGTGGACATCAACCAGCCAATCCACCAAAATAGCCCTCATGGTCGAGT
TTATCTCAGGACTGAAGTGAGAGTCTGTGCTTTCTTCTTTGAGGGCCCTTCTCCTTCCTTTTTCTTGTTTGCACATTTGACTTCCTTGGCTTTAACTTTTTCGACCGTGT
CCGAGCTTATCTCGATAACCTCCGATGTCGGTTTAATTGCAACTTTCTTATGTGCTGGCTTGGGAGCTCCTGCTTTCTTAACAGCCATAACACCAGCATCAAGAATGGGA
GCAGCCCCATCTACACTAACAGGACGATTTGCCTTTGCGTCAACTCCTCGAACAGTTACCAGATTCCCAATATCACCCAATGCTCGGCGGTTCCTTGCATCTGCCGCCGC
TGCGCCCTTTGCCTGTTTTCCTCCGCCGATCGCTGCCTCGCCTTCCCTCATTAAAGAAACCAAATTCAACACAAAAGAAAGAGAAAGAATACAATTGAAGAAACACAAGA
CACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGGGGACCGAGGGCGTGGAGAGTACTCACATTCTTGAATCGATCAATTAATTGGTTTGTTGTTCTATTTTATCTCTTGTCACGACGGGCACATGAGTTTTGTAA
TAGCCCTCGTCAGAACATGAACCGAAGGCGTGTTAGAGATAATATCTATTTAGAAGATGGTTCAAACCAAGAAAGCTTCAGAGCAGTGACGTTTCAAGCCAAGAAAGCTC
GATCCATGGGTTCTTCAACGAGTGAGAAATCTTCAGAGAAAGCTCTCATGAATTCCCCTTTTTCTCTTAATCCTGTAAATCAAATCTCGATTACCAAGCTTACAGAAAAT
AATTACCTGGCCTGGAGATTTCAAGTCCTCAATGTTATACAAGGTCATCGACTTGAAGAACACATTGATGAAGATTCAAAGATTCCGGAAAAATTATTAAATCGTCGTCC
AGATGTTTGCGAAACAACGAAAGAACTATGGGAGGCTCTTTCAAAAACTCATGCTTCACAAAATACTGCCAAGATTATGCAGTATAAGACACAAGTTCAGACACTTAAGA
AAGGAGATATGAAAATGAACGGGATTATCTGGCTGGCTAAAGTCAAAAGATTTTTGAACAGGATCATTGATTACGACCAAAAAATAAAACAAAACCTCTTCAAGAAATCT
ACTCGTTGTTGCTTTGCAGTTAAGCAAGAACTGAAAAGAATTATTCAGTACAAAATGATTCCAGTCTACCCTCTGTCAATCTTGCGACCTTTCCATCTGTATCAAACAGC
TAGTTCAAGACTCCATTTACTAGCCTCACGGATAATTTCATTGCCCTCACGAGGAAGATCACATCCCTCATTACGAACTTGTACACATGCTTCTAAAGCTACTCTGTTAG
CTACTGCACCAGGTGCATTACCCCAAGGGTGTCCCGAAATTCCTCCACTGAATTGTAATACAGAATCATCTCCAAAAATCTCGGCATATGCTAAACGTGAATACCACTGG
GAGCCACTAGTAGAACACCTGAACGCTCGCTTACTCCCATCTCCTCTTACTTACACATTTCCAAGAAAGTTAGATCCCATCTCGATCAATTTCGCATTGATCAGGGTGGA
AAACTTTCATCCTCTTCTTACTAAGCCAAATGGCGACATTCTAGTTTCTCGTAGCTCGCGTTTGATACTAGATAAAGTAAACATTGGTTTTCATTTACCTTGTGATGGAC
TTGGAAGAGGGAATAAACGGGCAATAGTTAAGAGAGGTGCGCCAGCGGCCTTGAAGTGCATGCCAAGGGTTTCCTCTCTCTCCTTTGATCTTCTCATCGTTCTTCAGCCG
CCGAGCACTTCATCAAAGCTGATTCAACCTCACCGTCGTCAAACTGTGAATGATGATATTGTGAATGATGATTTGAAGAATGTTGTCTTGTCTGATATTGATCCTGAAAT
TATCAATTTCTCTCCTCCAAGAAAACTAAGGTTTTTTCAAGCAAATGTTTCATCAATGGACACCACCACCATTGAGAGCCAACAGAGCTTTGGCTGGCTGAAGCAACGCT
ACCGCTCCCCGCTCGGAGCTCGAGTACTTTCGGGGTTTTCTTCAGCGTGCATCGAGCCGCATAGACTGCCGAGGCGGCAATCATGGACGGGCAATACATCATTGCCGTAT
TGTAATGCATTATACCAAGTTCAGCCAGAAAATAAACCAGATTTTCCATCTACAATACAAGAAAGTTCAATGTTTCATATAATCAGTTCTAAGTCGTTATATATTATCAA
GAAAGGGAAACCCCCCCACCCAAGTAGTACCGATTTCTCTCACCTCGTGATTGGAGTCCTTGGACGCCTTGATGAATCGAGCGAGGAAAACGTAGGGTGTAGGAACAGTC
AAGGTCCATTCCAACTTGCCAAGCGCCCAAATTTCTTCATATTTGGAAGCTATAAGCATGGCTCCAATGCCCACCAATTGCAATTCCCTTCTTGGAACTATTTTTGTCGC
AAGGAATCGATCGATTATATTGATCGTGAGGTAGAAAGTTTCGGGCGAAAGTTCAAATTTATTGTGGACATCAACCAGCCAATCCACCAAAATAGCCCTCATGGTCGAGT
TTATCTCAGGACTGAAGTGAGAGTCTGTGCTTTCTTCTTTGAGGGCCCTTCTCCTTCCTTTTTCTTGTTTGCACATTTGACTTCCTTGGCTTTAACTTTTTCGACCGTGT
CCGAGCTTATCTCGATAACCTCCGATGTCGGTTTAATTGCAACTTTCTTATGTGCTGGCTTGGGAGCTCCTGCTTTCTTAACAGCCATAACACCAGCATCAAGAATGGGA
GCAGCCCCATCTACACTAACAGGACGATTTGCCTTTGCGTCAACTCCTCGAACAGTTACCAGATTCCCAATATCACCCAATGCTCGGCGGTTCCTTGCATCTGCCGCCGC
TGCGCCCTTTGCCTGTTTTCCTCCGCCGATCGCTGCCTCGCCTTCCCTCATTAAAGAAACCAAATTCAACACAAAAGAAAGAGAAAGAATACAATTGAAGAAACACAAGA
CACCATGA
Protein sequenceShow/hide protein sequence
MKGGPRAWRVLTFLNRSINWFVVLFYLLSRRAHEFCNSPRQNMNRRRVRDNIYLEDGSNQESFRAVTFQAKKARSMGSSTSEKSSEKALMNSPFSLNPVNQISITKLTEN
NYLAWRFQVLNVIQGHRLEEHIDEDSKIPEKLLNRRPDVCETTKELWEALSKTHASQNTAKIMQYKTQVQTLKKGDMKMNGIIWLAKVKRFLNRIIDYDQKIKQNLFKKS
TRCCFAVKQELKRIIQYKMIPVYPLSILRPFHLYQTASSRLHLLASRIISLPSRGRSHPSLRTCTHASKATLLATAPGALPQGCPEIPPLNCNTESSPKISAYAKREYHW
EPLVEHLNARLLPSPLTYTFPRKLDPISINFALIRVENFHPLLTKPNGDILVSRSSRLILDKVNIGFHLPCDGLGRGNKRAIVKRGAPAALKCMPRVSSLSFDLLIVLQP
PSTSSKLIQPHRRQTVNDDIVNDDLKNVVLSDIDPEIINFSPPRKLRFFQANVSSMDTTTIESQQSFGWLKQRYRSPLGARVLSGFSSACIEPHRLPRRQSWTGNTSLPY
CNALYQVQPENKPDFPSTIQESSMFHIISSKSLYIIKKGKPPHPSSTDFSHLVIGVLGRLDESSEENVGCRNSQGPFQLAKRPNFFIFGSYKHGSNAHQLQFPSWNYFCR
KESIDYIDREVESFGRKFKFIVDINQPIHQNSPHGRVYLRTEVRVCAFFFEGPSPSFFLFAHLTSLALTFSTVSELISITSDVGLIATFLCAGLGAPAFLTAITPASRMG
AAPSTLTGRFAFASTPRTVTRFPISPNARRFLASAAAAPFACFPPPIAASPSLIKETKFNTKERERIQLKKHKTP