greedy algorithm, scheduling

Question

I am trying to understand how Greedy Algorithm scheduling problem works.

So I've been reading and googling for a while since I could not understand Greedy algorithm scheduling problem.

We have n jobs to schedule on a single resource. The job (i) has a requested start time s(i) and finish time f(i).

There are some greedy ideas which we select...

Accept in increasing order of s ("earliest start time")
Accept in increasing order of f - s ("shortest job time")
Accept in increasing order of number of conflicts ("fewest conflicts")
Accept in increasing order of f ("earliest finish time")

And the book says the last one, accept in increasing order of f will always gives an optimal solution.

However it did not mention why it always gives optimal solution and why other 3 will not give optimal solution.

They provided the figure that says why other three will not provide optimal solution but I could not understand what it means.

Since I have low reputation, I can not post any image so I will try to draw it.

　|---| |---| |---|
|-------------------------|
increasing order of s underestimated solution

|-----------| |-----------|
　　　|-----|
increasing order of f-s underestimated solution

|----|　 |----|　|----| 　|----|

　|-----|　|-----|　|-----|

　|-----|　　　　|-----|

increasing order of number of conflicts. underestimated solution

This is what it looks like and I don't see why this is a counterexample of each scenario.

If anyone can explain why each greedy idea does/ does not work, it will be very helpful.

Thank you.

Please specify more what are your conditions. Will you know all the staring and finishing times in advance, or will new task come after some time? Are you allowed to to move your task to be processed at another time? If so, will you receive some penalty for doing so? How do you evaluate what is a good result? Is only a fully performed task worth doing? Is doing long task more important than doing shorter ones? — kajacx

vish4071 vish4071 · Accepted Answer · 2015-09-04T10:06:24

I think I can explain this.
Lets say, we have n jobs, start times as s[1..n] and finish times as f[1..n]. So if we sort it according to finish times, then, we will always be able to complete most number of tasks. Lets see, how.

If a job is finishing earlier (even if it started later in the series, a short job), then, we always have more time for later jobs. Lets assume, we have other jobs that we could start/complete in this interval so that our number of tasks could increase. Now, this is not actually possible as if any task completed before this, then that would be the one with earliest finish time so we would be working on that one. And, if any task has not been completed till now (but has started), then if we selected that, we would not have completed any task but now we actually have done one at least. So, in any case, this is the most optimal choice.
There are many possible solutions with maximum number of tasks that can be done in an interval, EFT gives one such solution. But it is always the max number possible.

I hope I could explain it well.

greedy algorithm, scheduling

2 Answers