Common Functions Used in Analysis

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 7

Common Functions Used in Analysis

The Constant Function f(n) = C


For any argument n, the constant function f(n) assigns the value C. It doesn't matter what the input size n is, f(n) will always be equal
to the constant value C. The most fundamental constant function is f(n) = 1, and this is the typical constant function that is used in the
book.
The constant function is useful in algorithm analysis, because it characterizes the number of steps needed to do a basic operation on a
computer, like adding two numbers, assigning a value to some variable, or comparing two numbers. Executing one instruction a fixed
number of times also needs constant time only.
Constant algorithm does not depend on the input size.
Examples: arithmetic calculation, comparison, variable declaration, assignment statement, invoking a method or function.
The Logarithm Function f(n) = logn
It is one of the interesting and surprising aspects of the analysis of data structures and algorithms. The general form of a logarithm
function is f(n) = logbn, for some constant b > 1. This function is defined as follows:

x = logbn, if and only if bx = n

The value b is known as the base of the logarithm. Computing the logarithm function for any integer n is not always easy, but we can
easily compute the smallest integer greater than or equal to logbn, for this number is equal to the number of times we can divide n by b
until we get a number less than or equal to 1. For example, log327 is 3, since 27/3/3/3 = 1. Likewise, log212 = 4, since 12/2/2/2/2 =
0.75 <= 1.
The base-two approximating arises in the algorithm analysis, since a common operation in many algorithms is to repeatedly divide an
input in half. In fact, the most common base for the logarithm in computer science is 2. We typically leave it off when it is 2.
Logarithm function gets slightly slower as n grows. Whenever n doubles, the running time increases by a constant.
Examples: binary search.
The Linear Function f(n) = n
Another simple yet important function. Given an input value n, the linear function f assigns the value n itself.
This function arises in an algorithm analysis any time we do a single basic operation for each of n elements. For example, comparing
a number x to each element of an array of size n will require n comparisons. The linear function also represents the best running time
we hope to achieve for any algorithm that processes a collection of n inputs.
Whenever n doubles, so does the running time.
Example: print out the elements of an array of size n.
The N-Log-N Function f(n) = nlogn
This function grows a little faster than the linear function and a lot slower than the quadratic function (n2). If we can improve the
running time of solving some problem from quadratic to N-Log-N, we will have an algorithm that runs much faster in general. It
scales to a huge problem, since whenever n doubles, the running time more than doubles.

Example: merge sort, which will be discussed later.


The Quadratic Function f(n) = n2
It appears a lot in the algorithm analysis, since there are many algorithms that have nested loops, where the inner loop performs a
linear number of operations and the outer loop is performed a linear number of times. In such cases, the algorithm performs n*n = n2
operations.
The quadratic function can also be used in the context of nested loops where the first iteration of a loop uses one operation, the second
uses two operations, the third uses three operations, and so on. That is, the number of operations is 1 + 2 + 3 + ... + (n-1) + n.
For any integer n >= 1, we have 1 + 2 + 3 + ... + (n-1) + n = n*(n+1) / 2.
Quadratic algorithms are practical for relatively small problems. Whenever n doubles, the running time increases fourfold.
Example: some manipulations of the n by n array.
The Cubic Function and Other Polynomials
The cubic function f(n) = n3. This function appears less frequently in the context of the algorithm analysis than the constant, linear,
and quadratic functions. It's practical for use only on small problems. Whenever n doubles, the running time increases eightfold.
Example: n by n matrix multiplication.
The functions we have learned so far can be viewed as all being part of a larger class of functions, the polynomials. A polynomial
function is a function of the form:
.
where a0, a1, ... , an are constants, called the coefficients of the polynomial, and an != 0. Integer n, which indicates the highest power in
the polynomial, is called the degree of the polynomial.
The Exponential Function f(n) = bn
In this function, b is a positive constant, called the base, and the argument n is the exponent. In the algorithm analysis, the most
common base for the exponential function is b = 2. For instance, if we have a loop that starts by performing one operation and then
doubles the number of operations performed with each iteration, then the number of operations performed in the nth iteration is 2n.
Exponential algorithm is usually not appropriate for practical use.
Example: Towers of the Hanoi.
The Factorial Function f(n) = n!
Factorial function is even worse than the exponential function. Whenever n increases by 1, the running time increases by a factor of
n.
For example, permutations of n elements.

Comparing Growth Rates


Ideally, we would like data structure operations to run in times proportional to the constant or logarithm function, and we would like
our algorithms to run in linear or n-log-n time. Algorithms with quadratic or cubic running times are less practical, but algorithms with
exponential running times are infeasible for all but the smallest sized inputs.

Let's draw the growth rates for the above functions and take a look at the following table.

The Big-Oh Notation


Let f(n) and g(n) be functions mapping nonnegative integers to real numbers. We say that f(n) is O(g(n)) if there is a real constant c >
0 and an integer constant n0 >= 1 such that f(n) <= cg(n), for n >= n0. This definition is often referred to as the "big-Oh" notation, for it
is sometimes pronounced as "f(n) is big-Oh of g(n)." Alternatively, we can also say "f(n) is order of g(n)."

Example: The function f(n) = 8n - 2 is O(n).


Justification: By the big-Oh definition, we need to find a real constant c > 0 and an integer constant n0 >= 1 such that 8n - 2 <= cn for
every integer n >= n0. It is easy to see that a possible choice is c = 8 and n0 = 1.
More examples: 2n + 10 is O(n), 7n - 2 is O(n), 3n3 + 20n2 + 5 is O(n3), 3log n + 5 is O(log n)

The big-Oh notation allows us to say that a function f(n) is less than or equal to another function g(n) up to a constant factor and in
the asymptotic sense as n grows towards infinity. If f(n) is of the form of An + B, where A and B are constants. It's called a linear
function, and it is O(n).
The big-Oh notation gives an upper bound on the growth rate of a function. The statement "f(n) is O(g(n))" means that the growth
rate of f(n) is no more than the growth rate of g(n).
Characterizing Running Times using the Big-Oh Notation
The big-Oh notation is used widely to characterize running times and space bounds in terms of some parameter n, which varies from
problem to problem, but is always defined as a chosen measure of the size of the problem. For example, if we are interested in finding
a specific element in an array of integers, we should let n denote the number of elements of the array. Using the big-Oh notation, we
can write the following mathematically precise statement on the running time of a sequential search algorithm for any computer.
Proposition: The sequential search algorithm, for searching a specific element in an array of n integers, runs in O(n) time.
Justification: The number of primitive operations executed by the algorithm findElement in each iteration is a constant. Hence, since
each primitive operation runs in constant time, we can say that the running time of the algorithm findElement on an input of size n is
at most a constant times n, that is, we may conclude that the running time of the algorithm findElement is O(n).
Some Properties of the Big-Oh Notation
The big-Oh notation allows us to ignore constant factors and lower order terms and focus on the main components of a function that
affect its growth the most.
Example: 5n4 + 3n3 + 2n2 + 4n + 1 is O(n4).
Justification: 5n4 + 3n3 + 2n2 + 4n + 1 <= (5 + 3 + 2 + 4 + 1)n4 = cn4 , for c = 15 and n0 = 1.
Proposition: If f(n) is a polynomial of degree d, that is, f(n) = a0 + a1n + a2n2 + ... + adnd, and ad > 0, then f(n) is O(nd).
Justification: Note that, for n >= 1, we have 1 <= n <= n2 <= ... <= nd; hence f(n) = a0 + a1n + a2n2 + ... + adnd <= (a0 + a1 +... + ad)
nd; therefore, we can show that f(n) is O(nd) by defining c = a0 + a1 +... + ad and n0 = 1.
The highest degree term in a polynomial is the term that determines the asymptotic growth rate of that polynomial.
General rules: Characterizing Functions in Simplest Terms
In general we should use the big-Oh notation to characterize a function as closely as possible. For example, while it is true that f(n) =
4n3 + 3n2 is O(n5) or even O(n4), it is more accurate to say that f(n) is O(n3).
It is also considered a poor taste to include constant factors and lower order terms in the big-Oh notation. For example, it is
unfashionable to say that the function 2n3 is O(4n3 + 8nlogn), although it is completely correct. We should strive to describe the
function in the big-Oh in simplest terms.
Rules of using big-Oh:

If f(n) is a polynomial of degree d, then f(n) is O(nd). We can drop the lower order terms and constant factors.

Use the smallest/closest possible class of functions, for example, "2n is O(n)" instead of "2n is O(n2)"

Use the simplest expression of the class, for example, "3n + 5 is O(n)" instead of "3n+5 is O(3n)"

Some words of caution


It is somewhat misleading when the constant factors are very large. It is true that 10100n is O(n). When it is compared with another
running time 10nlogn, we should prefer O(nlogn) time, even though the linear-time algorithm is asymptotically faster.

Generally speaking, any algorithm running in O(nlogn) time (with a reasonable constant factor) should be considered efficient. Even
O(n2) may be fast when n is small. But O(2n) should almost never be considered efficient.
If we must draw a line between efficient and inefficient algorithms, it is natural to make this distinction be that between those
algorithms running in polynomial time and those running in exponential time. Again, be reasonable here. O(n100) is not efficient at all.
Exercises

Show that n is O(nlogn)

Show that if d(n) is O(f(n)), then ad(n) is O(f(n)), for any constant a > 0.

If f(n) is in O(g(n)), and g(n) is in O(h(n)), then f(n) is in O(h(n)).

If f1(n) is in O(g1(n)) and f2(n) is in O(g2(n)), then f1(n)*f2(n) is in O(g1(n)*g2(n)).

Big-Omega
The big-Oh notation provides an asymptotic way of saying that a function is less than or equal to another function. This big-Omega
notation provides an asymptotic way of saying that a function grows at a rate that is greater than or equal to that of another.
Let f(n) and g(n) be functions mapping nonnegative integers to real numbers. We say that f(n) is (g(n)) (pronounced "f(n) is bigOmega of g(n)") if there is a real constant c > 0 and an integer constant n0 >= 1 such that f(n) >= cg(n), for n >= n0.

Example, show that n2 is (nlogn).


Big-Theta
In addition, there is a notation that allows us to say that two functions grow at the same rate, up to constant factors. We way that f(n) is
(g(n)) (pronounced "f(n) is big-Theta of g(n)") if f(n) is O(g(n)) and f(n) is (g(n)), that is, there are real constants c1 > 0 and c2 > 0,
and an integer constant n0 >= 1 such that c1g(n) <= f(n) <= c2g(n), for n >= n0.

Example: 3nlogn + 4n + 5logn is (nlogn).


Justification: 3nlogn <= 3nlogn + 4n + 5logn <= (3 + 4 + 5) nlogn for n >= 2.
To summarize, the asymptotic notations of big-Oh, big-Omega, and big-Theta provide a convenient language for us to analyze data
structures and algorithms. They let us concentrate on the "big-picture" rather than low-level details.
Example, show that 5n2 is O(n2), (n2) and (n2).

Analyze the big-Oh of the above MergeSort.


We can actually solve the recurrence relation given above. We'll sketch how to do that here. We'll write n
instead of O(n) in the first line below because it makes the algebra much simpler.
T(n) =
=
=
=
=

2T(n/2) +
2[2T(n/4)
4T(n/4) +
...
2k T(n/2k)

n
+ n/2] + n
2n
+ kn

We know that T(1) = 1 and this is a way to end the derivation above. In particular we want T(1) to appear on
the right hand side of the = sign. This means we want:
n/2k = 1 OR n = 2k OR log2n = k

Continuing with the previous derivation we get the following since k = log2n:
=
=
=
=

2kT(n/2k) + kn
2log2n T(1) + (log2n)n
n + n [ remember that T(1) = 1 ]
O(n log n)

So we've solved the recurrence relation and its solution is what we "knew" it would be. To make this a formal
proof you would need to use induction to show that O(n log n) is the solution to the given recurrence relation,
but the "plug and chug" method shown above shows how to derive the solution --- the subsequent verification
that this is the solution is something that can be left to a more advanced algorithms class.

Recurrence Relations to Remember


Recurrence

Algorithm

Binary Search
T(n) = T(n-1) + O(1) Sequential Search
T(n) = T(n/2) + O(1)
T(n) = 2 T(n/2) +
O(1)
T(n) = T(n-1) + O(n)
T(n) = 2 T(n/2) +
O(n)

Big-Oh
Solution
O(log n)
O(n)

tree traversal

O(n)

Selection Sort (other n2 sorts)


Mergesort (average case
Quicksort)

O(n2)
O(n log n)

You might also like