Theory of numerical analysis
Theory of numerical analysis
The following is a rough categorization of the mathematical theory underlying numerical analysis, keeping in mind that there is often a great deal of overlap between the listed areas.
Numerical linear and nonlinear algebra
Many problems in applied mathematics involve solving systems of linear equations, with the linear system occurring naturally in some cases and as a part of the solution process in other cases. Linear systems are usually written using matrix-vector notation, Ax = b with A the matrix of coefficients for the system, x the column vector of the unknown variables x1,…, xn, and b a given column vector. Solving linear systems with up to 1,000 variables is now considered relatively straightforward in most cases. For small to moderately sized linear systems (say, n ≤ 1,000), the favoured numerical method is Gaussian elimination and its variants; this is simply a precisely stated algorithmic variant of the method of elimination of variables that is introduced in elementary algebra. For larger linear systems, there is a variety of approaches depending on the structure of the coefficient matrix A. Direct methods lead to a theoretically exact solution x in a finite number of steps, with Gaussian elimination the best-known example. In practice, there are errors in the computed value of x due to rounding errors in the computation, arising from the finite length of numbers in standard computer arithmetic. Iterative methods are approximate methods that create a sequence of approximating solutions of increasing accuracy.
Nonlinear problems are often treated numerically by reducing them to a sequence of linear problems. As a simple but important example, consider the problem of solving a nonlinear equation f(x) = 0. Approximate the graph of y = f(x) by the tangent line at a point x(0) near the desired root (use of parentheses is a common notational convention to distinguish successive iterations from exponentiation), and use the root of the tangent line to approximate the root of the original nonlinear function f(x). This leads to Newton’s iterative method for finding successively better approximations to the desired root:x(k +1) = x(k) − f(x(k))/f′(x(k)), k = 0, 1, 2, …,where f′(x) indicates the first derivative of the original function.
This generalizes to handling systems of nonlinear equations. Let f(x) = 0 denote a system of n nonlinear equations in n unknowns x = (x1,…, xn). Newton’s method for solving this system is given byx(k + 1) = x(k) + δ(k)f′(x(k))δ(k) = −f(x(k)), k = 0, 1, 2,…
In this, f′(x) is a generalization of the derivative known as the Jacobian matrix of f(x), and the second equation is a linear system of order n. There are numerous other approaches to solving nonlinear systems, most based on using some type of approximation involving linear functions.
An important related class of problems occurs under the heading of optimization. Given a real-valued function f(x) with x a vector of unknowns, a value of x that minimizes f(x) is sought. In some cases x is allowed to vary freely, and in other cases there are constraints on x. Such problems occur frequently in business applications.
Approximation theory
This category includes the approximation of functions with simpler or more tractable functions and methods based on using such approximations. When evaluating a function f(x) with x a real or complex number, it must be kept in mind that a computer or calculator can only do a finite number of operations. Moreover, these operations are the basic arithmetic operations of addition, subtraction, multiplication, and division, together with comparison operations such as determining whether x > y is true or false. With the four basic arithmetic operations, it is possible to evaluate polynomialsp(x) = a0 + a1x + a2x2 + ⋯ + anxnas well as rational functions (polynomials divided by polynomials). By including the comparison operations, it is possible to evaluate different polynomials or rational functions on different sets of real numbers x. The evaluation of all other functions—e.g., f(x) = Square root of√x or 2x—must be reduced to the evaluation of a polynomial or rational function that approximates the given function with sufficient accuracy. All function evaluations on calculators and computers are accomplished in this manner.
One common method of approximation is known as interpolation. Consider a set of points (xi,yi) where i = 0, 1, …, n, and then find a polynomial that satisfies p(xi) = yi for all i = 0, 1, …, n. The polynomial p(x) is said to interpolate the given data points. Interpolation can be performed with functions other than polynomials (although these are most common), with important cases being rational functions, trigonometric polynomials, and spline functions (made by connecting several polynomial functions at their endpoints—they are commonly used in statistics and computer graphics).
Interpolation has a number of applications. If a function f(x) is known only at a discrete set of data points x0, …, xn, with yi = f(xi), then interpolation can be used to extend the definition to nearby points x. If n is at all large, spline functions are generally preferable to simple polynomials.
Most numerical methods for the approximation of integrals and derivatives of a given function f(x) are based on interpolation. For example, begin by constructing an interpolating function p(x), often a polynomial, that approximates f(x), and then integrate or differentiate p(x) to approximate the corresponding integral or derivative of f(x).
Solving differential and integral equations
Most mathematical models used in the natural sciences and engineering are based on ordinary differential equations, partial differential equations, and integral equations. Numerical methods for solving these equations are primarily of two types. The first type approximates the unknown function in the equation by a simpler function, often a polynomial or piecewise polynomial (spline) function, chosen to closely follow the original equation. The finite element method discussed above is the best known approach of this type. The second type of numerical method approximates the equation of interest, usually by approximating the derivatives or integrals in the equation. The approximating equation has a solution at a discrete set of points, and this solution approximates that of the original equation. Such numerical procedures are often called finite difference methods. Most initial value problems for ordinary differential equations and partial differential equations are solved in this way. Numerical methods for solving differential and integral equations often involve both approximation theory and the solution of quite large linear and nonlinear systems of equations.
Effects of computer hardware
Almost all numerical computation is carried out on digital computers. The structure and properties of digital computers affect the structure of numerical algorithms, especially when solving large linear systems. First and foremost, the computer arithmetic must be understood. Historically, computer arithmetic varied greatly between different computer manufacturers, and this was a source of many problems when attempting to write software that could be easily ported between different computers. Variations were reduced significantly in 1985 with the development of the Institute for Electrical and Electronic Engineering (IEEE) standard for computer floating-point arithmetic. The IEEE standard has been adopted by all personal computers and workstations as well as most mainframe computers.
For large-scale problems, especially in numerical linear algebra, it is important to know how the elements of an array A or a vector x are stored in memory. Knowing this can lead to much faster transfer of numbers from the memory into the arithmetic registers of the computer, thus leading to faster programs. A somewhat related topic is that of “pipelining.” This is a widely used technique whereby the executions of computer operations are overlapped, leading to faster execution. Machines with the same basic clock speed can have very different program execution times due to differences in pipelining and differences in the way memory is accessed.
Most personal computers are sequential in their operation, but parallel computers are being used ever more widely in public and private research institutions. (See supercomputer.) Shared-memory parallel computers have several independent central processing units (CPUs) that all access the same computer memory, whereas distributed-memory parallel computers have separate memory for each CPU. Another form of parallelism is the use of pipelining of vector arithmetic operations. Numerical algorithms must be modified to run most efficiently on whatever combination of methods a particular computer employs.
Comments
Post a Comment