Calculus · Proof

Proof of the Chain Rule

Many functions encountered in calculus are compositions: a function applied to the output of another function. The chain rule tells you how to differentiate such compositions by working from the outside in, multiplying the derivative of each layer together.

Statement

Let g be differentiable at x and let f be differentiable at g(x). Then the composite function F(x) = f(g(x)) is differentiable at x and:

F′(x) = (f∘g)′(x) = f′(g(x)) · g′(x)

In Leibniz notation, if y = f(u) and u = g(x), then:

dy/dx = (dy/du) · (du/dx)

Proof

Set u = g(x) and write the difference quotient for F(x) = f(g(x)):

F′(x) = limh→0 [f(g(x+h)) − f(g(x))] / h

Let Δu = g(x+h) − g(x), so g(x+h) = g(x) + Δu = u + Δu. Note that Δu → 0 as h → 0 (because g is differentiable, hence continuous, at x).

Rewrite the numerator using Δu:

[f(u + Δu) − f(u)] / h

Multiply and divide by Δu (valid whenever Δu ≠ 0):

= [f(u + Δu) − f(u)] / Δu · Δu / h

As h → 0, the first factor tends to f′(u) = f′(g(x)) (by the definition of the derivative of f at u), and the second factor Δu/h = [g(x+h)−g(x)]/h tends to g′(x). Therefore:

F′(x) = f′(g(x)) · g′(x)

Note: The argument above is valid when Δu ≠ 0 for small h. A more careful proof handles the case Δu = 0 by defining an auxiliary function that removes the division; the conclusion is the same.

Common Mistakes

  • Forgetting the inner derivative: d/dx cos(5x) = −sin(5x)·5, not just −sin(5x). The factor 5 comes from d/dx (5x) = 5.
  • Evaluating the outer derivative at x instead of g(x): For d/dx (x²+1)³, the outer derivative is 3u² evaluated at u = x²+1, giving 3(x²+1)². The inner derivative is 2x. Result: 6x(x²+1)².
  • Confusing order of composition: f(g(x)) and g(f(x)) are different. Always identify which function is innermost.

Worked Example

Differentiate F(x) = ln(x² + 1).

  • Outer function: f(u) = ln(u), so f′(u) = 1/u.
  • Inner function: g(x) = x² + 1, so g′(x) = 2x.
  • Apply the chain rule: F′(x) = f′(g(x))·g′(x) = 1/(x²+1) · 2x.
F′(x) = 2x / (x² + 1)

Check at x = 1: F′(1) = 2·1/(1+1) = 1. Numerically, [ln(1.000001² + 1) − ln(2)] / 0.000001 ≈ 1.000, confirming the result.

Frequently Asked Questions

What does the chain rule state?

If g is differentiable at x and f is differentiable at g(x), then the composite function (f∘g)(x) = f(g(x)) is differentiable at x and its derivative is f′(g(x))·g′(x). The outer derivative is evaluated at the inner function.

What is the most common mistake when applying the chain rule?

Forgetting to multiply by the derivative of the inner function. For example, d/dx sin(x²) = cos(x²)·2x, not just cos(x²). The '·2x' part comes from differentiating the inner function x².

How do you identify the 'inner' and 'outer' functions?

The outer function is the last operation you would apply if computing by hand. For sin(x²), you first compute x² (inner), then take the sine (outer). For e^(3x), you first compute 3x (inner), then exponentiate (outer).

Can the chain rule be applied multiple times?

Yes. For deeply nested compositions like f(g(h(x))), the chain rule applies twice: the derivative is f′(g(h(x)))·g′(h(x))·h′(x). Each layer contributes a multiplicative factor.

Adapted and rewritten from Silas Maths source notes; formulas reviewed for CalxSolver publication.