First Principles Differentiation
Introduction: What does it mean to find an instant rate of change?
students, imagine a car speeding up on a road 🚗. A speedometer shows how fast it is moving at one moment, not over a whole trip. In calculus, first principles differentiation is the method used to find that exact instantaneous rate of change. It helps us answer questions like: How quickly is a population growing right now? How steep is a curve at a specific point? How fast is water leaving a tank at a certain instant?
In this lesson, you will learn to:
- explain the key ideas and vocabulary behind first principles differentiation,
- use the limit definition of the derivative,
- differentiate simple functions from first principles,
- connect the method to gradients, tangents, and motion,
- see how this idea fits into the wider study of calculus.
The main idea is that we start with an average rate of change over a very small interval and let that interval shrink to zero. This is one of the most important ideas in calculus because it creates the link between difference and change at an instant.
The big idea behind first principles
Suppose a function is given by $f(x)$. If we want to know how much $f(x)$ changes when $x$ changes from $x$ to $x+h$, we look at the change in output:
$$f(x+h)-f(x)$$
To measure how fast the function changes, we divide by the input change $h$:
$$\frac{f(x+h)-f(x)}{h}$$
This is the average rate of change over the interval from $x$ to $x+h$. If $h$ is small, this ratio gives a good estimate of the gradient of the curve near $x$.
To make the interval infinitely small, we take the limit as $h \to 0$:
$$f'(x)=\lim_{h\to 0}\frac{f(x+h)-f(x)}{h}$$
This is the first principles definition of the derivative. The notation $f'(x)$ means “the derivative of $f$ at $x$.” Another common notation is $\frac{dy}{dx}$ when $y=f(x)$.
The important vocabulary includes:
- derivative: the rate of change of a function,
- gradient: the slope of a curve or tangent line,
- tangent line: a line that just touches a curve at a point and has the same instantaneous gradient,
- limit: the value a quantity approaches as the input gets close to a certain number,
- instantaneous rate of change: the rate at a single moment.
Why the limit is necessary
If you tried to find the gradient using only two points, you would always get a secant line, not a tangent line. A secant line passes through two points on the curve, so it gives an average slope. But the tangent line is about what happens at one point only.
Think of walking up a hill ⛰️. If you measure the slope between two positions one metre apart, you get an average slope. If you measure over just a few centimetres, the slope begins to look like the slope right where you are standing. The closer the two points get, the better the approximation.
That is why the limit matters. It turns “almost at a point” into “exactly at a point.” In IB Mathematics, this is one of the core ideas of calculus: using limits to define derivatives and, later, integrals.
It is also important to know that the fraction $\frac{f(x+h)-f(x)}{h}$ is only defined when $h \neq 0$. The limit process lets us study what happens as $h$ gets very close to $0$ without actually setting $h=0$ inside the fraction.
Working through a first principles example
Let us find the derivative of $f(x)=x^2$ from first principles.
Start with the definition:
$$f'(x)=\lim_{h\to 0}\frac{(x+h)^2-x^2}{h}$$
Now expand the numerator:
$$\frac{x^2+2xh+h^2-x^2}{h}$$
The $x^2$ terms cancel:
$$\frac{2xh+h^2}{h}$$
Factor out $h$:
$$\frac{h(2x+h)}{h}$$
For $h \neq 0$, cancel $h$:
$$2x+h$$
Now take the limit as $h \to 0$:
$$f'(x)=\lim_{h\to 0}(2x+h)=2x$$
So the derivative of $x^2$ is $2x$.
This result tells us the gradient of the curve $y=x^2$ at any point $x$ is $2x$. For example, at $x=3$, the gradient is $6$. At $x=-2$, the gradient is $-4$.
This also helps with tangent lines. If the point is $(3,9)$ on the curve $y=x^2$, then the tangent line has gradient $6$. Using point-slope form, the tangent line is
$$y-9=6(x-3)$$
which simplifies to
$$y=6x-9$$
Another example: a linear function and a constant derivative
Now consider $f(x)=5x-7$.
Using first principles:
$$f'(x)=\lim_{h\to 0}\frac{[5(x+h)-7]-(5x-7)}{h}$$
Simplify the numerator:
$$\frac{5x+5h-7-5x+7}{h}$$
This becomes
$$\frac{5h}{h}$$
so the expression is $5$, and therefore
$$f'(x)=\lim_{h\to 0}5=5$$
This makes sense because a straight line has the same gradient everywhere. The derivative of a linear function is constant.
This example is useful because it shows the derivative is not just a formula to memorize. It measures the steepness of the graph, and straight lines have unchanging steepness.
Common algebra skills needed in first principles
First principles differentiation often looks harder than other methods because it needs careful algebra before the limit can be taken. students, these are the main skills you should use:
- Expanding brackets: for example, $(x+h)^2=x^2+2xh+h^2$.
- Collecting like terms: combine terms such as $x^2-x^2=0$.
- Factoring: especially taking out a common factor of $h$.
- Cancelling carefully: only cancel factors, not terms.
- Using limit laws: once simplification is complete, let $h \to 0$.
A common mistake is trying to substitute $h=0$ too early. If you do that in the fraction $\frac{f(x+h)-f(x)}{h}$, you get division by zero, which is undefined. The correct order is: simplify first, then take the limit.
Another common mistake is forgetting that the derivative is a function of $x$, not just one number. For $f(x)=x^2$, the derivative is $f'(x)=2x$, which changes with $x$.
What first principles tells us about calculus
First principles differentiation is more than just one method. It is the foundation of the derivative itself. Many later differentiation rules are built from this idea.
For example:
- the power rule can be derived from first principles for polynomial functions,
- the product rule, quotient rule, and chain rule extend differentiation to more complicated expressions,
- derivatives are used in kinematics to find velocity and acceleration,
- derivatives help solve optimization problems, such as finding maximum profit or minimum cost.
In physics, if the position of an object is $s(t)$, then the velocity is the derivative $s'(t)$, and acceleration is the derivative of velocity, written as $s''(t)$. The first principles idea explains why velocity is the rate of change of position at a specific moment.
In real life, this matters a lot. A phone’s location data may change over time, a company’s sales may grow at different rates, and the spread of a disease may speed up or slow down. Calculus gives a precise way to study all these changes.
Conclusion
First principles differentiation is the starting point for understanding derivatives in calculus. It defines the derivative using the limit
$$f'(x)=\lim_{h\to 0}\frac{f(x+h)-f(x)}{h}$$
and shows how an average rate of change becomes an instantaneous rate of change. It connects algebra, limits, gradients, tangent lines, and real-world change.
For IB Mathematics: Analysis and Approaches HL, this topic is essential because it builds the logic behind all later differentiation techniques. If you understand first principles, you understand why derivatives exist and what they measure. That makes the rest of calculus much easier to learn and much more meaningful ✅
Study Notes
- First principles differentiation is the limit definition of the derivative.
- The derivative measures instantaneous rate of change and gradient.
- The formula is $f'(x)=\lim_{h\to 0}\frac{f(x+h)-f(x)}{h}$.
- Start by substituting $x+h$ into the function, then subtract $f(x)$.
- Simplify the numerator algebraically before taking the limit.
- Never substitute $h=0$ directly into the fraction.
- For $f(x)=x^2$, the derivative from first principles is $f'(x)=2x$.
- For $f(x)=5x-7$, the derivative from first principles is $f'(x)=5$.
- A secant line gives an average gradient; a tangent line gives an instantaneous gradient.
- First principles forms the foundation for later differentiation rules and applications in kinematics, optimisation, and modelling.
