A Comprehensive Guide To Projection Matrices In Computer Graphics

Abstract¶

We derive perspective projection and orthographic projection matrices from first principles using projective geometry. After establishing the topological manifold structure of real projective space, we find matrix representations for linear, affine, and projective transformations in real projective space. We then formulate the projection specifications in a coordinate-independent manner, leading to general formulas for perspective and orthographic projection matrices. Finally, we apply these results to succinctly calculate the projection matrices for OpenGL, Vulkan, Metal, and DirectX. This process illustrates why real projective space is a convenient setting for solving computer graphics problems.

Keywords:computer-graphicscomputer-sciencelinear-algebrarenderingcomputer-visiongeometric-modelingprojective-geometryapplied-computing¶

Introduction¶

We construct the perspective and orthographic projection matrices common to computer graphics in a very general way. The usual parametrizations are defined with respect to specific coordinate system and a specific canonical view volume, such that the view volume parameters define the view volume planes directly in terms of coordinates. This is how most computer graphics books construct the projection transformations.

Rendering is done in the manifold $\mathbb{RP}^{3}$ in homogeneous coordinates for multiple reasons:

Affine transformations become linear transformations one dimension higher, so translations can be treated like any other linear transformations.
Transformations in $\mathbb{RP}^{3}$ are well-defined under changes in scale, so we can handle projective transformations in a unified fashion with affine transformations.
Coordinate systems and scales become equivalent, allowing spatial simulation problems to be expressed directly in a coordinate independent way.
We can change coordinate systems to whatever coordinate system makes the problem at hand convenient to work with.
For very practical reasons, we can express the problem in a coordinate system that affords the best numerical precision possible on the hardware.

A platform such as OpenGL, Vulkan, DirectX, Metal, or WebGPU typically defines its normalized device coordinate system as one that is a numerically favorable coordinate system. Any spatial computing domain whose problems are formulated in Euclidean space can take advantage of the manifold $\mathbb{RP}^{3}$ . Such domains include computer graphics, computer vision, geometric modeling, and robotics. In the context of computer graphics, the platform coordinate system is whichever one maps the view volume in the viewing space to the canonical view volume defined by the platform interface. In particular, the canonical view volume tends to be parametrized by either $[-1, 1] \times [-1, 1] \times [-1, 1]$ or $[-1, 1] \times \ [-1, 1] \times [0, 1]$ . In either case, transforming the problem to a unit interval adds one free bit of extra precision when working with floating point numbers. This maximizes the accuracy of floating point computations on the GPU, including tasks such as intersection testing, depth testing, texture sampling, and clipping algorithms.

We parametrize the view space view volume in a slightly different way than the usual one to make the perspective view volume specification coordinate independent. This allows us to construct the matrices for any specific view space coordinate system, perspective view volume, orthographic view volume, normalized device coordinate system, and canonical view volume. We define a canonical set of transformations where the view space is a left-handed orthonormal frame where the horizontal axis points right, the vertical axis points up, and the depth axis points into the view volume, and a clip coordinate system with a left-handed orthonormal frame where the horizontal axis points right, the vertical axis points up, and the depth axis points into the view volume. We show how to construct a general perspective projection or orthographic projection from any source view coordinate system to any target clip coordinate system.

The Topological Manifold Structure Of Real Projective Space¶

This section establishes the topological structure and manifold structure of the real projective space $\mathbb{RP}^{3}$ . It can be skipped if the reader so desires.

We define real projective space $\mathbb{RP}^{3}$ as follows. Define $\mathbf{w_{1}} \sim \mathbf{w_{2}}$ if an only if there exists a nonzero real number $\lambda \in \mathbb{R} - \{0\}$ such that $\mathbf{w_{1}} = \lambda \mathbf{w_{2}}$ . The relation $\sim$ is an equivalence relation. To prove this, we need to show that $\sim$ is reflexive, symmetric, and transitive. For reflexivity, trivially $\mathbf{w} = \mathbf{w}$ so we can take $\lambda = 1$ which shows that $\mathbf{w} \sim \mathbf{w}$ . To show symmetry, suppose that $\mathbf{w}_{1} \sim \mathbf{w}_{2}$ . Then there exists $\lambda \in \mathbb{R} - \{ 0 \}$ such that $\mathbf{w}_{2} = \lambda \mathbf{w}_{1}$ implying that $\mathbf{w}_{1} = \frac{1}{\lambda} \mathbf{w}_{2}$ hence $\mathbf{w}_{2} \sim \mathbf{w}_{1}$ . Now it remains to prove that $\sim$ is transitive. Suppose that $\mathbf{w}_{1} \sim \mathbf{w}_{2}$ and $\mathbf{w}_{2} \sim \mathbf{w}_{3}$ . Then there exist real numbers $\mathbf{\lambda}, \mathbf{\mu} \in \mathbb{R} - \{ 0 \}$ such that $\mathbf{w}_{2} = \lambda \mathbf{w}_{1}$ and $\mathbf{w}_{3} = \mu \mathbf{w}_{2}$ . This implies that

\mathbf{w}_{3} = \mu \mathbf{w}_{2} = \mu \left( \lambda \mathbf{w}_{1} \right) = \left( \mu \lambda \right) \mathbf{w}_{1} = \mu \lambda \mathbf{w}_{1}

(1)

implying that $\mathbf{w}_{1} \sim \mathbf{w}_{3}$ . This proves transitivity. Therefore, $\sim$ is and equivalence relation.

We define the real projective space by $\mathbb{RP}^{3} = ( \mathbb{R}^{4} - \{ \mathbf{0}\} )/\sim$ . The real projective space identifies lines through the origin in $\mathbb{R}^{3}$ with points in $\mathbb{RP}^{3}$ .

Define a map $\pi : \mathbb{R}^{4} - \{\mathbf{0}\} \rightarrow \mathbb{RP}^{3}$ by

\pi\left( P \right) = \begin{bmatrix} P \end{bmatrix}

(2)

where $[.]$ on the right-hand side indicates the equivalence class of $\begin{pmatrix} P^{T}, w \end{pmatrix}^{T}$ . The function $\pi$ is surjective. To show this, suppose that $[P]$ is a homogeneous point in $\mathbb{RP}^{3}$ . Since $[P]$ is an equivalence class, it is nonempty, so there is at least one element in $[P]$ , namely $P$ itself. Since the map $\pi$ maps elements to its equivalence class, we obtain $\pi(P) = [P]$ . Since the equivalence class $[P]$ was chosen arbitrarily, the function $\pi$ is surjective.

Now that we have established that $\pi$ is surjective, we can use $\pi$ to define a topology on $\mathbb{RP}^{3}$ . We say that a set $U \subset \mathbb{RP}^{3}$ is open if and only if the inverse image $\pi^{-1}(U)$ is open in $\mathbb{R}^{4}$ . In particular, we define the topology of $\mathbb{RP}^{3}$ to be the quotient topology induced by $\pi$ . The real projective space $\mathbb{RP}^{3}$ in conjunction with the quotient toplogy induced by $\pi$ is a topological space. The quotient topology automatically makes the surjection $\pi$ a continuous function.

Now we want to create a topological manifold out of $\mathbb{RP}^{3}$ . We need to define an atlas, then show that the charts in the atlas are homeomorphic to open subsets of $\mathbb{R}^{3} - \{ \mathbf{0} \}$ . Then we need to show that the resulting atlas makes $\mathbb{RP}^{3}$ locally Euclidean. That is, every point in $\mathbb{RP}^{3}$ has an open neighborhood homeomorphic to $\mathbb{R}^{3} - \{ \mathbf{0} \}$ . After that, we show that $\mathbb{RP}^{3}$ is Hausdorff and second countable. All of this together shows that $\mathbb{RP}^{3}$ is a topological 3-manifold.

For every $i \in \{ 0, 1, 2, 3 \}$ , define the set $U_{i}$ by

U_{i} = \{ \mathbf{x} \in \mathbb{R}^{4} - \{ \mathbf{0} \} \mid x_{i} \neq 0 \}

(3)

The set $U_{i}$ is an open set in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ . To prove this, we will show that $U_{i}$ can be covered by open balls of elements in $U_{i}$ . Suppose that $\mathbf{p} \in U_{i}$ . Let $h = |\mathbf{p} \cdot \mathbf{\hat{e}}_{i}| = |p_{i}|$ be the distance of the point $\mathbf{p}$ from the hyperplane defined by $x_{i} = 0$ . Let $d = \Vert \mathbf{p} \Vert = \Vert \mathbf{p} - \mathbf{0} \Vert$ be the Euclidean norm of the vector $\mathbf{p}$ . Let $r = \min \{ d, h \}$ and consider the open ball $B_{r}(\mathbf{p})$ in $\mathbb{R}^{4}$ . Choose $\mathbf{q} \in B_{r}(\mathbf{p})$ . By the triangle inequality

\Vert \mathbf{q} \Vert \geq \Vert \mathbf{p} \Vert - \Vert \mathbf{q} - \mathbf{p} \Vert > d - r \geq d - \frac{d}{2} = \frac{d}{2} > 0.

(4)

Hence $\mathbf{q} \in \mathbb{R}^{4} - \{ \mathbf{0} \}$ . Since $\mathbf{q}$ was chosen arbitrarily, this implies that $B_{r}(\mathbf{p}) \subset \mathbb{R}^{4} - \{ \mathbf{0} \}$ . Applying the triangle inequality again,

|q_{i}| \geq |p_{i}| - |q_{i} - p_{i}| > h - r \geq h - \frac{h}{2} = \frac{h}{2} > 0.

(5)

Thus $\mathbf{q}$ does not lie on the hyperplane defined by $x_{i} = 0$ . Therefore $\mathbf{q} \in U_{i}$ implying that $B_{r}(\mathbf{p}) \subset U_{i}$ . Since the vector $\mathbf{p} \in U_{i}$ was arbitrary, every point in $U_{i}$ lies in an open neighborhood that contains only elements of $U_{i}$ , we see that

\bigcup_{\mathbf{p} \in U_{i}} B_{r(\mathbf{p})}(\mathbf{p}) = U_{i}.

(6)

The open balls constructed above cover $U_{i}$ in open sets. Since $\mathbb{R}^{4}$ is a topological space, and $U_{i}$ is the union of open sets in the topology of $\mathbb{R}^{4}$ , then $U_{i}$ is an open set in $\mathbb{R}^{4}$ . Moreover, since $\mathbb{R}^{4} - \{ \mathbf{0} \}$ is an open set in $\mathbb{R}^{4}$ , the set $U_{i}$ is an open set of $\mathbb{R}^{4} - \{ \mathbf{0} \}$ in the subspace topology that $\mathbb{R}^{4} - \{ \mathbf{0} \}$ inherits from $\mathbb{R}^{4}$ .

Define the set $V_{i} \subset \mathbb{RP}^{3}$ by $V_{i} = \pi\left(U_{i}\right)$ . The set $V_{i}$ is open in $\mathbb{RP}^{3}$ . To show this, observe the following equalities

\begin{align*} \pi^{-1}\left( V_{i} \right) &= \pi^{-1}\left( \pi\left( U_{i} \right) \right) \\ &= \{ \mathbf{p} \in \mathbb{R}^{4} - \{ \mathbf{0} \} \mid \pi\left( \mathbf{p} \right) \in V_{i} \} \\ &= \{ \mathbf{p} \in \mathbb{R}^{4} - \{ \mathbf{0} \} \mid \exists \lambda \in \mathbb{R} - \{ 0 \} \hspace{4 pt} \text{such that} \hspace{4 pt} \mathbf{p} \in \lambda U_{i} \} \\ &= U_{i} \end{align*}

(7)

where the last equality follows because the preimage of any element in $V_{i}$ contains all nonzero multiples of an element of $U_{i}$ , which are still an elements of $U_{i}$ . Therefore, the set $\pi^{-1}(V_{i})$ is an open set in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ , which makes $V_{i}$ an open set in $\mathbb{RP}^{3}$ under the quotient topology.

For each $i \in \{ 0, 1, 2, 3 \}$ define the map $\pi_{i} : U_{i} \rightarrow V_{i}$ by $\pi_{i} = \pi|_{U_{i}}$ . Since the map $\pi_{i}$ is the restriction of a continuous map, it is also a continuous map. It is surjective by definition, since the codomain is the image $\pi(U_{i})$ . To show that $\pi_{i}$ is a quotient map, we must prove that a subset $W \subset V_{i}$ is open in $V_{i}$ if and only if $\pi^{-1}_{i}(W)$ is open in $U_{i}$ . Suppose that $W$ is open in $V_{i}$ . Then the inverse image of $W$ under $\pi$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ . Since $U_{i}$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ we see that $\pi^{-1}_{i}(W) = \pi^{-1}(W) \cap U_{i}$ . Since $W$ is open in $V_{i}$ , there exists an open set $X \subset \mathbb{RP}^{3}$ such that $W = X \cap V_{i}$ . Since $\pi$ is continuous, $\pi^{-1}(X)$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ , which implies that $\pi^{-1}(X) \cap U_{i}$ is open in $U_{i}$ . But

\pi^{-1}\left( X \right) \cap U_{i} = \pi^{-1}\left( X \cap V_{i} \right) \cap U_{i} = \pi^{-1}\left( W \right) \cap U_{i} = \pi^{-1}_{i}\left( W \right)

(8)

which is an open set in $U_{i}$ . Thus $\pi^{-1}_{i}(W)$ is an open set in $U_{i}$ . This proves that if $W \subset V_{i}$ is open, then $\pi^{-1}(W) \subset U_{i}$ is open. Conversely, suppose that $\pi^{-1}_{i}(W)$ is open in $U_{i}$ . Since $U_{i}$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ , there exists an open set $X$ of $\mathbb{R}^{4} - \{ \mathbf{0} \}$ such that $\pi^{-1}_{i}(W) = X \cap U_{i}$ . Define the set $X^{\prime} = X \cup ((\mathbb{R}^{4} - \{ \mathbf{0} \}) - U_{i})$ . We show that the set $X^{\prime}$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ . Suppose that $\mathbf{q} \in (\mathbb{R}^{4} - \{ \mathbf{0} \}) - U_{i}$ . We can find a radius $r > 0$ such that $B_{r}(\mathbf{q})$ does not intersect $U_{i}$ such that $B_{r}(\mathbf{q}) \cap U_{i} = \emptyset$ . Indeed, we can choose $r$ to be less than the distance to the nearest point where the $i^{th}$ coordinate is zero, i.e. $r < |q_{i}|$ . Choose $r = |q_{i}| / 2$ for concreteness. With the chosen $r$ , we have $B_{r}(\mathbf{q}) \subset \mathbb{R}^{4} - \{ \mathbf{0} \} - U_{i}$ for each $\mathbf{q} \in \mathbb{R}^{4} - \{ \mathbf{0} \} - U_{i}$ . This shows that $\mathbb{R}^{4} - \{ \mathbf{0} \} - U_{i}$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ . Since $X^{\prime}$ is the union of two open sets, it is also open. The set $X^{\prime}$ is open and saturated with respect to $\pi$ , so that image $\pi(X^{\prime})$ is open in $\mathbb{RP}^{3}$ because $\pi$ is a quotient map. We have

W = \pi_{i}(\pi^{-1}_{i}\left( W \right)) = \pi_{i}\left( X \cap U_{i} \right) = \pi\left( X^{\prime} \right) \cap V_{i}.

(9)

and it follows that $\pi^{-1}_{i}(W)$ is open in $\mathbb{R}^{4} - \{ \mathbf{0} \}$ . This demonstrates that $\pi_{i}$ is a quotient map.

We show that each set $V_{i}$ is homeomorphic to $\mathbb{R}^{4}$ . For each $i \in \{ 0, 1, 2, 3 \}$ , define the map $H_{i} : V_{i} \rightarrow \mathbb{RP}^{3}$ by

H_{i} \left( \begin{bmatrix} \begin{pmatrix} P_{0} \\ P_{1} \\ \vdots \\ P_{i - 1} \\ P_{i} \\ P_{i + 1} \\ \vdots \\ P_{3} \\ \end{pmatrix} \end{bmatrix} \right) = \begin{bmatrix} \begin{pmatrix} P_{0} / P_{i} \\ P_{1} / P_{i} \\ \vdots \\ P_{i-1} / P_{i} \\ 1 \\ P_{i+1} / {P_{i}} \\ \vdots \\ P_{3} / P_{i} \\ \end{pmatrix} \end{bmatrix}.

(10)

That is, the $i^{th}$ coordinate is 1, and the rest are divided by $P_{i}$ . This map sends a homogeneous point to the homogeneous point with the $i^{th}$ coordinate normalized. Let $W_{i}$ be the set of points in $\mathbb{RP}^{3}$ whose $i^{th}$ coordinate is 1. Define the map $\text{proj}_{i} : \mathbb{RP}^{3} \supset W_{i} \rightarrow \mathbb{R}^3$ by

\text{proj}_{i} \left( \begin{bmatrix} \begin{pmatrix} P_{0} \\ P_{1} \\ \vdots \\ P_{i-1} \\ 1 \\ P_{i+1} \\ \vdots \\ P_{3} \\ \end{pmatrix} \end{bmatrix} \right) = \begin{pmatrix} P_{0} \\ P_{1} \\ \vdots \\ P_{i-1} \\ P_{i+1} \\ \vdots \\ P_{3} \\ \end{pmatrix}

(11)

which maps the homogeneous point to a Euclidean point with the $i^{th}$ component removed. The maps $\text{proj}_{i}$ and $H_{i}$ together allow us to define our coordinate maps. Define the map $\psi_{i} : V_{i} \rightarrow \mathbb{R}^{3}$ by $\psi_{i} = \text{proj}_{i} \circ H_{i}$ , written out as

\psi_{i} \left( \begin{bmatrix} \begin{pmatrix} P_{0} \\ P_{1} \\ \vdots \\ P_{i - 1} \\ P_{i} \\ P_{i + 1} \\ \vdots \\ P_{3} \\ \end{pmatrix} \end{bmatrix} \right) = \begin{pmatrix} P_{0} / P_{i} \\ P_{1} / P_{i} \\ \vdots \\ P_{i-1} / P_{i} \\ P_{i+1} / {P_{i}} \\ \vdots \\ P_{3} / P_{i} \\ \end{pmatrix}.

(12)

Let $\varphi_{i} : U_{i} \rightarrow \mathbb{R}^{3}$ be given by $\varphi_{i} = \psi_{i} \circ \pi$ , written out as

\varphi_{i} \left( \begin{pmatrix} P_{0} \\ P_{1} \\ \vdots \\ P_{i - 1} \\ P_{i} \\ P_{i + 1} \\ \vdots \\ P_{3} \\ \end{pmatrix} \right) = \begin{pmatrix} P_{0} / P_{i} \\ P_{1} / P_{i} \\ \vdots \\ P_{i-1} / P_{i} \\ P_{i+1} / {P_{i}} \\ \vdots \\ P_{3} / P_{i} \\ \end{pmatrix}.

(13)

The map $\varphi_{i}$ is continuous by the universal property of product spaces applied to $\mathbb{R}^{4}$ . Since the map $\varphi_{i}$ is continuous, the universal property of quotient maps implies that the map $\psi_{i}$ is continuous. We must show that $\psi_{i}$ is bijective. Let’s prove surjectivity: let $\begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ P_{i+1} \ \dots & P_{3} \end{pmatrix}^{T} \in \mathbb{R}^{3}$ , let $\begin{bmatrix} \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ 1 \ P_{i+1} \dots \ P_{3} \end{pmatrix} \end{bmatrix}^{T} \in V_{i}$ , From the definition of $\psi_{i}$ , we see that

\psi_{i} \left( \begin{bmatrix} \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i - 1} \ 1 \ P_{i + 1} \ \dots \ P_{3} \ \end{pmatrix} \end{bmatrix}^{T} \right) = \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ P_{i+1} \ \dots \ P_{3} \ \end{pmatrix}^{T}.

(14)

which establishes surjectivity. To show injectivity, we require the following fact: every element in $V_{i}$ has a unique representative whose $i^{th}$ coordinate is 1. To see this, suppose that $[Q]$ is a homogeneous point in $V_{i}$ with representatives $\begin{pmatrix} Q_{0} \ Q_{1} \ \dots \ Q_{i-1} \ 1 \ Q_{i+1} \dots \ Q_{3} \end{pmatrix}^{T} \in \mathbb{R}^{3}$ and $\begin{pmatrix} Q^{\prime}_{0} \ Q^{\prime}_{1} \ \dots \ Q^{\prime}_{i-1} \ 1 \ Q^{\prime}_{i+1} \dots \ Q^{\prime}_{3} \end{pmatrix}^{T} \in \mathbb{R}^{3}$ where the $i^{th}$ component is 1. Since both representatives represent the same point, they must be equivalent, i.e. there exists $\lambda \in \mathbb{R} - \{0\}$ such that

\begin{pmatrix} Q_{0} \ Q_{1} \ \dots \ Q_{i-1} \ 1 \ Q_{i+1} \dots \ Q_{3} \end{pmatrix}^{T} = \lambda \begin{pmatrix} Q^{\prime}_{0} \ Q^{\prime}_{1} \ \dots \ Q^{\prime}_{i-1} \ 1 \ Q^{\prime}_{i+1} \dots \ Q^{\prime}_{3} \end{pmatrix}^{T}.

(15)

The fact that the $i^{th}$ component is 1 in both representatives implies that $Q^{\prime}_{j} = Q_{j}$ for each $j \in \{ 0, 1, 2, 3 \}$ . Therefore

\begin{pmatrix} Q_{0} \ Q_{1} \ \dots \ Q_{i-1} \ 1 \ Q_{i+1} \dots \ Q_{3} \end{pmatrix}^{T} = \begin{pmatrix} Q^{\prime}_{0} \ Q^{\prime}_{1} \ \dots \ Q^{\prime}_{i-1} \ 1 \ Q^{\prime}_{i+1} \dots \ Q^{\prime}_{3} \end{pmatrix}^{T}.

(16)

establishing uniqueness of the representative of $[Q]$ whose $i^{th}$ component is 1. Proving the injectivity of $\psi_{i}$ becomes easy. Recall the original definition of $\psi_{i}$ as $\psi_{i} = \text{proj}_{i} \circ H_{i}$ . Since a given homogeneous point has a unique representative whose $i^{th}$ component is 1, the normalization map $H_{i}$ is injective. That is, given homogeneous points $[Q]$ and $[Q^{\prime}]$ with $H_{i}([Q]) = H_{i}([Q^{\prime}])$ , we conclude $[Q] = [Q^{\prime}]$ . Also, notice that $\text{proj}_{i}$ is bijective with inverse map $\text{proj}^{-1}_{i} : \mathbb{R}^{3} \rightarrow \mathbb{RP}^{3}$ defined by

\text{proj}^{-1}_{i} \left( \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ P_{i+1} \ \dots \ P_{3} \ \end{pmatrix}^{T} \right) = \begin{bmatrix} \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ 1 \ P_{i+1} \ \dots \ P_{3} \ \end{pmatrix}^{T} \end{bmatrix}.

(17)

Since $\text{proj}_{i}$ is bijective, it is injective, and since $H_{i}$ is injective, so is their composite $\psi_{i}$ . This proves that $\psi_{i}$ is bijective.

Next, we show that $\psi^{-1}_{i}$ is continuous. Consider the map $\rho_{i} : \mathbb{R}^{3} \rightarrow \mathbb{R}^{4} - \{ \mathbf{0} \}$ given by

\rho_{i} \left( \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ P_{i+1} \ \dots \ P_{3} \ \end{pmatrix}^{T} \right) = \begin{pmatrix} P_{0} \ P_{1} \ \dots \ P_{i-1} \ 1 \ P_{i+1} \ \dots \ P_{3} \ \end{pmatrix}^{T}.

(18)

The map $\rho_{i}$ is continuous, its image is contained in $V_{i}$ , and $\pi_{i} \circ \rho_{i} = \psi^{-1}_{i}$ . This implies that $\psi^{-1}_{i}$ is continuous, since it is the composite of continuous functions. We have shown that $\psi_{i}$ is continuous, bijective, and has a continuous inverse. Therefore, it is a homeomorphism. Moreover, we can cover $\mathbb{RP}^{3}$ with the charts $\{ (V_{i}, \psi_{i}) \mid i \in \{ 0, 1, 2, 3 \} \}$ , which means that $\mathbb{RP}^{3}$ is locally Euclidean.

We show that $\mathbb{RP}^{3}$ is Hausdorff. Suppose that $[P]$ and $[Q]$ are two distinct points in $\mathbb{RP}^{3}$ . There are two possibilities: there is an $0 \leq i \leq 2$ such that both points lie in $V_{i}$ , or no such $i$ exists such that both points lie in $V_{i}$ . In the first case, $\psi_{i}([P])$ and $\psi_{i}([Q])$ are distinct points in $\mathbb{R}^{3}$ . Since $\mathbb{R}^{3}$ is Hausdorff, there exist disjoint sets $A$ and $B$ such that $\psi_{i}([P]) \in A$ and $\psi_{i}([Q]) \in B$ . Hence, $\psi^{-1}_{i}(A)$ and $\psi^{-1}_{i}$ are disjoint open subsets of $V_{i}$ , and hence $\psi^{-1}_{i}(A)$ and $\psi^{-1}_{i}$ are open subsets of $\mathbb{RP}^{3}$ . This yields the first case. Consider the second case. Suppose that no such $i$ exists such that $\psi_{i}([P])$ and $\psi_{i}([Q])$ are distinct points in $V_{i}$ . Let $(x_{0}, x_{1}, x_{2}, x_{3})$ be a representative of $[P]$ and let $(y_{0}, y_{1}, y_{2}, y_{3})$ be a representative of $[Q]$ . There exists $i \neq j$ or $0 \leq i, j \leq 3$ such that $x_{i} \neq 0, y_{i} = 0$ and $x_{j} = 0, y_{j} \neq 0$ . Choose a representative such that $x_{i} = 1$ and $y_{j} = 1$ . Assume without loss of generality that $i < j$ , and choose $0 < \epsilon < 1$ . The set

A = \{ \begin{pmatrix} a_{0} \ a_{1} \ \dots a_{i-1} \ 1 \ a_{i + 1} \dots a_{3} \end{pmatrix}^{T} \mid \forall k \neq i \hspace{4 pt} \lvert a_{k} - x_{k} \rvert < \epsilon \} \subset V_{i}

(19)

is an open set containing $[P]$ and the set

B = \{ \begin{pmatrix} b_{0} \ b_{1} \ \dots b_{j-1} \ 1 \ b_{j + 1} \dots b_{3} \end{pmatrix}^{T} \mid \forall k \neq i \hspace{4 pt} \lvert b_{k} - y_{k} \rvert < \epsilon \} \subset V_{j}

(20)

is an open set containing $[Q]$ . The image $\psi_{i}(A)$ is an open rectangle in $\mathbb{R}^{3}$ centered on $\psi_{i}([P])$ with a side length of $2 \epsilon$ . Similarly, the image $\psi_{i}(B)$ is an open rectangle in $\mathbb{R}^{3}$ centered on $\psi_{i}([Q])$ with a side length of $2 \epsilon$ . The sets $A$ and $B$ are disjoint. To show this, suppose $A$ and $B$ are not disjoint. Then for $\begin{pmatrix} a_{0} \ a_{1} \ \dots a_{i-1} \ 1 \ a_{i + 1} \dots a_{3} \end{pmatrix}^{T} = \begin{pmatrix} b_{0} \ b_{1} \ \dots b_{j-1} \ 1 \ b_{j + 1} \dots b_{3} \end{pmatrix}^{T}$ we must have $a_{j} \neq 0$ and $b_{i} \neq 0$ which implies that $a_{j} b_{i} = 1$ . But this is impossible, because $\lvert a_{j} \rvert < 1$ and $\lvert b_{i} \rvert < 1$ . Therefore, the sets $A$ and $B$ must be disjoint. This proves that $\mathbb{RP}^{3}$ is Hausdorff.

Finally, second countability follows from the fact that $\mathbb{R}^{4} - \{ \mathbf{0} \}$ is second countable. More precisely, let $\mathcal{B}$ be a countable base for the topology on $\mathbb{R}^{4} - \{ \mathbf{0} \}$ . The set

\mathcal{B}^{\prime} = \{ \pi\left( U \right) \mid U \in \mathcal{B} \}

(21)

is a countable set of open sets in $\mathbb{RP}^{3}$ because $\pi$ is saturated. Since $\pi$ is surjective, $\cup_{B \in \mathcal{B}^{\prime}} B = \mathbb{RP}^{3}$ , i.e. the set $\mathcal{B}^{\prime}$ covers $\mathbb{RP}^{3}$ . Hence $\mathbb{RP}^{3}$ is second countable. Note that second countability implies that $\mathbb{RP}^{3}$ is compact. We sketch a proof here. It comes from the fact that the 3-sphere $S^{3}$ is a compact subset of $\mathbb{R}^{4}$ , that the canonical surjection $\pi_{S^{3}}$ is continuous, and that the image of a compact set by a continuous map is compact. Since $\pi_{S^{3}}(S^{3})$ is compact, and $\pi_{S^{3}}(S^{3}) = \mathbb{RP}^{3}$ we immediately infer that $\mathbb{RP}^{3}$ is compact.

We have topologized real projective space, and proven that $\mathbb{RP}^{3}$ is second countable, Hausdorff and locally Euclidean. Therefore $\mathbb{RP}^{3}$ is a topological 3-manifold (or we just say that it is a 3-manifold, or a manifold). Finally, $\mathbb{RP}^{3}$ is compact, so it is a nice setting to work with topologically.

Representing Transformations In Real Projective Space¶

Any orthonormal frame $(\tilde{O}_{frame}, \left( \mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d} \right))$ on $\mathbb{E}^{3}$ induces a coordinate chart as follows. A point $\tilde{P} \in \mathbb{E}^{3}$ is written as

\tilde{P} = \tilde{O}_{frame} + P_{h} \mathbf{\hat{u}}_{h} + P_{v} \mathbf{\hat{u}}_{v} + P_{d} \mathbf{\hat{u}}_{d}

(22)

and the orthonormal frame $(\tilde{O}_{frame}, \left( \mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d} \right))$ defines a coordinate chart $\varphi_{frame} : \mathbb{E}^{3} \rightarrow \mathbb{R}^{3}$ by $\varphi_{frame}( \tilde{P} ) = \tilde{P} - \tilde{O}$ . Written out

P = \varphi_{frame}(\tilde{P}) \equiv \tilde{P} - \tilde{O}_{frame} = P_{h} \mathbf{\hat{u}}_{h} + P_{v} \mathbf{\hat{u}}_{v} + P_{d} \mathbf{\hat{u}}_{d}

(23)

is the representation of $\tilde{P}$ in $\mathbb{R}^{3}$ . Also, the coordinate chart $\varphi_{frame}$ maps the origin $\tilde{O}_{frame}$ of the view coordinate system in $\mathbb{E}^{3}$ to $\mathbf{0}$ : $O_{frame}$ = $\varphi_{frame}(\tilde{O}_{frame}) = \tilde{O}_{frame} - \tilde{O}_{frame} = \mathbf{0}$ . This shows that the view space frame origin in $\mathbb{E}^{3}$ indeed maps to the vector space origin $\mathbf{0}$ in $\mathbb{R}^{3}$ .

Recall that the definition of real projective space defines the manifold structure using the surjection $\pi : \mathbb{R}^{4} - \{\mathbf{0}\} \rightarrow \mathbb{RP}^{3}$ given by

\pi\left( \begin{pmatrix} P \\ w \\ \end{pmatrix} \right) = \begin{bmatrix} \begin{pmatrix} P \\ w \\ \end{pmatrix} \end{bmatrix}

(24)

where $[.]$ on the right-hand side indicates the equivalence class of $\begin{pmatrix} P^{T}, w \end{pmatrix}^{T}$ .

Define a map $\rho : \mathbb{R}^{3} \rightarrow \mathbb{R}^{4} - \{\mathbf{0}\}$ by

\rho\left( P \right) = \begin{pmatrix} P \\ 1 \\ \end{pmatrix}

(25)

The maps $\pi$ and $\rho$ together allow us to map from view space to projective view space with the camera orthonormal frame by

\left(\pi \circ \rho\right)\left(P\right) = \pi\left( \rho\left( P \right) \right) = \pi\left( \begin{pmatrix} P \\ 1 \\ \end{pmatrix} \right) = \begin{bmatrix} \begin{pmatrix} P \\ 1 \\ \end{pmatrix} \end{bmatrix}.

(26)

The map $\rho$ defines an embedding (injection) of $\mathbb{R}^{3}$ into $\mathbb{R}^{4} - \{ \mathbf{0} \}$ , because each element in $\rho(\mathbb{R}^{3})$ has a unique elements $P \in \mathbb{R}^{3}$ such that $\rho(P) = \begin{pmatrix} P^{T} \ 1 \end{pmatrix}^{T}$ . Since $\rho$ is surjective on its image, injective, and continuous with continuous inverse, it is a homeomorphism on its image, hence an embedding. Since every element of $\mathbb{R}^{3}$ has a unique homogeneous point $Q$ such that $Q = [\begin{pmatrix} P^{T} \ 1 \end{pmatrix}^{T}]$ , the map $\pi \circ \rho$ is also an embedding. The set $\pi(\rho(\mathbb{R}^{3}))$ is sometimes called the affine patch or Euclidean patch of $\mathbb{RP}^{3}$ .

We can lift points in $\mathbb{E}^{3}$ into $\mathbb{RP}^{3}$ . Let’s do the same for transformations. Suppose that $A : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ is an affine map. We define the lifted map of $A$ over to $\mathbb{RP}^{3}$ to be the map $\hat{A} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ such that

\left( \rho \circ A \right)\left( P \right) = \left( \hat{A} \circ \rho \right) \begin{pmatrix} P \end{pmatrix}.

(27)

Recall that an affine map has the form $A(P) = L(P) + \mathbf{t}$ where $L : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ is a linear map, and $\mathbf{t}$ is a vector, i.e. a linear map plus a translation term. Expanding this out, the left-hand side of (27) becomes

\left( \rho \circ A \right)\left( P \right) = \begin{pmatrix} A\left( P \right) \\ 1 \\ \end{pmatrix} = \begin{pmatrix} L\left( P \right) + \mathbf{t} \\ 1 \\ \end{pmatrix} = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{0}^{T} & 1 \\ \end{bmatrix} \begin{pmatrix} P \\ 1 \\ \end{pmatrix}

(28)

and on the right-hand side we have

\left( \hat{A} \circ \rho \right) \left( P \right) = \hat{A} \left(\rho \left( P \right) \right) = \hat{A} \left( \begin{pmatrix} P \\ 1 \\ \end{pmatrix} \right).

(29)

Equating both sides of (27) we have

\hat{A} \left( \begin{pmatrix} P \\ 1 \\ \end{pmatrix} \right) = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{0}^{T} & 1 \\ \end{bmatrix} \begin{pmatrix} P \\ 1 \\ \end{pmatrix}

(30)

\hat{A} = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{0}^{T} & 1 \\ \end{bmatrix}

(31)

so that $\hat{A}$ is unique. An affine map in $\mathbb{R}^{3}$ becomes a linear map in $\mathbb{RP}^{3}$ . In the case of a linear map $L$ , $\mathbf{t} = \mathbf{0}$ and

\hat{L} = \begin{bmatrix} L & \mathbf{0} \\ \mathbf{0}^{T} & 1 \\ \end{bmatrix}.

(32)

Now suppose that $T : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ is a projective transformation. A projective transformation is a map of the form

T \left( P \right) = \left( \frac{1}{g(P)} \right) A \left( P \right) = \left( \frac{1}{g(P)} \right) \left( L(P) + \mathbf{t} \right)

(33)

where $A$ is an affine map and $g : \mathbb{R}^{3} \rightarrow \mathbb{R}$ is an affine scalar function. This means that $g$ has the form $g(P) = \mathbf{c} \cdot P + h$ . We define the lifted map of $T$ over to $\mathbb{RP}^{3}$ to be the map $\hat{T} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ such that

g\left( P \right) \left( \rho \circ T \right) \left( P \right) = \left( \hat{T} \circ \rho \right) \left( P \right)

(34)

Expanding out the right-hand side of (34)

\left( \hat{T} \circ \rho \right) \left( P \right) = \hat{T} \left( \rho \left( P \right) \right) = \hat{T} \begin{pmatrix} P \\ 1 \end{pmatrix}.

(35)

More interestingly, expanding out the left-hand side of (34) gives

\begin{align*} g\left( P \right) \left( \rho \circ T \right) \left( P \right) &= g\left( P \right) \begin{pmatrix} T(P) \\ 1 \end{pmatrix} \\ &= \begin{pmatrix} g\left( P \right) T(P) \\ g\left( P \right) \end{pmatrix} \\ &= \begin{pmatrix} A(P) \\ g\left( P \right) \end{pmatrix} \\ &= \begin{pmatrix} L(P) + \mathbf{t} \\ g\left( P \right) \end{pmatrix} \\ &= \begin{pmatrix} L(P) + \mathbf{t} \\ \mathbf{c} \cdot P + h \end{pmatrix} \\ &= \begin{bmatrix} L & \mathbf{t} \\ \mathbf{c}^{T} & h \\ \end{bmatrix} \begin{pmatrix} P \\ 1 \\ \end{pmatrix} \end{align*}

(36)

where the third equality follows from the definition of the projective transformation. Combining (35) and (36) back into (34) yields

\hat{T} \begin{pmatrix} P \\ 1 \end{pmatrix} = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{c}^{T} & h \\ \end{bmatrix} \begin{pmatrix} P \\ 1 \\ \end{pmatrix}

(37)

and therefore

\hat{T} = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{c}^{T} & h \\ \end{bmatrix}.

(38)

This is a unique representation of the projective transformation $T$ lifted over to $\mathbb{RP}^{3}$ . Here is a wonderful discovery: in $\mathbb{RP}^{3}$ where all coordinate scales are treated as equivalent, we can work with linear, affine, and projective transformations in a unified setting by going one dimension higher in our representation from $T : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ to its lifted counterpart $\hat{T} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ .

The punchline is that we can lift the Euclidean space $\mathbb{E}^{3}$ into real projective space, in such a way that each point in $\mathbb{E}^{3}$ has a corresponding unique point in the affine subspace of $\mathbb{RP}^{3}$ under any coordinate chart. This lift allows us to construct our projections between different coordinate systems in real projective space in a coherent and well-defined manner. Via the embedding $\pi \circ \rho$ , the manifold structure of $\mathbb{RP}^{3}$ allows us to scale our coordinates in any way we like, because in projective coordinates we don’t care about scaling. We can also lift transformations from $\mathbb{E}^{3}$ to $\mathbb{RP}^{3}$ as a consequence of the embedding $\pi \circ \rho$ too. This setting makes it convenient to construct perspective and orthographic projections, because at the end, we are back in the affine portion of $\mathbb{RP}^{3}$ after normalization, so we can map back out of $\mathbb{RP}^{3}$ again by choosing a chart on $\mathbb{RP}^{3}$ and applying it.

We have shown how to map Euclidean space to real projective space, and how to represent linear, affine, and projective transformations as linear transformations in linear projective space. It remains to apply these developments to our primary goal: constructing orthographic and perspective projection matrices in homogeneous coordinates.

Matrix Representations Of Projective Transformations In

\mathbb{RP}^{3}

A linear transformation $L : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ lifted over to $\mathbb{RP}^{3}$ is a linear map $\hat{L} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ with the matrix representation

\hat{L} = \begin{bmatrix} L & \mathbf{0} \\ \mathbf{0}^{T} & 1 \\ \end{bmatrix}.

(39)

An affine transformation $A : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ lifted over to $\mathbb{RP}^{3}$ is a linear map $\hat{A} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ with the matrix representation

\hat{A} = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{0}^{T} & 1 \\ \end{bmatrix}.

(40)

A projective transformation $T : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ lifted over to $\mathbb{RP}^{3}$ is a linear map $\hat{T} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ with the matrix representation

\hat{T} = \begin{bmatrix} L & \mathbf{t} \\ \mathbf{c}^{T} & h \\ \end{bmatrix}.

(41)

Specifying Projection Matrices¶

To define a projection matrix, we need two coordinate systems: the view coordinate system, and the normalized device coordinate system. The view coordinate system is where the view volume is defined. The normalized device coordinate system is where the canonical view volume is defined. The task of the projection transformation is to map the view volume to the canonical view volume.

To understand what the projection matrices do, we must understand the coordinate systems that we map between at each step in the pipeline leading from the view space to the canonical view volume. The transformations stem from choosing a convenient coordinate system in which to render computer graphics. The convenient coordinate system we choose is often called normalized device coordinates, defined in the following. The canonical coordinate system differs from platform to platform. The other coordinates systems exist to articulate a clear path from projective view coordinates to normalized device coordinates. First we must define the view space and the view coordinates.

The view coordinate system for Euclidean space $\mathbb{E}^{3}$ is given by the orthonormal frame $(\tilde{O}_{view}, \mathcal{B}_{view})$ where (1) the origin of the orthonormal frame is the point $\tilde{O}_{view} \in \mathbb{E}^{3}$ ; (2) the basis of the orthonormal frame is $\mathcal{B}_{view} = (\mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d})$ where the basis vector $\mathbf{\hat{u}}_{h}$ points to the right, the basis vector $\mathbf{\hat{u}}_{v}$ points up, and the basis vector $\mathbf{\hat{u}}_{d}$ points into the view volume or out of the view volume, depending on the choice of orientation; (3) the view volume in this coordinate system is called the view volume (or in the case of perspective projection, the perspective view volume).

The projected coordinate system for Euclidean space $\mathbb{E}^{3}$ is given by the orthonormal frame $(\tilde{O}_{proj}, \mathcal{B}_{proj})$ where (1) the origin of the orthonormal frame is the point $\tilde{O}_{proj} \in \mathbb{E}^{3}$ ; (2) the basis of the orthonormal frame is $\mathcal{B}_{proj} = (\mathbf{\hat{u}}_{proj,h}, \mathbf{\hat{u}}_{proj,v}, \mathbf{\hat{u}}_{proj,d})$ where the basis vector $\mathbf{\hat{u}}_{proj,h}$ points to the right, the basis vector $\mathbf{\hat{u}}_{proj,v}$ points up, and the basis vector $\mathbf{\hat{u}}_{proj,d}$ points into the view volume or out of the view volume, depending on the choice of orientation; (3) the view volume in this coordinate system is called the projected view volume (or in the case of perspective projection, the perspective projected view volume).

The clip coordinate system for Euclidean space $\mathbb{E}^{3}$ is given by the orthonormal frame $(\tilde{O}_{clip}, \mathcal{B}_{clip})$ defined on $\mathbb{E}^{3}$ where (1) the origin of the orthonormal frame $(\tilde{O}_{clip}, \mathcal{B}_{clip})$ is the point $\tilde{O}_{clip} \in \mathbb{E}^{3}$ ; (2) the basis of the orthonormal frame is $\mathcal{B}_{clip} = (\mathbf{\hat{u}}_{clip,h}, \mathbf{\hat{u}}_{clip,v}, \mathbf{\hat{u}}_{clip,d})$ where the basis vector $\mathbf{\hat{u}}_{clip,h}$ points to the right, the basis vector $\mathbf{\hat{u}}_{clip,v}$ points up, and the basis vector $\mathbf{\hat{u}}_{clip,d}$ points into the view volume, or out of the view volume depending on the choice or orientation; (3) the view volume in this coordinate system is called the orthographic view volume.

The normalized device coordinate system for Euclidean space $\mathbb{E}^{3}$ is given by the orthonormal frame $(\tilde{O}_{ndc}, \mathcal{B}_{ndc})$ defined on $\mathbb{E}^{3}$ where (1) The origin of the orthonormal frame $(\tilde{O}_{ndc}, \mathcal{B}_{ndc})$ is the point $\tilde{O}_{ndc} \in \mathbb{E}^{3}$ ; (2) The basis of the orthonormal frame is $\mathcal{B}_{ndc} = (\mathbf{\hat{u}}_{ndc,h}, \mathbf{\hat{u}}_{ndc,v}, \mathbf{\hat{u}}_{ndc,d})$ where the basis vector $\mathbf{\hat{u}}_{ndc,h}$ points to the right, the basis vector $\mathbf{\hat{u}}_{ndc,v}$ points up, and the basis vector $\mathbf{\hat{u}}_{ndc,d}$ points into the view volume, or out of the view volume depending on the choice or orientation; (3) The view volume in this coordinate system is called the canonical view volume.

The view coordinate system, projected coordinate system, clip coordinate system, and normalized device coordinate system induce coordinate charts via $\varphi_{view}(\tilde{P}) = \tilde{P} - \tilde{O}_{view}$ , $\varphi_{proj}(\tilde{P}) = \tilde{P} - \tilde{O}_{proj}$ , $\varphi_{clip}(\tilde{P}) = \tilde{P} - \tilde{O}_{clip}$ , and $\varphi_{ndc}(\tilde{P}) = \tilde{P} - \tilde{O}_{ndc}$ , respectively.

The projection transformations are parametrized by two set of parameters. The first one is the view volume parameters $l$ , $r$ , $b$ , $t$ , $n$ , $f$ where $l > 0$ , $r > 0$ , $b > 0$ , $t > 0$ , and $f > n > 0$ such that the view volume is parametrized by $[-l, r] \times [-b, t] \times [n, f]$ . The second one is the canonical view volume parametrized by $[\alpha_{min}, \alpha_{max}] \times [\beta_{min}, \beta_{max}] \times [\gamma_{min}, \gamma_{max}]$ . where $\alpha_{max} > \alpha_{min}$ , $\beta_{max} > \beta_{min}$ , and $\gamma_{max} > \gamma_{min}$ .

Projection Matrix Specification

A projection matrix is a projective transformation $T : \mathbb{E}^{3} \rightarrow \mathbb{E}^{3}$ lifted over to $\mathbb{RP}^{3}$ as $\hat{T} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ specified by the following data:

The view coordinate system, defined by an orthonormal frame $\mathcal{F}_{view} = (\tilde{O}_{view}, \mathcal{B}_{view})$ .
The clip coordinate system, defined by an orthonormal frame $\mathcal{F}_{clip} = (\tilde{O}_{clip}, \mathcal{B}_{clip})$ .
The normalized device coordinate system, defined by an orthonormal frame $\mathcal{F}_{ndc} = (\tilde{O}_{ndc}, \mathcal{B}_{ndc})$ .
The view volume, parametrized by $[-l, r] \times [-b, r] \times [n, f]$ where $l > 0$ , $r > 0$ , $b > 0$ , $t > 0$ , and $f > n > 0$ .
The canonical view volume, parametrized by $[\alpha_{min}, \alpha_{max}] \times [\beta_{min}, \beta_{max}] \times [\gamma_{min}, \gamma_{max}]$ where $\alpha_{max} > \alpha_{min}$ , $\beta_{max} > \beta_{min}$ , and $\gamma_{max} > \gamma_{min}$ .

The projection matrix maps the view volume to the canonical view volume.

The Canonical Projection Matrices¶

In this section, we construct the canonical projection matrices. We call them canonical because each of the projections has the same output coordinate system as the input coordinate system. So it is particularly convenient to work with.

The Canonical Coordinate System¶

We choose a canonical set of coordinate systems to construct the canonical projective transformations that will be used to derive the projective transformations in specific settings.

The canonical view coordinates is the view space coordinate system with orthonormal frame $(\tilde{O}_{view}, \mathcal{B}_{view})$ where $\mathcal{B}_{view} = (\mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d})$ such that the depth frame vector $\mathbf{\hat{u}}_{d}$ points into the view volume. This gives the canonical view space a left-handed orientation. The canonical projected coordinates coordinate system is the projected coordinate system with the same orthonormal frame as the canonical view coordinates. The canonical clip coordinates coordinate system is the clip coordinate system with the same orthonormal frame as the canonical view coordinate system. The canonical normalized device coordinates coordinate system is the normalized device coordinate system with the same orthonormal frame as in canonical view coordinates. In particular, each coordinate system uses the same orthonormal frame, and has a left-handed orientation.

It is not strictly required that each space have the same orthonormal frame, but using the same coordinate system in each step makes it easier to see what is going on in the transformation pipeline. Since we derive each transformation between the coordinate systems above, we can always transform any one of them with a combination of orthogonal transformations and changes of orientation to get the desired ones. It is also much less error-prone with fewer pesky signs to deal with.

The Canonical Perspective Projection Matrix¶

With these considerations, we construct the canonical perspective projection matrix for the frame $(\tilde{O}_{view}, (\mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d}))$ with the perspective view volume parametrized by $[-l, r] \times [-b, t] \times [n, f]$ and the canonical view volume parametrized by $[\alpha_{min}, \alpha_{max}] \times [\beta_{min}, \beta_{max}] \times [\gamma_{min}, \gamma_{max}]$ . Let $P \in \mathbb{R}^{3}$ be a point given by $P = P_{h} \mathbf{\hat{u}}_{h} + P_{v} \mathbf{\hat{u}}_{v} + P_{d} \mathbf{\hat{u}}_{d}$ . We derive the perspective projected horizontal and vertical components.

Using similar triangles for the horizontal component, we have

\frac{P_{h}}{P_{d}} = \frac{P \cdot \mathbf{\hat{u}}_{h}}{P \cdot \mathbf{\hat{u}}_{d}} = \frac{P^{\prime}_{proj} \cdot \mathbf{\hat{u}}_{h}}{n} = \frac{P^{\prime}_{proj,h}}{n}

(42)

which yields

P^{\prime}_{proj,h} = n P_{h} \left( \frac{1}{P_{d}} \right).

(43)

Analagously for the vertical component, applying similar triangles gives us

\frac{P_{v}}{P_{d}} = \frac{P \cdot \mathbf{\hat{u}}_{v}}{P \cdot \mathbf{\hat{u}}_{d}} = \frac{P^{\prime}_{proj} \cdot \mathbf{\hat{u}}_{v}}{n} = \frac{P^{\prime}_{proj,v}}{n}

(44)

implying that

P^{\prime}_{proj,v} = n P_{v} \left( \frac{1}{P_{d}} \right).

(45)

Observe that the horizontal and vertical components both transform as projective transformations. Indeed, the perspective projection transformation is a projective transformation where the affine function is given by $P_{d}$ . In other words, the perspective projection is dividing by depth. Using our reasoning about representing projective transformations that lead to the matrix representation (38), we can deduce the lifted mapping by rearranging the perspective equations

P_{d} P^{\prime}_{proj,h} = n P_{h} \\ P_{d} P^{\prime}_{proj,v} = n P_{v} \\

(46)

suggestively. The right hand side in both cases are affine transformations. This suggests to form of the transformation for the depth component. We define our projected coordinate system components as follows. Let $P_{proj,h} = P_{d} P^{\prime}_{proj,h}$ and $P_{proj,v} = P_{d} P^{\prime}_{proj,v}$ . Since the affine scalar function is perspective division, we immediately see that $P_{proj,w} = P_{d}$ . This leaves the depth component. The depth component does not directly participate in depth normalization, and since our coordinate system is orthogonal, so $P_{proj,d}$ cannot depend on $P_{h}$ or $P_{v}$ . This implies that $P_{proj,d}$ must be a function of $P_{d}$ and $P_{w}$ . Since the transformation is projective, the depth component must transform affinely. Since $P_{w} = 1$ . This leaves

P_{proj,d} = \theta \left( P_{d} \right) = A^{\prime} P_{d} + B^{\prime}.

(47)

Taken together, the projected coordinates components of the perspective transformation are

\begin{align*} P_{proj,h} &= n P_{h} \\ P_{proj,v} &= n P_{v} \\ P_{proj,d} &= \theta \left ( P_{d} \right) = A^{\prime} P_{d} + B^{\prime} \\ P_{proj,w} &= P_{d} \\ \end{align*}

(48)

where we solve for the constants $A^{\prime}$ and $B^{\prime}$ later.

After projection, we must map the projected coordinates to clip coordinates. To obtain the right clip coordinates, we need to consider normalized device coordinates. This is necessary since the mapping from projected coordinates to clip coordinates depends on the choice of parametrization of the canonical view volume. The resulting transformation is an orthographic transformation that maps the orthographic view volume $[-l, r] \times [-b, t] \times [n, f]$ to the canonical view volume $[\alpha_{min}, \alpha_{max}] \times [\beta_{min}, \beta_{max}] \times [\gamma_{min}, \gamma_{max}]$ . The map from projected coordinates to clip coordinates is a linear map in $\mathbb{RP}^{3}$ , so the coordinates must transform affinely.

We need a set of maps $\phi_{h}, \phi_{v}, \phi_{d}, \phi_{w} : \mathbb{R}^{3} \rightarrow \mathbb{R}$ such that

\begin{align*} P_{ndc,h} &= \frac{P_{clip,h}}{P_{clip,w}} &&= \left( H_{w} \circ \phi_{h} \right) \left( P_{proj} \right) &&= \phi_{h} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right) } \right) &&\equiv \xi_{h} \left( P \right) \\ P_{ndc,v} &= \frac{P_{clip,v}}{P_{clip,w}} &&= \left( H_{w} \circ \phi_{v} \right) \left( P_{proj} \right) &&= \phi_{v} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right) } \right) &&\equiv \xi_{v} \left( P \right) \\ P_{ndc,d} &= \frac{P_{clip,d}}{P_{clip,w}} &&= \left( H_{w} \circ \phi_{d} \right) \left( P_{proj} \right) &&= \phi_{d} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right) } \right) &&\equiv \xi_{d} \left( P \right) \\ P_{ndc,w} &= \frac{P_{clip,w}}{P_{clip,w}} &&= \left( H_{w} \circ \phi_{w} \right) \left( P_{proj} \right) &&= \phi_{w} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right)} \right) &&= 1 \\ \end{align*}

(49)

where $P_{proj}$ denotes a point $P \in \mathbb{R}^{3}$ in projected coordinates, $P_{clip,w} = \phi_{w}(P_{proj})$ is an affine scalar function, and $H_{w} : \mathbb{R}^{4} \rightarrow \mathbb{R}^{4}$ denotes homogenization by the $w$ component. Observe that the functions $\xi_{h}, \xi_{v}, \xi_{d} : \mathbb{R}^{3} \rightarrow \mathbb{R}$ on the right-hand side of (49) are projective functions of the view space point $P$ . This implies that the clip coordinates must be affine functions of $P_{proj}$ . In particular,

\begin{align*} P_{clip,h} &= \phi_{h} \left( P_{proj} \right) \\ P_{clip,v} &= \phi_{v} \left( P_{proj} \right) \\ P_{clip,d} &= \phi_{d} \left( P_{proj} \right) \\ P_{clip,w} &= \phi_{w} \left( P_{proj} \right) \\ \end{align*}

(50)

is the form of the clip components. Since the purpose of the $w$ coordinate is to perform perspective division, this implies that

\phi_{w} \left( P_{proj} \right) = P_{d}

(51)

so that the equations in (49) become

\begin{align*} P_{ndc,h} &= \xi_{h} \left( P \right) = \phi_{h} \left( P_{proj} \right) \left( \frac{1}{ P_{d} } \right) \\ P_{ndc,v} &= \xi_{v} \left( P \right) = \phi_{v} \left( P_{proj} \right) \left( \frac{1}{ P_{d} } \right) \\ P_{ndc,d} &= \xi_{d} \left( P \right) = \phi_{d} \left( P_{proj} \right) \left( \frac{1}{ P_{d} } \right) \\ \end{align*}

(52)

To derive the perspective projection matrix, we solve for the clip coordinate functions $\phi_{h}, \phi_{v}, \phi_{d}$ indirectly using the auxiliary functions $\xi_{h}, \xi_{v}, \xi_{d}$ in (52), and use the constraints on the orthographic view volume to compute the auxiliary functions. To establish constraints, we need to talk about some well chosen points. We need to construct the maps $\xi_{h}, \xi_{v}, \xi_{d}$ such that the parametrization of the orthographic view volume maps to the parametrization of the canonical view volume. That it, such that coordinates map as $-l \mapsto \alpha_{min}, r \mapsto \alpha_{max}, -b \mapsto \beta_{min}, t \mapsto \beta_{max}, n \mapsto \gamma_{min}, f \mapsto \gamma_{max}$ . Consider the points in view coordinates

\begin{align*} Q_{left} &= -l \mathbf{\hat{u}}_{h} + n \mathbf{\hat{u}}_{d} \\ Q_{right} &= r \mathbf{\hat{u}}_{h} + n \mathbf{\hat{u}}_{d} \\ Q_{bottom} &= -b \mathbf{\hat{u}}_{v} + n \mathbf{\hat{u}}_{d} \\ Q_{top} &= t \mathbf{\hat{u}}_{v} + n \mathbf{\hat{u}}_{d} \\ Q_{near} &= n \mathbf{\hat{u}}_{d} \\ Q_{far} &= f \mathbf{\hat{u}}_{d} \\ \end{align*}

(53)

The points $Q_{near}$ and $Q_{far}$ are the points along the viewing axis that intersect the near plane and the far plane, respectively, of the perspective view volume. The point $Q_{left}$ represents the point of intersection of the left plane, near plane, and the horizontal-vertical plane of the view volume. The point $Q_{right}$ represents the point of intersection of the right plane, near plane, and the horizontal-vertical plane of the view volume. The point $Q_{bottom}$ represents the point of intersection of the bottom plane, near plane, and depth-vertical plane of the view volume. The point $Q_{top}$ represents the point of intersection of the top plane, near plane, and the depth-vertical plane of the view volume. In short, the points $Q_{near}$ and $Q_{far}$ are the origins of the near and far planes, respectively. The other four points and points chosen along the edge of the viewport in the near plane that allow us to easily set up the boundary conditions to compute the functions $\xi_{h}, \xi_{v}, \xi_{d}$ .

We now calculate the auxiliary functions. Consider the map $\xi_{h}$ , where

\begin{align*} \xi_{h} \left( P \right) &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} P_{proj,d} + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} \theta \left( P_{d} \right) + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} \left( A^{\prime} P_{d} + B^{\prime} \right) + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} A^{\prime} P_{d} + C^{\prime} B^{\prime} + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + \left( C^{\prime} A^{\prime} \right) P_{d} + \left( C^{\prime} B^{\prime} + D^{\prime} \right) \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C P_{d} + D \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A n P_{h} + B n P_{v} + C P_{d} + D \right) \left( \frac{1}{P_{d}} \right) \\ &= A n P_{h} \left( \frac{1}{P_{d}} \right) + B n P_{v} \left( \frac{1}{P_{d}} \right) + C + D \left( \frac{1}{P_{d}} \right) \\ \end{align*}

(54)

where $C = C^{\prime} A^{\prime}$ and $D = C^{\prime} B^{\prime} + D^{\prime}$ , so that

\xi_{h} \left( P \right) = A n P_{h} \left( \frac{1}{P_{d}} \right) + B n P_{v} \left( \frac{1}{P_{d}} \right) + C + D \left( \frac{1}{P_{d}} \right).

(55)

Define the boundary conditions for our chosen points

\begin{align*} \xi_{h} \left( Q_{left} \right) &= \alpha_{min} \\ \xi_{h} \left( Q_{right} \right) &= \alpha_{max} \\ \xi_{h} \left( Q_{bottom} \right) &= 0 \\ \xi_{h} \left( Q_{top} \right) &= 0 \\ \xi_{h} \left( Q_{near} \right) &= 0 \\ \xi_{h} \left( Q_{far} \right) &= 0 \\ \end{align*}

(56)

which we need to justify. The view coordinates are orthogonal to each other, and the normalized device coordinates are also orthogonal to each other. This means that $\xi_{h}$ should only be a function of the horizontal component and not the vertical component. The points $Q_{bottom}, Q_{top}, Q_{near}, Q_{far}$ lie on the depth-vertical plane, which have a zero horizontal component, so they should keep a zero horizontal component after transformation.

Applying the boundary conditions, we have

\begin{align*} \xi_{h} \left( Q_{left} \right) &= A n \cdot \left( -l \right) \cdot\left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= -A l + C + D \left( \frac{1}{n} \right) \\ &= \alpha_{min} \\ \xi_{h} \left( Q_{right} \right) &= A n \cdot r \cdot \left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= A r + C + D \left( \frac{1}{n} \right) \\ &= \alpha_{max} \\ \xi_{h} \left( Q_{bottom} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot \left( -b \right) \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= -B b + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{h} \left( Q_{top} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot t \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= B t + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{h} \left( Q_{near} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{h} \left( Q_{far} \right) &= A n \cdot 0 \cdot \left( \frac{1}{f} \right) + B n \cdot 0 \cdot \left( \frac{1}{f} \right) + C + D \left( \frac{1}{f} \right) \\ &= C + D \left( \frac{1}{f} \right) \\ &= 0 \\ \end{align*}

(57)

so that

\begin{align*} \xi_{h} \left( Q_{left} \right) &= -A l + C + D \left( \frac{1}{n} \right) &&= \alpha_{min} \\ \xi_{h} \left( Q_{right} \right) &= A r + C + D \left( \frac{1}{n} \right) &&= \alpha_{max} \\ \xi_{h} \left( Q_{bottom} \right) &= -B b + C + D \left( \frac{1}{n} \right) &&= 0 \\ \xi_{h} \left( Q_{top} \right) &= B t + C + D \left( \frac{1}{n} \right) &&= 0 \\ \xi_{h} \left( Q_{near} \right) &= C + D \left( \frac{1}{n} \right) &&= 0 \\ \xi_{h} \left( Q_{far} \right) &= C + D \left( \frac{1}{f} \right) &&= 0 \\ \end{align*}

(58)

and now we compute the constants. Subtracting $\xi_{h} \left( Q_{right} \right)$ from $\xi_{h} \left( Q_{left} \right)$ in (58) yields

\begin{align*} \xi_{h} \left( Q_{right} \right) - \xi_{h} \left( Q_{left} \right) &= \left[ A r + C + D \left( \frac{1}{n} \right) \right] - \left[ -A l + C + D \left( \frac{1}{n} \right) \right] \\ &= A r - \left( - A l \right) \\ &= A \left( r - \left( -l \right) \right) \\ &= \alpha_{max} - \alpha_{min}. \end{align*}

(59)

Solving for $A$ , we see that

A = \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)}.

(60)

Subtracting $\xi_{h} \left( Q_{top} \right)$ from $\xi_{h} \left( Q_{bottom} \right)$ in (58) yields

\begin{align*} \xi_{h} \left( Q_{top} \right) - \xi_{h} \left( Q_{bottom} \right) &= \left[ B t + C + D \left( \frac{1}{n} \right) \right] - \left[ -B b + C + D \left( \frac{1}{n} \right) \right] \\ &= B t - B \left( -b \right) \\ &= B \left( t - \left( -b \right) \right) \\ &= 0. \end{align*}

(61)

Solving for $B$ , we see that

B = 0

(62)

since $t - (-b) = t + b \neq 0$ . Subtracting $\xi_{h} \left( Q_{far} \right)$ from $\xi_{h} \left( Q_{near} \right)$ in (58) yields

\begin{align*} \xi_{h} \left( Q_{far} \right) - \xi_{h} \left( Q_{near} \right) &= \left[ C + D \left( \frac{1}{f} \right) \right] - \left[ C + D \left( \frac{1}{n} \right) \right] \\ &= D \left( \frac{1}{f} - \frac{1}{n} \right) \\ &= 0. \end{align*}

(63)

Solving for $D$ , we see that

D = 0

(64)

since $\frac{1}{f} - \frac{1}{n} \neq 0$ . Substituting the constants (60), (62), and (64) back into $\xi_{h}(Q_{right})$ in (58) gives us

\xi_{h} \left( Q_{right} \right) = A r + C + D \left( \frac{1}{n} \right) = A r + C = \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) r + C = \alpha_{max}.

(65)

Solving for $C$ , we see that

\begin{align*} C &= \alpha_{max} - \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) r \\ &= \alpha_{max} - \left( \frac{\alpha_{max} - \alpha_{min}}{r + l} \right) r \\ &= \frac{\alpha_{max} \left( r + l \right) - \left( \alpha_{max} - \alpha_{min} \right) r}{r + l} \\ &= \frac{\alpha_{max} r + \alpha_{max} l - \alpha_{max} r + \alpha_{min} r}{r + l} \\ &= \frac{\alpha_{max} r + \alpha_{max} l - \alpha_{max} r + \alpha_{min} r}{r + l} \\ &= \frac{\alpha_{max} l + \alpha_{min} r}{r + l} \\ &= \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)} \\ \end{align*}

(66)

therefore

C = \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)}.

(67)

Assembling the constants (60), (62), (67), (64) back into (55) we have the complete formula for the auxiliary function $\xi_{h}$

\xi_{h} \left( P \right) = \left( \frac{ \left( \alpha_{max} - \alpha_{min} \right) n }{ r - \left( -l \right)} \right) P_{h} \left( \frac{1}{P_{d}} \right) + \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)}.

(68)

Consider the map $\xi_{v}$ , where

\begin{align*} \xi_{v} \left( P \right) &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} P_{proj,d} + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} \theta \left( P_{d} \right) + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} \left( A^{\prime} P_{d} + B^{\prime} \right) + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} A^{\prime} P_{d} + C^{\prime} B^{\prime} + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + \left( C^{\prime} A^{\prime} \right) P_{d} + \left( C^{\prime} B^{\prime} + D^{\prime} \right) \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C P_{d} + D \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A n P_{h} + B n P_{v} + C P_{d} + D \right) \left( \frac{1}{P_{d}} \right) \\ &= A n P_{h} \left( \frac{1}{P_{d}} \right) + B n P_{v} \left( \frac{1}{P_{d}} \right) + C + D \left( \frac{1}{P_{d}} \right) \\ \end{align*}

(69)

where $C = C^{\prime} A^{\prime}$ and $D = C^{\prime} B^{\prime} + D^{\prime}$ , so that

\xi_{v} \left( P \right) = A n P_{h} \left( \frac{1}{P_{d}} \right) + B n P_{v} \left( \frac{1}{P_{d}} \right) + C + D \left( \frac{1}{P_{d}} \right).

(70)

Define the boundary conditions for our chosen points

\begin{align*} \xi_{v} \left( Q_{left} \right) &= 0 \\ \xi_{v} \left( Q_{right} \right) &= 0 \\ \xi_{v} \left( Q_{bottom} \right) &= \beta_{min} \\ \xi_{v} \left( Q_{top} \right) &= \beta_{max} \\ \xi_{v} \left( Q_{near} \right) &= 0 \\ \xi_{v} \left( Q_{far} \right) &= 0 \\ \end{align*}

(71)

which we need to justify. The view coordinates are orthogonal to each other, and the normalized device coordinates are also orthogonal to each other. This means that $\xi_{v}$ should only be a function of the vertical component and not the horizontal component. The points $Q_{left}, Q_{right}, Q_{near}, Q_{far}$ lie on the depth-horizontal plane, which have a zero vertical component, so they should keep a zero vertical component after transformation.

Applying the boundary conditions, we have

\begin{align*} \xi_{v} \left( Q_{left} \right) &= A n \cdot \left( -l \right) \cdot\left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= -A l + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{v} \left( Q_{right} \right) &= A n \cdot r \cdot \left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= A r + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{v} \left( Q_{bottom} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot \left( -b \right) \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= -B b + C + D \left( \frac{1}{n} \right) \\ &= \beta_{min} \\ \xi_{v} \left( Q_{top} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot t \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= B t + C + D \left( \frac{1}{n} \right) \\ &= \beta_{max} \\ \xi_{v} \left( Q_{near} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{v} \left( Q_{far} \right) &= A n \cdot 0 \cdot \left( \frac{1}{f} \right) + B n \cdot 0 \cdot \left( \frac{1}{f} \right) + C + D \left( \frac{1}{f} \right) \\ &= C + D \left( \frac{1}{f} \right) \\ &= 0 \\ \end{align*}

(72)

so that

\begin{align*} \xi_{v} \left( Q_{left} \right) &= -A l + C + D \left( \frac{1}{n} \right) &&= 0 \\ \xi_{v} \left( Q_{right} \right) &= A r + C + D \left( \frac{1}{n} \right) &&= 0 \\ \xi_{v} \left( Q_{bottom} \right) &= -B b + C + D \left( \frac{1}{n} \right) &&= \beta_{min} \\ \xi_{v} \left( Q_{top} \right) &= B t + C + D \left( \frac{1}{n} \right) &&= \beta_{max} \\ \xi_{v} \left( Q_{near} \right) &= C + D \left( \frac{1}{n} \right) &&= 0 \\ \xi_{v} \left( Q_{far} \right) &= C + D \left( \frac{1}{f} \right) &&= 0 \\ \end{align*}

(73)

and now we compute the constants. Subtracting $\xi_{v} \left( Q_{right} \right)$ from $\xi_{v} \left( Q_{left} \right)$ in (73) yields

\begin{align*} \xi_{v} \left( Q_{right} \right) - \xi_{v} \left( Q_{left} \right) &= \left[ A r + C + D \left( \frac{1}{n} \right) \right] - \left[ -A l + C + D \left( \frac{1}{n} \right) \right] \\ &= A r - \left( - A l \right) \\ &= A \left( r - \left( -l \right) \right) \\ &= 0. \end{align*}

(74)

Solving for $A$ , we see that

A = 0

(75)

since $r - (-l) = r + l \neq 0$ . Subtracting $\xi_{v} \left( Q_{top} \right)$ from $\xi_{v} \left( Q_{bottom} \right)$ in (73) yields

\begin{align*} \xi_{v} \left( Q_{top} \right) - \xi_{v} \left( Q_{bottom} \right) &= \left[ B t + C + D \left( \frac{1}{n} \right) \right] - \left[ -B b + C + D \left( \frac{1}{n} \right) \right] \\ &= B t - B \left( -b \right) \\ &= B \left( t - \left( -b \right) \right) \\ &= \beta_{max} - \beta_{min}. \end{align*}

(76)

Solving for $B$ , we have

B = \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)}.

(77)

Subtracting $\xi_{v} \left( Q_{far} \right)$ from $\xi_{v} \left( Q_{near} \right)$ in (73) yields

\begin{align*} \xi_{v} \left( Q_{far} \right) - \xi_{v} \left( Q_{near} \right) &= \left[ C + D \left( \frac{1}{f} \right) \right] - \left[ C + D \left( \frac{1}{n} \right) \right] \\ &= D \left( \frac{1}{f} - \frac{1}{n} \right) \\ &= 0. \end{align*}

(78)

Solving for $D$ , we see that

D = 0

(79)

since $\frac{1}{f} - \frac{1}{n} \neq 0$ . Substituting the constants (75), (77), and (79) back into $\xi_{v}(Q_{right})$ in (73) gives us

\xi_{v} \left( Q_{top} \right) = B t + C + D \left( \frac{1}{n} \right) = B t + C = \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) t + C = \beta_{max}.

(80)

Solving for $C$ , we see that

\begin{align*} C &= \beta_{max} - \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) t \\ &= \beta_{max} - \left( \frac{\beta_{max} - \beta_{min}}{t + b} \right) t \\ &= \frac{\beta_{max} \left( t + b \right) - \left( \beta_{max} - \beta_{min} \right) t}{t + b} \\ &= \frac{\beta_{max} t + \beta_{max} b - \beta_{max} t + \beta_{min} t}{t + b} \\ &= \frac{\beta_{max} t + \beta_{max} b - \beta_{max} t + \beta_{min} t}{t + b} \\ &= \frac{\beta_{max} b + \beta_{min} t}{t + b} \\ &= \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)} \\ \end{align*}

(81)

therefore

C = \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)}.

(82)

Assembling the constants (75), (77), (82), (79) back into (70) we have the complete formula for the auxiliary function $\xi_{v}$

\xi_{v} \left( P \right) = \left( \frac{ \left( \beta_{max} - \beta_{min} \right) n }{ t - \left( -b \right)} \right) P_{v} \left( \frac{1}{P_{d}} \right) + \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)}.

(83)

Consider the map $\xi_{d}$ , where

\begin{align*} \xi_{d} \left( P \right) &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} P_{proj,d} + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} \theta \left( P_{d} \right) + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} \left( A^{\prime} P_{d} + B^{\prime} \right) + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C^{\prime} A^{\prime} P_{d} + C^{\prime} B^{\prime} + D^{\prime} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + \left( C^{\prime} A^{\prime} \right) P_{d} + \left( C^{\prime} B^{\prime} + D^{\prime} \right) \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A P_{proj,h} + B P_{proj,v} + C P_{d} + D \right) \left( \frac{1}{P_{d}} \right) \\ &= \left( A n P_{h} + B n P_{v} + C P_{d} + D \right) \left( \frac{1}{P_{d}} \right) \\ &= A n P_{h} \left( \frac{1}{P_{d}} \right) + B n P_{v} \left( \frac{1}{P_{d}} \right) + C + D \left( \frac{1}{P_{d}} \right) \\ \end{align*}

(84)

where $C = C^{\prime} A^{\prime}$ and $D = C^{\prime} B^{\prime} + D^{\prime}$ , so that

\xi_{d} \left( P \right) = A n P_{h} \left( \frac{1}{P_{d}} \right) + B n P_{v} \left( \frac{1}{P_{d}} \right) + C + D \left( \frac{1}{P_{d}} \right).

(85)

Define the boundary conditions for our chosen points

\begin{align*} \xi_{d} \left( Q_{left} \right) &= 0 \\ \xi_{d} \left( Q_{right} \right) &= 0 \\ \xi_{d} \left( Q_{bottom} \right) &= 0 \\ \xi_{d} \left( Q_{top} \right) &= 0 \\ \xi_{d} \left( Q_{near} \right) &= \gamma_{min} \\ \xi_{d} \left( Q_{far} \right) &= \gamma_{max} \\ \end{align*}

(86)

which we need to justify. The view coordinates are orthogonal to each other, and the normalized device coordinates are also orthogonal to each other. This means that $\xi_{d}$ should only be a function of the depth component and not the horizontal or vertical components. The points $Q_{left}, Q_{right}, Q_{bottom}, Q_{top}$ lie on the near plane, which have a depth component of $n$ . Since perspective projection projects depth components onto the near plane, points already on the near plane should stay on the near plane. Similarly, points on the far plane should stay on the far plane. Consequently, the depth component should have no dependence on the horizontal or vertical components, only a depth term and an affine translation term.

Applying the boundary conditions, we have

\begin{align*} \xi_{d} \left( Q_{left} \right) &= A n \cdot \left( -l \right) \cdot\left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= -A l + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{d} \left( Q_{right} \right) &= A n \cdot r \cdot \left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= A r + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{d} \left( Q_{bottom} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot \left( -b \right) \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= -B b + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{d} \left( Q_{top} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot t \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= B t + C + D \left( \frac{1}{n} \right) \\ &= 0 \\ \xi_{d} \left( Q_{near} \right) &= A n \cdot 0 \cdot \left( \frac{1}{n} \right) + B n \cdot 0 \cdot \left( \frac{1}{n} \right) + C + D \left( \frac{1}{n} \right) \\ &= C + D \left( \frac{1}{n} \right) \\ &= \gamma_{min} \\ \xi_{d} \left( Q_{far} \right) &= A n \cdot 0 \cdot \left( \frac{1}{f} \right) + B n \cdot 0 \cdot \left( \frac{1}{f} \right) + C + D \left( \frac{1}{f} \right) \\ &= C + D \left( \frac{1}{f} \right) \\ &= \gamma_{max} \\ \end{align*}

(87)

so that

\begin{align*} \xi_{d} \left( Q_{left} \right) &= -A l + C + D \left( \frac{1}{n} \right) &&= \gamma_{min} \\ \xi_{d} \left( Q_{right} \right) &= A r + C + D \left( \frac{1}{n} \right) &&= \gamma_{min} \\ \xi_{d} \left( Q_{bottom} \right) &= -B b + C + D \left( \frac{1}{n} \right) &&= \gamma_{min} \\ \xi_{d} \left( Q_{top} \right) &= B t + C + D \left( \frac{1}{n} \right) &&= \gamma_{min} \\ \xi_{d} \left( Q_{near} \right) &= C + D \left( \frac{1}{n} \right) &&= \gamma_{min} \\ \xi_{d} \left( Q_{far} \right) &= C + D \left( \frac{1}{f} \right) &&= \gamma_{max} \\ \end{align*}

(88)

and now we compute the constants. and now we compute the constants. Subtracting $\xi_{d} \left( Q_{right} \right)$ from $\xi_{d} \left( Q_{left} \right)$ in (88) yields

\begin{align*} \xi_{d} \left( Q_{right} \right) - \xi_{d} \left( Q_{left} \right) &= \left[ A r + C + D \left( \frac{1}{n} \right) \right] - \left[ -A l + C + D \left( \frac{1}{n} \right) \right] \\ &= A r - \left( - A l \right) \\ &= A \left( r - \left( -l \right) \right) \\ &= \gamma_{min} - \gamma_{min} \\ &= 0. \end{align*}

(89)

Solving for $A$ , we see that

A = 0

(90)

since $r - (-l) = r + l \neq 0$ . Subtracting $\xi_{d} \left( Q_{top} \right)$ from $\xi_{d} \left( Q_{bottom} \right)$ in (58) yields

\begin{align*} \xi_{d} \left( Q_{top} \right) - \xi_{d} \left( Q_{bottom} \right) &= \left[ B t + C + D \left( \frac{1}{n} \right) \right] - \left[ -B b + C + D \left( \frac{1}{n} \right) \right] \\ &= B t - B \left( -b \right) \\ &= B \left( t - \left( -b \right) \right) \\ &= \gamma_{min} - \gamma_{min} \\ &= 0. \end{align*}

(91)

Solving for $B$ , we see that

B = 0

(92)

since $t - (-b) = t + b \neq 0$ . Subtracting $\xi_{d} \left( Q_{far} \right)$ from $\xi_{d} \left( Q_{near} \right)$ in (88) yields

\begin{align*} \xi_{d} \left( Q_{far} \right) - \xi_{d} \left( Q_{near} \right) &= \left[ C + D \left( \frac{1}{f} \right) \right] - \left[ C + D \left( \frac{1}{n} \right) \right] \\ &= D \left( \frac{1}{f} - \frac{1}{n} \right) \\ &= D \left( \frac{n - f}{f n} \right) \\ &= D \left( -\frac{f - n}{f n} \right) \\ &= \gamma_{max} - \gamma_{min}. \end{align*}

(93)

Solving for $D$ , we see that

D = -\frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{f - n}.

(94)

Substituting the constants (90), (92), and (94) back into $\xi_{d}(Q_{far})$ in (88) gives us

\begin{align*} \xi_{d} \left( Q_{far} \right) &= C + D \left( \frac{1}{f} \right) \\ &= C + \left( -\frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \right) \left( \frac{1}{f} \right) \\ &= C + \left( -\frac{ \left( \gamma_{max} - \gamma_{min} \right) n }{f - n} \right) \\ &= \gamma_{max}. \end{align*}

(95)

Solving for $C$

\begin{align*} C &= \gamma_{max} + \left( \frac{ \left( \gamma_{max} - \gamma_{min} \right) n }{f - n} \right) \\ &= \frac{\gamma_{max} \left( f - n \right) + \left( \gamma_{max} - \gamma_{min} \right) n}{f - n} \\ &= \frac{\gamma_{max} f - \gamma_{max} n + \gamma_{max} n - \gamma_{min} n}{f - n} \\ &= \frac{\gamma_{max} f - \gamma_{min} n}{f - n} \\ \end{align*}

(96)

we have

C = \frac{\gamma_{max} f - \gamma_{min} n}{f - n}.

(97)

Assembling the constants (90), (92), (97), (94) back into (85) we have the complete formula for the auxiliary function $\xi_{d}$

\xi_{d} \left( P \right) = \frac{\gamma_{max} f - \gamma_{min} n}{f - n} - \left( \frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \right) \left( \frac{1}{P_{d}} \right).

(98)

Now we return to the definitions for the clip space functions. Recall from (49) that

\begin{align*} P_{ndc,h} \equiv \xi_{h} \left( P \right) &\equiv \phi_{h} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right) } \right) = \frac{P_{clip,h}}{P_{clip,w}} \\ P_{ndc,v} \equiv \xi_{v} \left( P \right) &\equiv \phi_{v} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right) } \right) = \frac{P_{clip,v}}{P_{clip,w}} \\ P_{ndc,d} \equiv \xi_{d} \left( P \right) &\equiv \phi_{d} \left( P_{proj} \right) \left( \frac{1}{\phi_{w} \left( P_{proj} \right) } \right) = \frac{P_{clip,d}}{P_{clip,w}} \\ \end{align*}

(99)

where $P_{proj}$ denotes a point $P \in \mathbb{R}^{3}$ in projected coordinates, and $P_{clip,w} = \phi_{w}(P_{proj}) = P_{d}$ is an affine scalar function. We now derive the clip coordinate functions $\phi_{h}, \phi_{v}, \phi_{d}$ given by

\begin{align*} P_{clip,h} &= \phi_{h} \left( P_{proj} \right) \\ P_{clip,v} &= \phi_{v} \left( P_{proj} \right) \\ P_{clip,d} &= \phi_{d} \left( P_{proj} \right) \\ \end{align*}

(100)

by comparing the equations in (52) with (68), (83), and (98). Consider the affine map $\phi_{h}$ . Rearranging terms in (68), we have

\begin{align*} P_{ndc,h} &= \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) P_{proj,h} + \frac{\alpha_{min} r - \alpha_{min} \left( -l \right)}{r - \left( -l \right)} \\ &= \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) n P_{h} \left( \frac{1}{P_{d}} \right) + \frac{\alpha_{min} r - \alpha_{min} \left( -l \right)}{r - \left( -l \right)} \\ &= \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) n P_{h} \left( \frac{1}{P_{d}} \right) + \left( \frac{ \alpha_{min} r - \alpha_{min} \left( -l \right) }{r - \left( -l \right)} \right) P_{d} \left( \frac{1}{P_{d}} \right) \\ &= \left( \frac{ \left( \alpha_{max} - \alpha_{min} \right) n }{r - \left( -l \right)} \right) P_{h} \left( \frac{1}{P_{d}} \right) + \left( \frac{ \alpha_{min} r - \alpha_{min} \left( -l \right) }{r - \left( -l \right)} \right) P_{d} \left( \frac{1}{P_{d}} \right) \\ &= \left[ \left( \frac{ \left( \alpha_{max} - \alpha_{min} \right) n }{r - \left( -l \right)} \right) P_{h} + \left( \frac{ \alpha_{min} r - \alpha_{min} \left( -l \right) }{r - \left( -l \right)} \right) P_{d} \right] \left( \frac{1}{P_{d}} \right). \end{align*}

(101)

Comparing (101) with (52), we see that

P_{clip,h} = \phi_{h} \left( P_{proj} \right) = \left( \frac{ \left( \alpha_{max} - \alpha_{min} \right) n }{r - \left( -l \right)} \right) P_{h} + \left( \frac{ \alpha_{min} r - \alpha_{min} \left( -l \right) }{r - \left( -l \right)} \right) P_{d}.

(102)

Consider the affine map $\phi_{v}$ . Rearranging terms in (83) we have

\begin{align*} P_{ndc,v} &= \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) P_{proj,v} + \frac{\beta_{min} t - \beta_{min} \left( -b \right)}{t - \left( -b \right)} \\ &= \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) n P_{v} \left( \frac{1}{P_{d}} \right) + \frac{\beta_{min} t - \beta_{min} \left( -b \right)}{t - \left( -b \right)} \\ &= \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) n P_{v} \left( \frac{1}{P_{d}} \right) + \left( \frac{ \beta_{min} t - \beta_{min} \left( -b \right) }{t - \left( -b \right)} \right) P_{d} \left( \frac{1}{P_{d}} \right) \\ &= \left( \frac{ \left( \beta_{max} - \beta_{min} \right) n }{t - \left( -b \right)} \right) P_{v} \left( \frac{1}{P_{d}} \right) + \left( \frac{ \beta_{min} t - \beta_{min} \left( -b \right) }{t - \left( -b \right)} \right) P_{d} \left( \frac{1}{P_{d}} \right) \\ &= \left[ \left( \frac{ \left( \beta_{max} - \beta_{min} \right) n }{t - \left( -b \right)} \right) P_{v} + \left( \frac{ \beta_{min} t - \beta_{min} \left( -b \right) }{t - \left( -b \right)} \right) P_{d} \right] \left( \frac{1}{P_{d}} \right). \end{align*}

(103)

Comparing (103) with (52) we see that

P_{clip,v} = \phi_{v} \left( P_{proj} \right) = \left( \frac{ \left( \beta_{max} - \beta_{min} \right) n }{t - \left( -b \right)} \right) P_{v} + \left( \frac{ \beta_{min} t - \beta_{min} \left( -b \right) }{t - \left( -b \right)} \right) P_{d}.

(104)

Finally, consider the affine map $\phi_{d}$ . Rearranging terms in (98) we have

\begin{align*} P_{ndc,d} &= \frac{\gamma_{max} f - \gamma_{min} n}{f - n} - \left( \frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \right) \left( \frac{1}{P_{d}} \right) \\ &= \left[ \left( \frac{\gamma_{max} f - \gamma_{min} n}{f - n} \right) P_{d} - \frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{ f - n } \right] \left( \frac{1}{P_{d}} \right) \\ \end{align*}

(105)

Comparing (105) with (52) we see that

P_{clip,d} = \phi_{d} \left( P_{proj} \right) = \left( \frac{\gamma_{max} f - \gamma_{min} n}{f - n} \right) P_{d} - \frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{ f - n }.

(106)

Since $\phi_{h}$ , $\phi_{v}$ , and $\phi_{d}$ are the components of an affine transformation, and normalized device coordinates are a projective function of the view coordinates with $\phi_{w}(P_{proj}) = P_{d}$ as the denominator, the perspective projection transformation has the form of the matrix in (38) lifted into $\mathbb{RP}^{3}$ . The resulting matrix equation for the matrix representation is then

\begin{bmatrix} P_{clip,h} \\ P_{clip,v} \\ P_{clip,d} \\ P_{clip,w} \\ \end{bmatrix} = \begin{bmatrix} \frac{\left( \alpha_{max} - \alpha_{min} \right) n }{r - \left( -l \right)} & 0 & \frac{\alpha_{min}r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{\left( \beta_{max} - \beta_{min} \right) n }{t - \left( -b \right)} & \frac{\beta_{min}t - \beta_{max} \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{\gamma_{max}f - \gamma_{min}n}{f - n} & -\frac{\left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} P_{h} \\ P_{v} \\ P_{d} \\ P_{w} \\ \end{bmatrix}.

(107)

This yields the first major result, the canonical perspective projection matrix $M^{C}_{per}$ .

Perspective Projection Matrix

M^{C}_{per} = \begin{bmatrix} \frac{\left( \alpha_{max} - \alpha_{min} \right) n }{r - \left( -l \right)} & 0 & \frac{\alpha_{min}r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{\left( \beta_{max} - \beta_{min} \right) n }{t - \left( -b \right)} & \frac{\beta_{min}t - \beta_{max} \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{\gamma_{max}f - \gamma_{min}n}{f - n} & -\frac{\left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(108)

This completes the derivation of the perspective projection matrix.

The Canonical Orthographic Projection Matrix¶

We now construct the canonical orthographic projection matrix for the frame $(\tilde{O}_{view}, (\mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d}))$ with the orthographic view volume parametrized by $[-l, r] \times [-b, t] \times [n, f]$ and the canonical view volume parametrized by $[\alpha_{min}, \alpha_{max}] \times [\beta_{min}, \beta_{max}] \times [\gamma_{min}, \gamma_{max}]$ .

An orthographic projection works by removing the component parallel to the direction normal to the plane of projection in three dimensions. Orthographic projections preserve the lengths of lines parallel to the projection plane. It produces the component of a geometric figure that is parallel to the plane of projection. Unlike perspective projection, there is no depth distortion. In terms of our orthonormal frame, an orthographic projection removes the depth component from a vector. Let $\tilde{P} \in \mathbb{E}^{3}$ be a point. The orthographic projection is a map $T^{\prime}_{orth} : \mathbb{E}^{3} \rightarrow \mathbb{E}^{3}$ given by

T^{\prime}_{orth} \left( \tilde{P} \right) = \tilde{P} - \left( \left( \tilde{P} - \tilde{O}_{view} \right) \cdot \mathbf{\hat{n}} \right) \mathbf{\hat{n}}.

(109)

In the coordinate frame $(\tilde{O}_{view}, (\mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_{v}, \mathbf{\hat{u}}_{d}))$ let $P = \tilde{P} - \tilde{O}_{view}$ be a representation of the point $P$ given by $P = P_{h} \mathbf{\hat{u}}_{h} + P_{v} \mathbf{\hat{u}}_{v} + P_{d} \mathbf{\hat{u}}_{d}$ . The orthographic projection has the form

T^{\prime}_{orth} \left( P \right) = P - \left( P \cdot \mathbf{\hat{u}}_{d} \right) \mathbf{\hat{u}}_{d} = P - P_{d} \mathbf{\hat{u}}_{d} = P_{h} \mathbf{\hat{u}}_{h} + P_{v} \mathbf{\hat{u}}_{v}

(110)

so that $T^{\prime}_{orth}$ indeed projects out the depth component as claimed. This is the usual definition of orthographic projection. The problem is that we still need depth information for rendering, so this information must be preserved in our true projection. Since orthographic projection does not distort depth in any way, and indeed has no denominator, it is an affine transformation. Thus the orthographic projection needs to carry depth information. We need to add the depth information along the normal vector to the projection plane into the transformation to get our projected coordinates. Let $T_{orth} : \mathbb{E}^{3} \rightarrow \mathbb{E}^{3}$ be the transformation given by

T_{orth} \left( \tilde{P} \right) = T^{\prime}_{orth} \left( \tilde{P} \right) + \left( \left( \tilde{P} - \tilde{O}_{view} \right) \cdot \mathbf{\hat{n}} \right) \mathbf{\hat{n}}.

(111)

Expanding out the definition

T_{orth} \left( \tilde{P} \right) = \tilde{P} - \left( \left( \tilde{P} - \tilde{O}_{view} \right) \cdot \mathbf{\hat{n}} \right) \mathbf{\hat{n}} + \left( \left( \tilde{P} - \tilde{O}_{view} \right) \cdot \mathbf{\hat{n}} \right) \mathbf{\hat{n}} = \tilde{P} = I \left( P \right)

(112)

where $I$ denotes the identity map $I : \mathbb{E}^{3} \rightarrow \mathbb{E}^{3}$ . We see that the orthographic projection is simply the identity map when the depth information if factored back in. In components,

T_{orth} \left( P \right) = I \left( P \right) = P

(113)

so that

\begin{align*} P_{proj,h} &= P_{h} \\ P_{proj,v} &= P_{v} \\ P_{proj,d} &= P_{d} \\ P_{proj,w} &= 1 \\ \end{align*}

(114)

gives our projected coordinates. Thus we map directly to clip coordinates from view coordinates. To complete the orthographic projection transformation, we need an affine transformation that maps the orthographic view volume to the canonical view volume in normalized device coordinates. Just like the perspective projection earlier, we infer the orthographic projection matrix indirectly using constraints on how the orthographic view volume transforms into the canonical view volume. We require affine maps $\phi_{h}, \phi_{v}, \phi_{d}, \phi_{w} : \mathbb{R^{3}} \rightarrow \mathbb{R}$ such that

\begin{align*} P_{clip,h} &= \phi_{h} \left( P_{proj} \right) \\ P_{clip,v} &= \phi_{v} \left( P_{proj} \right) \\ P_{clip,d} &= \phi_{d} \left( P_{proj} \right) \\ P_{clip,w} &= \phi_{w} \left( P_{proj} \right) \\ \end{align*}

(115)

where we get $\phi_{w}$ immediately because orthographic projections are affine maps. That is

\phi_{w} \left( P \right) = 1.

(116)

where we use the fact that $P_{proj} = P$ . Since affine transformations do not perform depth normalization, we see that

\begin{align*} P_{ndc,h} &= P_{clip,h} \\ P_{ndc,v} &= P_{clip,v} \\ P_{ndc,d} &= P_{clip,d} \\ P_{ndc,w} &= P_{clip,w} \\ \end{align*}

(117)

therefore

\begin{align*} P_{ndc,h} &= \phi_{h} \left( P_{proj} \right) &= \phi_{h} \left( P \right) \\ P_{ndc,v} &= \phi_{v} \left( P_{proj} \right) &= \phi_{v} \left( P \right) \\ P_{ndc,d} &= \phi_{d} \left( P_{proj} \right) &= \phi_{d} \left( P \right) \\ \end{align*}

(118)

are the equations we need to solve for. As with the perspective projection transformation, we use the constraints on the orthographic view volume to compute the functions. To establish constraints, we need to talk about some well chosen points. We need to construct the maps $\phi_{h}, \phi_{v}, \phi_{d}$ such that the parametrization of the orthographic view volume maps to the parametrization of the canonical view volume. That it, such that coordinates map as $-l \mapsto \alpha_{min}$ , $r \mapsto \alpha_{max}$ , $-b \mapsto \beta_{min}$ , $t \mapsto \beta_{max}$ , $n \mapsto \gamma_{min}$ , $f \mapsto \gamma_{max}$ . Consider the points in view coordinates

\begin{align*} Q_{left} &= -l \mathbf{\hat{u}}_{h} + n \mathbf{\hat{u}}_{d} \\ Q_{right} &= r \mathbf{\hat{u}}_{h} + n \mathbf{\hat{u}}_{d} \\ Q_{bottom} &= -b \mathbf{\hat{u}}_{v} + n \mathbf{\hat{u}}_{d} \\ Q_{top} &= t \mathbf{\hat{u}}_{v} + n \mathbf{\hat{u}}_{d} \\ Q_{near} &= n \mathbf{\hat{u}}_{d} \\ Q_{far} &= f \mathbf{\hat{u}}_{d} \\ \end{align*}

(119)

The points $Q_{near}$ and $Q_{far}$ are the points along the viewing axis that intersect the near plane and the far plane, respectively, of the orthographic view volume. The point $Q_{left}$ represents the point of intersection of the left plane, near plane, and the horizontal-vertical plane of the view volume. The point $Q_{right}$ represents the point of intersection of the right plane, near plane, and the horizontal-vertical plane of the view volume. The point $Q_{bottom}$ represents the point of intersection of the bottom plane, near plane, and depth-vertical plane of the view volume. The point $Q_{top}$ represents the point of intersection of the top plane, near plane, and the depth-vertical plane of the view volume. In short, the points $Q_{near}$ and $Q_{far}$ are the origins of the near and far planes, respectively. The other four points and points chosen along the edge of the viewport in the near plane that allow us to easily set up the boundary conditions to compute the functions $\phi_{h}, \phi_{v}, \phi_{d}$ .

Consider the map $\phi_{h}$ , where

\phi_{h} \left( P \right) = A P_{h} + B P_{v} + C P_{d} + D

(120)

Define the boundary conditions for our chosen points

\begin{align*} \phi_{h} \left( Q_{left} \right) &= \alpha_{min} \\ \phi_{h} \left( Q_{right} \right) &= \alpha_{max} \\ \phi_{h} \left( Q_{bottom} \right) &= 0 \\ \phi_{h} \left( Q_{top} \right) &= 0 \\ \phi_{h} \left( Q_{near} \right) &= 0 \\ \phi_{h} \left( Q_{far} \right) &= 0 \\ \end{align*}

(121)

which we need to justify. The view coordinates are orthogonal to each other, and the normalized device coordinates are also orthogonal to each other. This means that $\phi_{h}$ should only be a function of the horizontal component and not the vertical component. The points $Q_{bottom}, Q_{top}, Q_{near}, Q_{far}$ lie on the depth-vertical plane, which have a zero horizontal component, so they should keep a zero horizontal component after transformation.

Applying the boundary conditions, we have

\begin{align*} \phi_{h} \left( Q_{left} \right) &= A \cdot \left( -l \right) + B \cdot 0 + C n + D &&= -A l + C n + D &&= \alpha_{min} \\ \phi_{h} \left( Q_{right} \right) &= A \cdot r + B \cdot 0 + C n + D &&= A r + C n + D &&= \alpha_{max} \\ \phi_{h} \left( Q_{bottom} \right) &= A \cdot 0 + B \cdot \left( -b \right) + C n + D &&= -B b + C n + D &&= 0 \\ \phi_{h} \left( Q_{top} \right) &= A \cdot 0 + B \cdot t + C n + D &&= B t + C n + D &&= 0 \\ \phi_{h} \left( Q_{near} \right) &= A \cdot 0 + B \cdot 0 + C n + D &&= C n + D &&= 0 \\ \phi_{h} \left( Q_{far} \right) &= A \cdot 0 + B \cdot 0 + C f + D &&= C f + D &&= 0 \\ \end{align*}

(122)

so that

\begin{align*} \phi_{h} \left( Q_{left} \right) &= -A l + C n + D &&= \alpha_{min} \\ \phi_{h} \left( Q_{right} \right) &= A r + C n + D &&= \alpha_{max} \\ \phi_{h} \left( Q_{bottom} \right) &= -B b + C n + D &&= 0 \\ \phi_{h} \left( Q_{top} \right) &= B t + C n + D &&= 0 \\ \phi_{h} \left( Q_{near} \right) &= C n + D &&= 0 \\ \phi_{h} \left( Q_{far} \right) &= C f + D &&= 0 \\ \end{align*}

(123)

and now we compute the constants. Subtracting $\phi_{h} \left( Q_{right} \right)$ from $\phi_{h} \left( Q_{left} \right)$ in (123) yields

\begin{align*} \phi_{h} \left( Q_{right} \right) - \phi_{h} \left( Q_{left} \right) &= \left[ A r + C n + D \right] - \left[ -A l + C n + D \right] \\ &= A r - \left( - A l \right) \\ &= A \left( r - \left( -l \right) \right) \\ &= \alpha_{max} - \alpha_{min}. \end{align*}

(124)

Solving for $A$ , we see that

A = \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)}.

(125)

Subtracting $\phi_{h} \left( Q_{top} \right)$ from $\phi_{h} \left( Q_{bottom} \right)$ in (123) yields

\begin{align*} \phi_{h} \left( Q_{top} \right) - \phi_{h} \left( Q_{bottom} \right) &= \left[ B t + C n + D \right] - \left[ -B b + C n + D \right] \\ &= B t - B \left( -b \right) \\ &= B \left( t - \left( -b \right) \right) \\ &= 0. \end{align*}

(126)

Solving for $B$ , we see that

B = 0

(127)

since $t - (-b) = t + b \neq 0$ . Subtracting $\phi_{h} \left( Q_{far} \right)$ from $\phi_{h} \left( Q_{near} \right)$ in (123) yields

\begin{align*} \phi_{h} \left( Q_{far} \right) - \phi_{h} \left( Q_{near} \right) &= \left[ C f + D \right] - \left[ C n + D \right] \\ &= C f - C n \\ &= C \left( f - n \right) \\ &= 0. \end{align*}

(128)

Solving for $C$ , we see that

C = 0

(129)

since $f - n \neq 0$ . Substituting the constants (125), (127), and (129) back into $\phi_{h}(Q_{right})$ in (123) gives us

\phi_{h} \left( Q_{right} \right) = A r + C n + D = A r + D = \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) r + D = \alpha_{max}.

(130)

Solving for $D$

\begin{align*} D &= \alpha_{max} - \left( \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} \right) r \\ &= \alpha_{max} - \left( \frac{\alpha_{max} - \alpha_{min}}{r + l} \right) r \\ &= \frac{\alpha_{max} \left( r + l \right) - \left( \alpha_{max} - \alpha_{min} \right) r}{r + l} \\ &= \frac{\alpha_{max} r + \alpha_{max} l - \alpha_{max} r + \alpha_{min} r}{r + l} \\ &= \frac{\alpha_{max} r + \alpha_{max} l - \alpha_{max} r + \alpha_{min} r}{r + l} \\ &= \frac{\alpha_{max} l + \alpha_{min} r}{r + l} \\ &= \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)} \\ \end{align*}

(131)

we have

D = \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)}.

(132)

Assembling the constants (125), (127), (129), (132) back into (120) we have the complete formula for the function $\phi_{h}$

\phi_{h} \left( P \right) = \left( \frac{ \alpha_{max} - \alpha_{min} }{ r - \left( -l \right)} \right) P_{h} + \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)}.

(133)

Consider the map $\phi_{v}$ , where

\phi_{v} \left( P \right) = A P_{h} + B P_{v} + C P_{d} + D

(134)

Define the boundary conditions for our chosen points

\begin{align*} \phi_{v} \left( Q_{left} \right) &= 0 \\ \phi_{v} \left( Q_{right} \right) &= 0 \\ \phi_{v} \left( Q_{bottom} \right) &= \beta_{min} \\ \phi_{v} \left( Q_{top} \right) &= \beta_{max} \\ \phi_{v} \left( Q_{near} \right) &= 0 \\ \phi_{v} \left( Q_{far} \right) &= 0 \\ \end{align*}

(135)

which we need to justify. The view coordinates are orthogonal to each other, and the normalized device coordinates are also orthogonal to each other. This means that $\phi_{v}$ should only be a function of the vertical component and not the horizontal component. The points $Q_{left}, Q_{right}, Q_{near}, Q_{far}$ lie on the depth-horizontal plane, which have a zero vertical component, so they should keep a zero vertical component after transformation.

Applying the boundary conditions, we have

\begin{align*} \phi_{v} \left( Q_{left} \right) &= A \cdot \left( -l \right) + B \cdot 0 + C n + D &&= -A l + C n + D &&= 0 \\ \phi_{v} \left( Q_{right} \right) &= A \cdot r + B \cdot 0 + C n + D &&= A r + C n + D &&= 0 \\ \phi_{v} \left( Q_{bottom} \right) &= A \cdot 0 + B \cdot \left( -b \right) + C n + D &&= -B b + C n + D &&= \beta_{min} \\ \phi_{v} \left( Q_{top} \right) &= A \cdot 0 + B \cdot t + C n + D &&= B t + C n + D &&= \beta_{max} \\ \phi_{v} \left( Q_{near} \right) &= A \cdot 0 + B \cdot 0 + C n + D &&= C n + D &&= 0 \\ \phi_{v} \left( Q_{far} \right) &= A \cdot 0 + B \cdot 0 + C f + D &&= C f + D &&= 0 \\ \end{align*}

(136)

so that

\begin{align*} \phi_{v} \left( Q_{left} \right) &= -A l + C n + D &&= 0 \\ \phi_{v} \left( Q_{right} \right) &= A r + C n + D &&= 0 \\ \phi_{v} \left( Q_{bottom} \right) &= -B b + C n + D &&= \beta_{min} \\ \phi_{v} \left( Q_{top} \right) &= B t + C n + D &&= \beta_{max} \\ \phi_{v} \left( Q_{near} \right) &= C n + D &&= 0 \\ \phi_{v} \left( Q_{far} \right) &= C f + D &&= 0 \\ \end{align*}

(137)

and now we compute the constants. Subtracting $\phi_{v} \left( Q_{right} \right)$ from $\phi_{v} \left( Q_{left} \right)$ in (137) yields

\begin{align*} \phi_{v} \left( Q_{right} \right) - \phi_{v} \left( Q_{left} \right) &= \left[ A r + C n + D \right] - \left[ -A l + C n + D \right] \\ &= A r - \left( - A l \right) \\ &= A \left( r - \left( -l \right) \right) \\ &= 0. \end{align*}

(138)

Solving for $A$ , we see that

A = 0

(139)

since $r - (-l) = r + l \neq 0$ . Subtracting $\phi_{v} \left( Q_{top} \right)$ from $\phi_{v} \left( Q_{bottom} \right)$ in (137) yields

\begin{align*} \phi_{v} \left( Q_{top} \right) - \phi_{v} \left( Q_{bottom} \right) &= \left[ B t + C n + D \right] - \left[ -B b + C n + D \right] \\ &= B t - B \left( -b \right) \\ &= B \left( t - \left( -b \right) \right) \\ &= \beta_{max} - \beta_{min}. \end{align*}

(140)

Solving for $B$ , we see that

B = \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)}.

(141)

Subtracting $\phi_{v} \left( Q_{far} \right)$ from $\phi_{v} \left( Q_{near} \right)$ in (137) yields

\begin{align*} \phi_{v} \left( Q_{far} \right) - \phi_{v} \left( Q_{near} \right) &= \left[ C f + D \right] - \left[ C n + D \right] \\ &= C f - C n \\ &= C \left( f - n \right) \\ &= 0. \end{align*}

(142)

Solving for $C$ , we see that

C = 0

(143)

since $f - n \neq 0$ . Substituting the constants (139), (141), and (143) back into $\phi_{v}(Q_{top})$ in (137) gives us

\phi_{v} \left( Q_{top} \right) = B t + C n + D = B t + D = \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) t + D = \beta_{max}.

(144)

Solving for $D$

\begin{align*} D &= \beta_{max} - \left( \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} \right) t \\ &= \beta_{max} - \left( \frac{\beta_{max} - \beta_{min}}{t + b} \right) t \\ &= \frac{\beta_{max} \left( t + b \right) - \left( \beta_{max} - \beta_{min} \right) t}{t + b} \\ &= \frac{\beta_{max} t + \beta_{max} b - \beta_{max} t + \beta_{min} t}{t + b} \\ &= \frac{\beta_{max} t + \beta_{max} b - \beta_{max} t + \beta_{min} t}{t + b} \\ &= \frac{\beta_{max} b + \beta_{min} t}{t + b} \\ &= \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)} \\ \end{align*}

(145)

we have

D = \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)}.

(146)

Assembling the constants (139), (141), (143), (146) back into (134) we have the complete formula for the function $\phi_{v}$

\phi_{v} \left( P \right) = \left( \frac{ \beta_{max} - \beta_{min} }{ t - \left( -b \right)} \right) P_{v} + \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)}.

(147)

Consider the map $\phi_{d}$ , where

\phi_{d} \left( P \right) = A P_{h} + B P_{v} + C P_{d} + D

(148)

Define the boundary conditions for our chosen points

\begin{align*} \phi_{d} \left( Q_{left} \right) &= \gamma_{min} \\ \phi_{d} \left( Q_{right} \right) &= \gamma_{min} \\ \phi_{d} \left( Q_{bottom} \right) &= \gamma_{min} \\ \phi_{d} \left( Q_{top} \right) &= \gamma_{min} \\ \phi_{d} \left( Q_{near} \right) &= \gamma_{min} \\ \phi_{d} \left( Q_{far} \right) &= \gamma_{max} \\ \end{align*}

(149)

which we need to justify. The view coordinates are orthogonal to each other, and the normalized device coordinates are also orthogonal to each other. This means that $\phi_{d}$ should only be a function of the depth component and not the horizontal or vertical components. The points $Q_{left}, Q_{right}, Q_{bottom}, Q_{top}$ lie on the near plane, which have a depth component of $n$ . Since orthographic projection is just the identity, points already on the near plane should stay on the near plane. Similarly, points on the far plane should stay on the far plane. Consequently, the depth component should have no dependence on the horizontal or vertical components, only a depth term and an affine translation term.

Applying the boundary conditions, we have

\begin{align*} \phi_{d} \left( Q_{left} \right) &= A \cdot \left( -l \right) + B \cdot 0 + C n + D &&= -A l + C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{right} \right) &= A \cdot r + B \cdot 0 + C n + D &&= A r + C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{bottom} \right) &= A \cdot 0 + B \cdot \left( -b \right) + C n + D &&= -B b + C n + D &&= 0 \\ \phi_{d} \left( Q_{top} \right) &= A \cdot 0 + B \cdot t + C n + D &&= B t + C n + D &&= 0 \\ \phi_{d} \left( Q_{near} \right) &= A \cdot 0 + B \cdot 0 + C n + D &&= C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{far} \right) &= A \cdot 0 + B \cdot 0 + C f + D &&= C f + D &&= \gamma_{max} \\ \end{align*}

(150)

so that

\begin{align*} \phi_{d} \left( Q_{left} \right) &= -A l + C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{right} \right) &= A r + C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{bottom} \right) &= -B b + C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{top} \right) &= B t + C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{near} \right) &= C n + D &&= \gamma_{min} \\ \phi_{d} \left( Q_{far} \right) &= C f + D &&= \gamma_{max} \\ \end{align*}

(151)

and now we compute the constants. Subtracting $\phi_{d} \left( Q_{right} \right)$ from $\phi_{d} \left( Q_{left} \right)$ in (151) yields

\begin{align*} \phi_{d} \left( Q_{right} \right) - \phi_{d} \left( Q_{left} \right) &= \left[ A r + C n + D \right] - \left[ -A l + C n + D \right] \\ &= A r - \left( - A l \right) \\ &= A \left( r - \left( -l \right) \right) \\ &= \gamma_{min} - \gamma_{min} \\ &= 0. \end{align*}

(152)

Solving for $A$ , we see that

A = 0

(153)

since $r - (-l) = r + l \neq 0$ . Subtracting $\phi_{d} \left( Q_{top} \right)$ from $\phi_{d} \left( Q_{bottom} \right)$ in (151) yields

\begin{align*} \phi_{d} \left( Q_{top} \right) - \phi_{d} \left( Q_{bottom} \right) &= \left[ B t + C n + D \right] - \left[ -B b + C n + D \right] \\ &= B t - B \left( -b \right) \\ &= B \left( t - \left( -b \right) \right) \\ &= \gamma_{min} - \gamma_{min} \\ &= 0. \end{align*}

(154)

Solving for $B$ , we see that

B = 0

(155)

since $t - (-b) = t + b \neq 0$ . Subtracting $\phi_{d} \left( Q_{far} \right)$ from $\phi_{d} \left( Q_{near} \right)$ in (151) yields

\begin{align*} \phi_{d} \left( Q_{far} \right) - \phi_{d} \left( Q_{near} \right) &= \left[ C f + D \right] - \left[ C n + D \right] \\ &= C f - C n \\ &= C \left( f - n \right) \\ &= \gamma_{max} - \gamma_{min}. \end{align*}

(156)

Solving for $C$ , we see that

C = \frac{ \gamma_{max} - \gamma_{min} }{f - n}

(157)

Substituting the constants (153), (155), and (157) back into $\phi_{d}(Q_{far})$ in (151) gives us

\begin{align*} \phi_{d} \left( Q_{far} \right) &= C f + D &= \left( \frac{\gamma_{max} - \gamma_{min}}{f - n} \right) f + D &= \gamma_{max}. \end{align*}

(158)

Solving for $D$

\begin{align*} D &= \gamma_{max} - \left( \frac{\gamma_{max} - \gamma_{min}}{f - n} \right) f \\ &= \frac{\gamma_{max} \left( f - n \right) - \left( \gamma_{max} - \gamma_{min} \right) f}{f - n} \\ &= \frac{\gamma_{max} f - \gamma_{max} n - \gamma_{max} f + \gamma_{min}f}{f - n} \\ &= \frac{\gamma_{min} f - \gamma_{max} n}{f - n} \\ \end{align*}

(159)

we have

D = \frac{\gamma_{min} f - \gamma_{max} n}{f - n}.

(160)

Assembling the constants (153), (155), (157), (160) back into (148) we have the complete formula for the function $\phi_{d}$

\phi_{d} \left( P \right) = \left( \frac{\gamma_{max} - \gamma_{min}}{f - n} \right) P_{d} + \frac{ \gamma_{min} f - \gamma_{max} n }{f - n}.

(161)

Finally, we substitute (133), (147), and (161) back into (115) to get

\begin{align*} P_{clip,h} &= \phi_{h} \left( P \right) &&= \left( \frac{ \alpha_{max} - \alpha_{min} }{ r - \left( -l \right)} \right) P_{h} + \frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)} \\ P_{clip,v} &= \phi_{v} \left( P \right) &&= \left( \frac{ \beta_{max} - \beta_{min} }{ t - \left( -b \right)} \right) P_{v} + \frac{\beta_{min} t - \beta_{max} \left( -b \right)}{t - \left( -b \right)} \\ P_{clip,d} &= \phi_{d} \left( P \right) &&= \left( \frac{\gamma_{max} - \gamma_{min}}{f - n} \right) P_{d} + \frac{ \gamma_{min} f - \gamma_{max} n }{f - n} \\ P_{clip,w} &= \phi_{w} \left( P \right) &&= 1 \\ \end{align*}

(162)

Assembling the clip space components into the resulting matrix equation, we have

\begin{bmatrix} P_{clip,h} \\ P_{clip,v} \\ P_{clip,d} \\ P_{clip,w} \\ \end{bmatrix} = \begin{bmatrix} \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} & 0 & 0 & \frac{\alpha_{min}r - \alpha_{max} \left(-l \right) }{r - \left( -l \right)} \\ 0 & \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} & 0 & \frac{\beta_{min}t - \beta_{max}\left( -b \right) }{t - \left(-b \right)} \\ 0 & 0 & \frac{\gamma_{max} - \gamma_{min} }{f - n} & \frac{\gamma_{min}f - \gamma_{max}n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} P_{h} \\ P_{v} \\ P_{d} \\ P_{w} \\ \end{bmatrix}.

(163)

This yields the second major result, the canonical orthographic projection matrix $M^{C}_{orth}$ .

Orthographic Projection Matrix

M^{C}_{orth} = \begin{bmatrix} \frac{\alpha_{max} - \alpha_{min}}{r - \left( -l \right)} & 0 & 0 & \frac{\alpha_{min}r - \alpha_{max} \left(-l \right) }{r - \left( -l \right)} \\ 0 & \frac{\beta_{max} - \beta_{min}}{t - \left( -b \right)} & 0 & \frac{\beta_{min}t - \beta_{max}\left( -b \right) }{t - \left(-b \right)} \\ 0 & 0 & \frac{\gamma_{max} - \gamma_{min} }{f - n} & \frac{\gamma_{min}f - \gamma_{max}n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(164)

This completes the derivation of the canonical orthographic projection matrix.

The Canonical Perspective Matrix¶

Now that we have the canonical perspective projection matrix and the canonical orthographic projection matrix, we can work out the canonical perspective matrix. The canonical perspective matrix maps from view coordinates to projected coordinates. We show that the perspective projection transformation consists of a perspective transformation multiplied by an orthographic projection, we can get the perspective matrix by premultiplying (108) by $(M^{C}_{orth})^{-1}$ given by

\left(M^{C}_{orth}\right)^{-1} = \begin{bmatrix} \frac{r + l}{\alpha_{max} - \alpha_{min}} & 0 & 0 & -\frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{\alpha_{max} - \alpha_{min}} \\ 0 & \frac{t + b}{\beta_{max} - \beta_{min}} & 0 & -\frac{\beta_{min} t - \beta_{max} \left( -b \right)}{\beta_{max} - \beta_{min}} \\ 0 & 0 & \frac{f - n}{\gamma_{max} - \gamma_{min}} & -\frac{\gamma_{min} f - \gamma_{max} n}{\gamma_{max} - \gamma_{min}} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(165)

Premultiplying $M^{C}_{per}$ by $(M^{C}_{orth})^{-1}$ gives

\begin{align*} M^{C}_{proj} &= \left(M^{C}_{orth}\right)^{-1} M^{C}_{per} \\ &= \begin{bmatrix} \frac{r + l}{\alpha_{max} - \alpha_{min}} & 0 & 0 & -\frac{\alpha_{min} r - \alpha_{max} \left( -l \right)}{\alpha_{max} - \alpha_{min}} \\ 0 & \frac{t + b}{\beta_{max} - \beta_{min}} & 0 & -\frac{\beta_{min} t - \beta_{max} \left( -b \right)}{\beta_{max} - \beta_{min}} \\ 0 & 0 & \frac{f - n}{\gamma_{max} - \gamma_{min}} & -\frac{\gamma_{min} f - \gamma_{max} n}{\gamma_{max} - \gamma_{min}} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{\left( \alpha_{max} - \alpha_{min} \right) n }{r - \left( -l \right)} & 0 & \frac{\alpha_{min}r - \alpha_{max} \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{\left( \beta_{max} - \beta_{min} \right) n }{t - \left( -b \right)} & \frac{\beta_{min}t - \beta_{max} \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{\gamma_{max}f - \gamma_{min}n}{f - n} & -\frac{\left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{r + l}{\alpha_{max} - \alpha_{min}} & 0 & 0 & -\frac{ \alpha_{min} r + \alpha_{max} l }{ \alpha_{max} - \alpha_{min} } \\ 0 & \frac{t + b}{\beta_{max} - \beta_{min}} & 0 & -\frac{ \beta_{min} t + \beta_{max} b }{ \beta_{max} - \beta_{min} } \\ 0 & 0 & \frac{ f - n }{ \gamma_{max} - \gamma_{min} } & -\frac{ \gamma_{min} f - \gamma_{max} n }{ \gamma_{max} - \gamma_{min} } \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ \left( \alpha_{max} - \alpha_{min} \right) n }{ r + l } & 0 & \frac{ \alpha_{min} r + \alpha_{max} l }{ r + l } & 0 \\ 0 & \frac{ \left( \beta_{max} - \beta_{min} \right) n }{ t + b } & \frac{ \beta_{min} t + \beta_{max} b }{ t + b } & 0 \\ 0 & 0 & \frac{ \gamma_{max} f - \gamma_{min} n }{ f - n } & -\frac{ \left( \gamma_{max} - \gamma_{min} \right) f n }{ f - n } \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} n & 0 & 0 & 0 \\ 0 & n & 0 & 0 \\ 0 & 0 & f + n & -f n \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(166)

which yields the third major result, the perspective matrix.

This shows that the perspective projection matrix $M^{C}_{per}$ is indeed the product of a projective transformation and an orthographic transformation

M^{C}_{per} = M^{C}_{orth} M^{C}_{proj}.

(168)

Notice that the perspective matrix passes along the input depth coordinate undistorted via the $w$ component, but nonlinearly distorts the output depth component. Finally, recall the equation (48) for the perspective projection matrix where we deduced that the depth component was an affine transformation. We see from the (167) that $\theta(P_{d}) = A^{\prime} P_{d} + B^{\prime} = (f + n) P_{d} - f n$ so that the projected coordinate components from (48) become

\begin{align*} P_{proj,h} &= n P_{h} \\ P_{proj,v} &= n P_{v} \\ P_{proj,d} &= \left ( f + n \right) P_{d} - f n \\ P_{proj,w} &= P_{d} \\ \end{align*}

(169)

which completes the derivation of the unknown constants $A^{\prime}$ and $B^{\prime}$ as promised. This completes the derivation of the perspective matrix.

The Canonical Symmetric Vertical Field Of View Perspective Projection Matrix¶

In the symmetric case, $r = l$ and $t = b$ . The width of the viewport is $\text{width} = r - (-l) = r + l$ . The height of the viewport is $\text{height} = t - (-b) = t + b$ . The aspect ratio, denoted $\text{aspect}$ , is given by

\text{aspect} \equiv \frac{\text{width}}{\text{height}} = \frac{r - \left( -l \right)}{t - \left( -b \right)} = \frac{r + l}{t + b} = \frac{r + r}{t + t} = \frac{2 r}{2 t} = \frac{r}{t}

(170)

which implies that $r = \text{aspect} \cdot t$ . Since the view volume is symmetric, the tangent of $\theta_{vfov} / 2$ is $\tan\left( \theta_{vfov} / 2 \right) = t / n$ . From these facts we see that

t = b = n \tan\left( \frac{\theta_{vfov}}{2} \right) \\ r = l = \text{aspect} \cdot t = \text{aspect} \cdot n \tan\left( \frac{\theta_{vfov}}{2} \right)

(171)

We now use (171) inside the general perspective projection matrix (108) to derive the symmetric vertical field of view perspective matrix. Consider the elements of the matrix $M^{C}_{per}$ . The nonzero elements of $M^{C}_{per}$ become

\begin{align*} M^{C}_{per}\left[0, 0 \right] = \frac{\left( \alpha_{max} - \alpha_{min} \right) n}{r - \left( -l \right)} &= \frac{\left( \alpha_{max} - \alpha_{min} \right) n}{r + l} \\ &= \frac{\left( \alpha_{max} - \alpha_{min} \right) n}{2 r} \\ &= \left( \frac{\alpha_{max} - \alpha_{min}}{2} \right) \frac{n}{r} \\ &= \left( \frac{\alpha_{max} - \alpha_{min}}{2} \right) \frac{n}{\text{aspect} \cdot n \tan\left( \frac{\theta_{vfov}}{2} \right)} \\ &= \left( \frac{\alpha_{max} - \alpha_{min}}{2} \right) \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} \\ M^{C}_{per}\left[1, 1 \right] = \frac{\left( \beta_{max} - \beta_{min} \right) n}{t - \left( -b \right)} &= \frac{\left( \beta_{max} - \beta_{min} \right) n}{t + b} \\ &= \frac{\left( \beta_{max} - \beta_{min} \right) n}{2 t} \\ &= \left( \frac{\beta_{max} - \beta_{min}}{2} \right) \frac{n}{t} \\ &= \left( \frac{\beta_{max} - \beta_{min}}{2} \right) \frac{n}{n \tan\left( \frac{\theta_{vfov}}{2} \right)} \\ &= \left( \frac{\beta_{max} - \beta_{min}}{2} \right) \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} \\ M^{C}_{per}\left[0, 2 \right] = \frac{\alpha_{min} r - \alpha_{max}\left( -l \right)}{r - \left( -l \right)} &= \frac{\alpha_{min} r + \alpha_{max} l}{r + l} \\ &= \frac{\alpha_{min} r + \alpha_{max} r}{2 r} \\ &= \frac{\alpha_{min} + \alpha_{max}}{2} \\ M^{C}_{per}\left[1, 2 \right] = \frac{\beta_{min} t - \beta_{max}\left( -b \right)}{t - \left( -b \right)} &= \frac{\beta_{min} t + \beta_{max} b}{t + b} \\ &= \frac{\beta_{min} t + \beta_{max} t}{2 t} \\ &= \frac{\beta_{min} + \beta_{max}}{2} \\ M^{C}_{per}\left[2, 2 \right] &= \frac{\gamma_{max} f - \gamma_{min} n}{f - n} \\ M^{C}_{per}\left[2, 3 \right] &= -\frac{\left(\gamma_{max} - \gamma_{min} \right) f n}{f - n} \end{align*}

(172)

Substituting (172) back into (108) the matrix $M^{C}_{per}$ takes the form

M^{C}_{per,vfov} = \begin{bmatrix} \left( \frac{\alpha_{max} - \alpha_{min}}{2} \right) \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & \frac{\alpha_{min} + \alpha_{max}}{2} & 0 \\ 0 & \left( \frac{\beta_{max} - \beta_{min}}{2} \right) \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & \frac{\beta_{min} + \beta_{max}}{2} & 0 \\ 0 & 0 & \frac{\gamma_{max}f - \gamma_{min}n}{f - n} & -\frac{\left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}

(173)

which is the desired formula. This is the fourth major result, the canonical symmetric vertical field of view perspective projection matrix.

Symmetric Vertical Field Of View Perspective Matrix

Let $\mathcal{F}_{can} = (\tilde{O}_{can}, \mathcal{B}_{can})$ be a canonical coordinate frame on $\mathbb{E}^{3}$ , where $\mathcal{B}_{can} = (\mathbf{\hat{u}}_{h}, \mathbf{\hat{u}}_v, \mathbf{\hat{u}}_{d})$ is the basis for the frame. Let the perspective view volume be parametrized by the vertical field of view angle $\theta_{vfov}$ , the near plane $n$ , the far plane $f$ and the aspect ratio $\text{aspect}$ such that $n > 0$ , $f > 0$ , $0 < \theta_{vfov} < \pi$ , and $\text{aspect} > 0$ . The canonical symmetric vertical field of view perspective projection matrix is given by

M^{C}_{per, vfov} = \begin{bmatrix} \left( \frac{\alpha_{max} - \alpha_{min}}{2} \right) \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & \frac{\alpha_{min} + \alpha_{max}}{2} & 0 \\ 0 & \left( \frac{\beta_{max} - \beta_{min}}{2} \right) \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & \frac{\beta_{min} + \beta_{max}}{2} & 0 \\ 0 & 0 & \frac{\gamma_{max}f - \gamma_{min}n}{f - n} & -\frac{\left( \gamma_{max} - \gamma_{min} \right) f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(174)

How Matrices Transform Under Changes In Coordinates¶

For any given view coordinate system and clip coordinate system, there exists a perspective projection matrix that can be expressed as $M_{per} = T^{-1}_{clip} M^{C}_{per} T_{view}$ where $M^{C}_{per}$ is the canonical perspective projection matrix. Let $[Q]$ be a homogeneous point in $\mathbb{RP}^3$ . Let $Q_{view}$ be a representive of $[Q]$ in view coordinates, $Q_{clip}$ be a representative of $[Q]$ in clip coordinates, $Q^{C}_{view}$ be a representive of $[Q]$ in the canonically chosen view coordinate system, $Q^{C}_{clip}$ be a representive of $[Q]$ in the canonically chosen clip coordinates, such that

Q^{C}_{clip} = M^{C}_{per} Q^{C}_{view}.

(175)

That is, $Q^{C}_{clip}$ represents the perspective projected point $Q$ . Let $T_{view} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ denote the coordinate transformation from view coordinates to the canonical view coordinates. Let $T_{clip} : \mathbb{RP}^{3} \rightarrow \mathbb{RP}^{3}$ denote the coordinate transformation from clip coordinates to canonical clip coordinates. The coordinate relations are given by $Q^{C}_{view} = T_{view} Q^{C}_{view}$ and $Q^{C}_{clip} = T_{clip} Q^{C}_{clip}$ . Applying these relationships in (175)

T_{clip} Q_{clip} = M^{C}_{per} T_{view} Q_{view}

(176)

or equivalently

Q_{clip} = T^{-1}_{clip} M^{C}_{per} T_{view} Q_{view} = \left( T^{-1}_{clip} M^{C}_{per} T_{view} \right) Q_{view}

(177)

therefore we identify

M_{per} \equiv T^{-1}_{clip} M^{C}_{per} T_{view}.

(178)

This proves the existence of a perspective projection matrix for any source view coordinate system, and any target clip coordinate system. To show uniqueness, suppose that $M^{\prime}_{per}$ is another transformation that maps view coordinates to clip coordinates, such that $Q_{clip} = M^{\prime}_{per} Q_{view}$ . Using the coordinate transformations again, we can write $Q^{C}_{clip} = T_{clip} Q_{clip}$ and $Q^{C}_{view} = T_{view} Q_{view}$ . Inverting these relations, we get $Q^{C}_{view} = T^{-1}_{view} Q_{view}$ and $Q^{C}_{clip} = T^{-1}_{clip} Q_{clip}$ . Using the relation for $M^{\prime}_{per}$ in equation (178) implies

T^{-1}_{clip} Q^{C}_{clip} = M^{\prime}_{per} T^{-1}_{view} Q^{C}_{view}

(179)

or equivalently

Q^{C}_{clip} = T_{clip} M^{\prime}_{per} T^{-1}_{view} Q^{C}_{view} = \left( T_{clip} M^{\prime}_{per} T^{-1}_{view} \right) Q^{C}_{view}.

(180)

Applying the relation for the canonical perspective projection matrix

Q^{C}_{clip} = M^{C}_{per} Q^{C}_{view} = \left( T_{clip} M^{\prime}_{per} T^{-1}_{view} \right) Q^{C}_{view}

(181)

which imples that

M^{C}_{per} = T_{clip} M^{\prime}_{per} T^{-1}_{view}

(182)

or equivalently

M^{\prime}_{per} = T^{-1}_{clip} M^{C}_{per} T_{view} = M_{per}

(183)

where the last equality in (183) follows from (178). This proves uniqueness for the perspective projection matrix.

To show that any perspective projection is well-defined, let $[Q] \in \mathbb{RP}^{3}$ be a point and let $Q_{1} \sim Q_{2}$ be representatives of $[Q]$ . Then there exists $\lambda \in \mathbb{R} - \{ 0 \}$ such that $Q_{2} = \lambda Q_{1}$ . By linearity of the canonical perspective projection, $\lambda M^{C}_{per} Q_{1} = M^{C}_{per} ( \lambda Q_{1} ) = M^{C}_{per} Q_{2}$ which shows that $M^{C}_{per}$ is well-defined. By the linearity of homogeneous orthogonal transformations

\begin{align*} \lambda M_{per} Q_{1} &= \lambda T^{-1}_{clip} M^{C}_{per} T_{view} Q_{1} \\ &= \left( T^{-1}_{clip} M^{C}_{per} T_{view} \right) \left( \lambda Q_{1} \right) \\ &= \left( T^{-1}_{clip} M^{C}_{per} T_{view} \right) Q_{2} \\ &= M_{per} Q_{2} \\ &= M_{per} \left( \lambda Q_{1} \right) \\ \end{align*}

(184)

which shows that $M_{per}$ is well-defined. The same argument shows that we can ge the same result for any projective matrix represented in canonical coordinates, leading to our formulas for constructing a perspective matrix, perspective projection matrix, or orthographic projection matrix from any source coordinate system to any target coordinate system which are the formulas we will use to compute the matrices for specific platforms.

Projection Matrices From View Coordinates To Clip Coordinates

Let $\mathcal{F}_{can} = (\tilde{O}_{can}, \mathcal{B}_{can})$ be a canonical coordinate frame on $\mathbb{E}^{3}$ . Let $\mathcal{F}_{view} = (\tilde{O}_{view}, \mathcal{B}_{view})$ be the orthonormal frame for the view coordinate system. Let $\mathcal{F}_{clip} = (\tilde{O}_{clip} \mathcal{B}_{clip})$ be the orthonormal frame for the clip coordinate system. Let $T_{view} : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ be the coordinate transformation from the view coordinate frame $\mathcal{F}_{view}$ to the canonical coordinate frame $\mathbb{F}_{can}$ . Let $T_{clip} : \mathbb{R}^{3} \rightarrow \mathbb{R}^{3}$ be the coordinate transformation from the clip space frame $\mathcal{F}_{clip}$ to the canonical coordinate frame $\mathcal{F}_{can}$ .

Let $M^{C}_{per}$ be the canonical perspective projection matrix for $\mathcal{F}_{can}$ . The perspective projection matrix is given by

M_{per} \equiv T^{-1}_{clip} M^{C}_{per} T_{view}.

(185)

Let $M^{C}_{orth}$ be the canonical orthographic projection matrix for $\mathcal{F}_{can}$ . The orthographic projection matrix is given by

M_{orth} \equiv T^{-1}_{clip} M^{C}_{orth} T_{view}.

(186)

Let $M^{C}_{proj}$ be the canonical perspective matrix for $\mathcal{F}_{can}$ . The perspective matrix is given by

M_{proj} \equiv T^{-1}_{clip} M^{C}_{proj} T_{view}.

(187)

Projection Matrices For Specific Software Platforms¶

We apply the results of the previous sections to calculate the projection matrices for specific software platforms.

OpenGL¶

We compute the perspective and orthographic transformation matrices for OpenGL.

The Canonical Projection Matrices¶

The canonical view coordinate system for OpenGL is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ , where $\mathbf{\hat{z}}$ points into the view volume. The canonical view volume in normalized device coordinates is parametrized by $[-1, 1] \times [-1, 1] \times [-1, 1]$ . The canonical perspective projection matrix for this parametrization is given by

M^{C, OpenGL}_{per} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(188)

The canonical orthographic projection matrix is given by

M^{C, OpenGL}_{orth} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(189)

The canonical symmetric vertical field of view perspective projection matrix is given by

M^{C,OpenGL}_{per,vfov} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(190)

This finishes the statement of the canonical matrices for OpenGL.

Right-Handed View Space¶

The right-handed view coordinate system for OpenGL is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points out of the view volume towards the viewer. This is a right-handed coordinate system. The clip coordinate system for OpenGL is the canonical left-handed one. The orthogonal transformations are given by

Q^{OpenGL}_{lh} = Q^{OpenGL}_{rh} = I

(191)

where $I$ is the identity matrix. The change of orientation matrices are given by

\Omega_{rh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} , \hspace{4 pt} \Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(192)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{OpenGL}_{per,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} M^{C,OpenGL}_{per} \Omega_{rh \rightarrow lh} Q^{OpenGL}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) M^{C,OpenGL}_{per} \left( \Omega_{rh \rightarrow lh} Q^{OpenGL}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,OpenGL}_{per} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,OpenGL}_{per} \Omega_{rh \rightarrow lh} \\ &= I M^{C,OpenGL}_{per} \Omega_{rh \rightarrow lh} \\ &= M^{C,OpenGL}_{per} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(193)

therefore

M^{OpenGL}_{per,rh \rightarrow lh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(194)

Here is the calculation for the orthographic matrix

\begin{align*} M^{OpenGL}_{orth,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} M^{C, OpenGL}_{orth} \Omega_{rh \rightarrow lh} Q^{OpenGL}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) M^{C, OpenGL}_{orth} \left( \Omega_{rh \rightarrow lh} Q^{OpenGL}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C, OpenGL}_{orth} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C, OpenGL}_{orth} \Omega_{rh \rightarrow lh} \\ &= I M^{C, OpenGL}_{orth} \Omega_{rh \rightarrow lh} \\ &= M^{C, OpenGL}_{orth} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \end{align*}

(195)

therefore

M^{OpenGL}_{orth,rh \rightarrow lh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(196)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{OpenGL}_{per,vfov,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} M^{C,OpenGL}_{per,vfov} \Omega_{rh \rightarrow lh} Q^{OpenGL}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) M^{C,OpenGL}_{per,vfov} \left( \Omega_{rh \rightarrow lh} Q^{OpenGL}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,OpenGL}_{per,vfov} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,OpenGL}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= I M^{C,OpenGL}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= M^{C,OpenGL}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(197)

therefore

M^{OpenGL}_{per,vfov,rh \rightarrow lh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(198)

This completes the calculation of the matrices for the right-handed OpenGL view coordinates.

OpenGL Right-Handed Projection Matrix Box

View Coordinate Frame (Right-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points to the right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points out of the view volume towards the viewer
Clip Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [-1, 1]$

\begin{align*} M^{OpenGL}_{per,rh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ M^{OpenGL}_{per,rh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ M^{OpenGL}_{per,vfov,rh \rightarrow lh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ \end{align*}

(199)

Left-Handed View Space¶

The left-handed view space coordinate system for OpenGL is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points into the view volume away from the viewer. This is a left-handed coordinate system. The clip coordinate system is the canonical left-handed one. The orthogonal transformation is given by

Q^{OpenGL}_{lh} = I

(200)

where $I$ is the identity matrix. The change of orientation matrix are given by

\Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(201)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{OpenGL}_{per,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} M^{C,OpenGL}_{per} \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) M^{C,OpenGL}_{per} \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,OpenGL}_{per} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,OpenGL}_{per} \Omega_{lh \rightarrow lh} \\ &= I M^{C,OpenGL}_{per} I \\ &= M^{C,OpenGL}_{per} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(202)

therefore

M^{OpenGL}_{per,lh \rightarrow lh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(203)

Here is the calculation for the orthographic matrix

\begin{align*} M^{OpenGL}_{orth,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} M^{C, OpenGL}_{orth} \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) M^{C, OpenGL}_{orth} \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C, OpenGL}_{orth} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C, OpenGL}_{orth} \Omega_{lh \rightarrow lh} \\ &= I M^{C, OpenGL}_{orth} I \\ &= M^{C, OpenGL}_{orth} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \end{align*}

(204)

therefore

M^{OpenGL}_{orth,lh \rightarrow lh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(205)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{OpenGL}_{per,vfov,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} M^{C,OpenGL}_{per,vfov} \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) M^{C,OpenGL}_{per,vfov} \left( \Omega_{lh \rightarrow lh} Q^{OpenGL}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,OpenGL}_{per,vfov} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,OpenGL}_{per,vfov} \Omega_{lh \rightarrow lh} \\ &= I M^{C,OpenGL}_{per,vfov} I \\ &= M^{C,OpenGL}_{per,vfov} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(206)

therefore

M^{OpenGL}_{per,vfov,lh \rightarrow lh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(207)

This completes the computation of the matrices for the left-handed OpenGL view coordinates.

OpenGL Left-Handed Projection Matrix Box

View Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Clip Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [-1, 1]$

\begin{align*} M^{OpenGL}_{per,lh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ M^{OpenGL}_{orth,lh \rightarrow lh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{2}{f - n} & -\frac{f + n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ M^{OpenGL}_{per,vfov,lh \rightarrow lh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f + n}{f - n} & -\frac{ 2 f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(208)

Comparing Coordinate System Orientations¶

Let us illustrate why the coordinate systems in this section have a left-handed orientation or right-handed orientation. By convention, the standard basis on $\mathbb{R}^{3}$ has a right-handed orientation. Recall that the standard basis on $\mathbb{R}^{3}$ is the tuple of vectors $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , and $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ are the standard basis vectors. In this basis, OpenGL’s left-handed view coordinates and clip coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & \mathbf{\hat{y}} & -\mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \\ \end{bmatrix} = 1 \cdot 1 \cdot -1 = -1

(209)

and OpenGL’s right-handed view coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & \mathbf{\hat{y}} & \mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ \end{bmatrix} = 1 \cdot 1 \cdot 1 = 1

(210)

so the right-handed OpenGL view coordinate system indeed has a right-handed orientation, and the left-handed view coordinate system and clip coordinate system have a left-handed orientation.

DirectX¶

We compute the perspective and orthographic transformation matrices for DirectX.

The Canonical Matrices¶

The canonical view space coordinate system for DirectX is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ , where $\mathbf{\hat{z}}$ points into the view volume. The canonical view volume in normalized device coordinates is parametrized by $[-1, 1] \times [-1, 1] \times [0, 1]$ . The canonical perspective projection matrix for this parametrization is given by

M^{C, DirectX}_{per} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(211)

The canonical orthographic projection matrix for DirectX is given by

M^{C, DirectX}_{orth} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(212)

The canonical symmetric vertical field of view perspective projection matrix for DirectX is given by

M^{C,DirectX}_{per,vfov} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(213)

This finishes the statement of the canonical matrices for DirectX.

Right-Handed View Space¶

The right-handed view coordinate system for DirectX is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points out of the view volume towards the viewer. This is a right-handed coordinate system. The clip coordinate system for DirectX is the canonical left-handed one. The orthogonal transformations are given by

Q^{DirectX}_{lh} = Q^{DirectX}_{rh} = I

(214)

where $I$ is the identity matrix. The change of orientation matrices are given by

\Omega_{rh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} , \hspace{4 pt} \Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(215)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{DirectX}_{per,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} M^{C,DirectX}_{per} \Omega_{rh \rightarrow lh} Q^{DirectX}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) M^{C,DirectX}_{per} \left( \Omega_{rh \rightarrow lh} Q^{DirectX}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,DirectX}_{per} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,DirectX}_{per} \Omega_{rh \rightarrow lh} \\ &= I M^{C,DirectX}_{per} \Omega_{rh \rightarrow lh} \\ &= M^{C,DirectX}_{per} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(216)

therefore

M^{DirectX}_{per,rh \rightarrow lh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(217)

Here is the calculation for the orthographic matrix

\begin{align*} M^{DirectX}_{orth,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} M^{C, DirectX}_{orth} \Omega_{rh \rightarrow lh} Q^{DirectX}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) M^{C, DirectX}_{orth} \left( \Omega_{rh \rightarrow lh} Q^{DirectX}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C, DirectX}_{orth} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C, DirectX}_{orth} \Omega_{rh \rightarrow lh} \\ &= I M^{C, DirectX}_{orth} \Omega_{rh \rightarrow lh} \\ &= M^{C, DirectX}_{orth} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \end{align*}

(218)

therefore

M^{DirectX}_{orth,rh \rightarrow lh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(219)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{DirectX}_{per,vfov,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} M^{C,DirectX}_{per,vfov} \Omega_{rh \rightarrow lh} Q^{DirectX}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) M^{C,DirectX}_{per,vfov} \left( \Omega_{rh \rightarrow lh} Q^{DirectX}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,DirectX}_{per,vfov} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,DirectX}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= I M^{C,DirectX}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= M^{C,DirectX}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(220)

therefore

M^{DirectX}_{per,vfov,rh \rightarrow lh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(221)

This completes the computation of the matrices for the right-handed DirectX view coordinates.

DirectX Right-Handed Projection Matrix Box

View Coordinate Frame (Right-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points to the right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points out of the view volume towards the viewer
Clip Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [0, 1]$

\begin{align*} M^{DirectX}_{per,rh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ M^{DirectX}_{orth,rh \rightarrow lh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ M^{DirectX}_{per,vfov,rh \rightarrow lh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ \end{align*}

(222)

Left-Handed View Space¶

The left-handed view coordinate system for OpenGL is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points into the view volume away from the viewer. This is a right-handed coordinate system. The clip coordinate system is the canonical left-handed one. The orthogonal transformation is given by

Q^{DirectX}_{lh} = I

(223)

where $I$ is the identity matrix. The change of orientation matrix are given by

\Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(224)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{DirectX}_{per,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} M^{C,DirectX}_{per} \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) M^{C,DirectX}_{per} \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,DirectX}_{per} \left( \Omega_{rl \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,DirectX}_{per} \Omega_{lh \rightarrow lh} \\ &= I M^{C,DirectX}_{per} I \\ &= M^{C,DirectX}_{per} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(225)

therefore

M^{DirectX}_{per,lh \rightarrow lh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(226)

Here is the calculation for the orthographic matrix

\begin{align*} M^{DirectX}_{orth,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} M^{C, DirectX}_{orth} \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) M^{C, DirectX}_{orth} \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C, DirectX}_{orth} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C, DirectX}_{orth} \Omega_{lh \rightarrow lh} \\ &= I M^{C, DirectX}_{orth} I \\ &= M^{C, DirectX}_{orth} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \end{align*}

(227)

therefore

M^{DirectX}_{orth,lh \rightarrow lh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(228)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{DirectX}_{per,vfov,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} M^{C,DirectX}_{per,vfov} \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) M^{C,DirectX}_{per,vfov} \left( \Omega_{lh \rightarrow lh} Q^{DirectX}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,DirectX}_{per,vfov} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,DirectX}_{per,vfov} \Omega_{lh \rightarrow lh} \\ &= I M^{C,DirectX}_{per,vfov} I \\ &= M^{C,DirectX}_{per,vfov} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(229)

therefore

M^{DirectX}_{per,vfov,lh \rightarrow lh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(230)

This completes the computation of the matrices for the left-handed DirectX view coordinates.

DirectX Left-Handed Projection Matrix Box

View Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Clip Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [0, 1]$

\begin{align*} M^{DirectX}_{per,lh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ M^{DirectX}_{orth,lh \rightarrow lh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ M^{DirectX}_{per,vfov,lh \rightarrow lh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ \end{align*}

(231)

Comparing Coordinate System Orientations¶

Let us illustrate why the coordinate systems in this section have a left-handed orientation or right-handed orientation. By convention, the standard basis on $\mathbb{R}^{3}$ has a right-handed orientation. Recall that the standard basis on $\mathbb{R}^{3}$ is the tuple of vectors $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , and $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ are the standard basis vectors. In this basis, DirectX’s left-handed view coordinates and clip coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & \mathbf{\hat{y}} & -\mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \\ \end{bmatrix} = 1 \cdot 1 \cdot -1 = -1

(232)

and DirectX’s right-handed view coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & \mathbf{\hat{y}} & \mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ \end{bmatrix} = 1 \cdot 1 \cdot 1 = 1

(233)

so the right-handed DirectX view coordinate system indeed has a right-handed orientation, and the left-handed view coordinate system and clip coordinate system have a left-handed orientation.

Metal¶

We compute the perspective and orthographic transformation matrices for Metal.

The Canonical Matrices¶

The canonical view space coordinate system for Metal is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ , where $\mathbf{\hat{z}}$ points into the view volume. The canonical view volume for for Metal in normalized device coordinates is parametrized by $[-1, 1] \times [-1, 1] \times [0, 1]$ . The canonical perspective projection matrix for Metal is given by

M^{C, Metal}_{per} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} .

(234)

The canonical orthographic projection matrix for Metal is given by

M^{C, Metal}_{orth} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} .

(235)

The canonical symmetric vertical field of view perspective projection matrix for Metal is given by

M^{C,Metal}_{per,vfov} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(236)

This finishes the statement of the canonical matrices for Metal.

Right-Handed View Space¶

The right-handed view space coordinate system for Metal is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points out of the view volume towards the viewer. This is a right-handed coordinate system. The clip coordinate system for Metal is the canonical left-handed one. The orthogonal transformations are given by

Q^{Metal}_{lh} = Q^{Metal}_{rh} = I

(237)

where $I$ is the identity matrix. The change of orientation matrices are given by

\Omega_{rh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} , \hspace{4 pt} \Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(238)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{Metal}_{per,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} M^{C,Metal}_{per} \Omega_{rh \rightarrow lh} Q^{Metal}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) M^{C,Metal}_{per} \left( \Omega_{rh \rightarrow lh} Q^{Metal}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,Metal}_{per} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,Metal}_{per} \Omega_{rh \rightarrow lh} \\ &= I M^{C,Metal}_{per} \Omega_{rh \rightarrow lh} \\ &= M^{C,Metal}_{per} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \end{bmatrix} \end{align*}

(239)

therefore

M^{Metal}_{per,rh \rightarrow lh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \end{bmatrix}.

(240)

Here is the calculation for the orthographic matrix

\begin{align*} M^{Metal}_{orth,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} M^{C, Metal}_{orth} \Omega_{rh \rightarrow lh} Q^{Metal}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) M^{C, Metal}_{orth} \left( \Omega_{rh \rightarrow lh} Q^{Metal}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C, Metal}_{orth} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C, Metal}_{orth} \Omega_{rh \rightarrow lh} \\ &= I M^{C, Metal}_{orth} \Omega_{rh \rightarrow lh} \\ &= M^{C, Metal}_{orth} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \end{bmatrix} \end{align*}

(241)

therefore

M^{Metal}_{orth,rh \rightarrow lh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \end{bmatrix}.

(242)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{Metal}_{per,vfov,rh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} M^{C,Metal}_{per,vfov} \Omega_{rh \rightarrow lh} Q^{Metal}_{rh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) M^{C,Metal}_{per,vfov} \left( \Omega_{rh \rightarrow lh} Q^{Metal}_{rh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,Metal}_{per,vfov} \left( \Omega_{rh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,Metal}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= I M^{C,Metal}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= M^{C,Metal}_{per,vfov} \Omega_{rh \rightarrow lh} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(243)

therefore

M^{Metal}_{per,vfov,rh \rightarrow lh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(244)

This completes the computation of the matrices for the right-handed Metal view coordinates.

Metal Right-Handed Projection Matrix Box

View Coordinate Frame (Right-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points to the right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points out of the view volume towards the viewer
Clip Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [0, 1]$

\begin{align*} M^{Metal}_{per,rh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \end{bmatrix} \\ M^{Metal}_{orth,rh \rightarrow lh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \end{bmatrix} \\ M^{Metal}_{per,vfov,rh \rightarrow lh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ \end{align*}

(245)

Left-Handed View Space¶

The left-handed view space coordinate system for Metal is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points into the view volume away from the viewer. This is a right-handed coordinate system. The clip coordinate system is the canonical left-handed one. The orthogonal transformation is given by

Q^{Metal}_{lh} = I

(246)

where $I$ is the identity matrix. The change of orientation matrix are given by

\Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(247)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{Metal}_{per,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} M^{C,Metal}_{per} \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) M^{C,Metal}_{per} \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,Metal}_{per} \left( \Omega_{rl \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,Metal}_{per} \Omega_{lh \rightarrow lh} \\ &= I M^{C,Metal}_{per} I \\ &= M^{C,Metal}_{per} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \end{bmatrix} \end{align*}

(248)

therefore

M^{Metal}_{per,lh \rightarrow lh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \end{bmatrix}.

(249)

Here is the calculation for the orthographic matrix

\begin{align*} M^{Metal}_{orth,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} M^{C, Metal}_{orth} \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) M^{C, Metal}_{orth} \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C, Metal}_{orth} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C, Metal}_{orth} \Omega_{lh \rightarrow lh} \\ &= I M^{C, Metal}_{orth} I \\ &= M^{C, Metal}_{orth} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \end{bmatrix} \end{align*}

(250)

therefore

M^{Metal}_{orth,lh \rightarrow lh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \end{bmatrix}.

(251)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{Metal}_{per,vfov,lh \rightarrow lh} &= \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} M^{C,Metal}_{per,vfov} \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \\ &= \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) M^{C,Metal}_{per,vfov} \left( \Omega_{lh \rightarrow lh} Q^{Metal}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow lh} I \right) M^{C,Metal}_{per,vfov} \left( \Omega_{lh \rightarrow lh} I \right) \\ &= \Omega_{lh \rightarrow lh} M^{C,Metal}_{per,vfov} \Omega_{lh \rightarrow lh} \\ &= I M^{C,Metal}_{per,vfov} I \\ &= M^{C,Metal}_{per,vfov} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \end{bmatrix} \end{align*}

(252)

therefore

M^{Metal}_{per,vfov,lh \rightarrow lh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \end{bmatrix}.

(253)

This completes the computation of the matrices for the left-handed Metal view coordinates.

Metal Left-Handed Projection Matrix Box

View Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Clip Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $\mathbf{\hat{y}}$ points up, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [0, 1]$

\begin{align*} M^{Metal}_{per,lh \rightarrow lh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ M^{Metal}_{orth,lh \rightarrow lh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ M^{Metal}_{per,vfov,lh \rightarrow lh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ \end{align*}

(254)

Comparing Coordinate System Orientations¶

Let us illustrate why the coordinate systems in this section have a left-handed orientation or right-handed orientation. By convention, the standard basis on $\mathbb{R}^{3}$ has a right-handed orientation. Recall that the standard basis on $\mathbb{R}^{3}$ is the tuple of vectors $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , and $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ are the standard basis vectors. In this basis, Metal’s left-handed view coordinates and clip coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & \mathbf{\hat{y}} & -\mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \\ \end{bmatrix} = 1 \cdot 1 \cdot -1 = -1

(255)

and Metal’s right-handed view coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & \mathbf{\hat{y}} & \mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ \end{bmatrix} = 1 \cdot 1 \cdot 1 = 1

(256)

so the right-handed Metal view coordinate system indeed has a right-handed orientation, and the left-handed view coordinate system and clip coordinate system have a left-handed orientation.

Vulkan¶

We compute the perspective and orthographic transformation matrices for Vulkan.

The Canonical Matrices¶

The canonical view coordinate system for Vulkan is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ , where $\mathbf{\hat{z}}$ points into the view volume. The canonical view volume for for Vulkan in normalized device coordinates is parametrized by $[-1, 1] \times [-1, 1] \times [0, 1]$ . The canonical perspective projection matrix for Vulkan is given by

M^{C, Vulkan}_{per} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(257)

The canonical orthographic projection matrix for Vulkan is given by

M^{C, Vulkan}_{orth} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(258)

The canonical symmetric vertical field of view perspective projection matrix for Vulkan is given by

M^{C, Vulkan}_{per,vfov} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}

(259)

This finishes the statement of the canonical matrices for Vulkan.

Right-Handed View Space¶

The right-handed view coordinate system for Vulkan is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, -\mathbf{\hat{y}}, \mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{z}}$ points into the view volume away from the viewer. Notice that the vertical axis points down in this frame. This is a right-handed coordinate system. The clip coordinate system for Vulkan is the same as the right-handed view coordinate system. A homogeneous rotation about the x-axis is defined as

R_{x}\left( \theta \right) = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\left( \theta \right) & -\sin\left( \theta \right) & 0 \\ 0 & \sin\left( \theta \right) & \cos\left( \theta \right) & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(260)

The orthogonal transformations for the right-handed Vulkan view coordinates are given by

Q^{Vulkan}_{rh} = R_{x}\left( \pi \right) = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\left( \pi \right) & -\sin\left( \pi \right) & 0 \\ 0 & \sin\left( \pi \right) & \cos\left( \pi \right) & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ Q^{Vulkan}_{lh} = R_{x}\left( -\pi \right) = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\left( -\pi \right) & -\sin\left( -\pi \right) & 0 \\ 0 & \sin\left( -\pi \right) & \cos\left( -\pi \right) & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(261)

The change of orientation matrices are given by

\Omega_{rh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} , \hspace{4 pt} \Omega_{lh \rightarrow rh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(262)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{Vulkan}_{per, rh \rightarrow rh} &= \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} M^{C,Vulkan}_{per} \Omega_{rh \rightarrow lh} Q^{Vulkan}_{rh} \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per} \left( \Omega_{rh \rightarrow lh} Q^{Vulkan}_{rh} \right) \\ &= \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) M^{C,Vulkan}_{per} \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} M^{C,Vulkan}_{per} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & -\frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r + l} & 0 & -\frac{r - l}{r + l} & 0 \\ 0 & -\frac{ 2 n }{t + b} & -\frac{t - b}{t + b} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r + l} & 0 & -\frac{r - l}{r + l} & 0 \\ 0 & -\frac{ 2 n }{b + t} & \frac{b - t}{b + t} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r + l} & 0 & -\frac{r - l}{r + l} & 0 \\ 0 & \frac{ 2 n }{b + t} & -\frac{b - t}{b + t} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{b - \left( -t \right)} & -\frac{b + \left( -t \right)}{b - \left( -t \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(263)

therefore

M^{Vulkan}_{per, rh \rightarrow rh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{b - \left( -t \right)} & -\frac{b + \left( -t \right)}{b - \left( -t \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(264)

Here is the calculation for the orthographic matrix

\begin{align*} M^{Vulkan}_{orth, rh \rightarrow rh} &= \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} M^{C,Vulkan}_{orth} \Omega_{rh \rightarrow lh} Q^{Vulkan}_{rh} \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{orth} \left( \Omega_{rh \rightarrow lh} Q^{Vulkan}_{rh} \right) \\ &= \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) M^{C,Vulkan}_{orth} \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} M^{C,Vulkan}_{orth} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & -\frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r + l} & 0 & 0 & -\frac{r - l}{r + l} \\ 0 & -\frac{2}{t + b} & 0 & -\frac{t - b}{t + b} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r + l} & 0 & 0 & -\frac{r - l}{r + l} \\ 0 & -\frac{2}{b + t} & 0 & \frac{b - t}{b + t} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r + l} & 0 & 0 & -\frac{r - l}{r + l} \\ 0 & \frac{2}{b + t} & 0 & -\frac{b - t}{b + t} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{b - \left( -t \right)} & 0 & -\frac{b + \left( -t \right)}{b - \left( -t \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \end{align*}

(265)

therefore

M^{Vulkan}_{orth, rh \rightarrow rh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{b - \left( -t \right)} & 0 & -\frac{b + \left( -t \right)}{b - \left( -t \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(266)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{Vulkan}_{per,vfov,rh \rightarrow rh} &= \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} M^{C,Vulkan}_{per,vfov} \Omega_{rh \rightarrow lh} Q^{Vulkan}_{rh} \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per,vfov} \left( \Omega_{rh \rightarrow lh} Q^{Vulkan}_{rh} \right) \\ &= \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) M^{C,Vulkan}_{per,vfov} \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} M^{C,Vulkan}_{per,vfov} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & -\frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \end{align*}

(267)

therefore

M^{Vulkan}_{per,vfov,rh \rightarrow rh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}.

(268)

This completes the computation of the matrices for the right-handed Vulkan view coordinates.

Vulkan Right-Handed Projection Matrix Box

View Coordinate Frame (Right-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, -\mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $-\mathbf{\hat{y}}$ points down, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Clip Coordinate Frame (Right-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, -\mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $-\mathbf{\hat{y}}$ points down, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [0, 1]$

\begin{align*} M^{Vulkan}_{per, rh \rightarrow rh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{b - \left( -t \right)} & -\frac{b + \left( -t \right)}{b - \left( -t \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ M^{Vulkan}_{orth, rh \rightarrow rh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{b - \left( -t \right)} & 0 & -\frac{b + \left( -t \right)}{b - \left( -t \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ M^{Vulkan}_{per,vfov,rh \rightarrow rh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \\ \end{align*}

(269)

Left-Handed View Space¶

The right-handed view space coordinate system for Vulkan is the frame $(\mathbf{0}, (\mathbf{\hat{x}}, -\mathbf{\hat{y}}, -\mathbf{\hat{z}}))$ in $\mathbb{R}^{3}$ , where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $-\mathbf{\hat{z}}$ points out of the view volume towards the viewer. Notice that the vertical axis points down in this frame. This is a left-handed coordinate system. The clip coordinate system for Vulkan is the same as the right-handed view coordinate system. A homogeneous rotation about the x-axis is defined as

R_{x}\left( \theta \right) = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\left( \theta \right) & -\sin\left( \theta \right) & 0 \\ 0 & \sin\left( \theta \right) & \cos\left( \theta \right) & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(270)

The orthogonal transformations for the left-handed Vulkan view coordinates are given by

Q^{Vulkan}_{lh} = R_{x}\left( \pi \right) = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\left( \pi \right) & -\sin\left( \pi \right) & 0 \\ 0 & \sin\left( \pi \right) & \cos\left( \pi \right) & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(271)

The change of orientation matrices are given by

\Omega_{lh \rightarrow lh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} , \hspace{4 pt} \Omega_{lh \rightarrow rh} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(272)

To compute the projections, we need to transform from the chosen view coordinates to the canonical view coordinates, apply the canonical projection, and then transform from the canonical clip coordinates to the target clip coordinates. We can map any view coordinates to any clip coordinates using the same process. Each coordinate transformation is the product of an orthogonal transform and a change of orientation matrix. Let us calculate the perspective projection

\begin{align*} M^{Vulkan}_{per, lh \rightarrow rh} &= \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} M^{C,Vulkan}_{per} \Omega_{lh \rightarrow lh} Q^{Vulkan}_{lh} \\ &= \left( \Omega_{lh \rightarrow rh} \left( Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per} \left( \Omega_{lh \rightarrow lh} Q^{Vulkan}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow rh} \left( Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per} \left( I Q^{Vulkan}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow rh} \left( Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per} Q^{Vulkan}_{lh} \\ &= \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) M^{C,Vulkan}_{per} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} M^{C,Vulkan}_{per} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{t - \left( -b \right)} & -\frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & -\frac{ 2 n }{t - \left( -b \right)} & \frac{t + \left( -b \right)}{t - \left( -b \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r + l} & 0 & \frac{r - l}{r + l} & 0 \\ 0 & -\frac{ 2 n }{t + b} & \frac{t - b}{t + b} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{ 2 n }{r + l} & 0 & \frac{r - l}{r + l} & 0 \\ 0 & -\frac{ 2 n }{b + t} & -\frac{b - t}{b + t} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r + l} & 0 & \frac{r - l}{r + l} & 0 \\ 0 & \frac{ 2 n }{b + t} & \frac{b - t}{b + t} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{b - \left( -t \right)} & \frac{b + \left( -t \right)}{b - \left( -t \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(273)

therefore

M^{Vulkan}_{per, lh \rightarrow rh} = \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{b - \left( -t \right)} & \frac{b + \left( -t \right)}{b - \left( -t \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(274)

Here is the calculation for the orthographic matrix

\begin{align*} M^{Vulkan}_{orth, lh \rightarrow rh} &= \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} M^{C,Vulkan}_{orth} \Omega_{lh \rightarrow lh} Q^{Vulkan}_{lh} \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{orth} \left( \Omega_{lh \rightarrow lh} Q^{Vulkan}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{orth} \left( I Q^{Vulkan}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{orth} Q^{Vulkan}_{lh} \\ &= \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) M^{C,Vulkan}_{orth} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} M^{C,Vulkan}_{orth} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & \frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & -\frac{2}{t - \left( -b \right)} & 0 & -\frac{t + \left( -b \right)}{t - \left( -b \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} \frac{2}{r + l} & 0 & 0 & -\frac{r - l}{r + l} \\ 0 & -\frac{2}{t + b} & 0 & -\frac{t - b}{t + b} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{2}{r + l} & 0 & 0 & -\frac{r - l}{r + l} \\ 0 & -\frac{2}{b + t} & 0 & \frac{b - t}{b + t} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r + l} & 0 & 0 & -\frac{r - l}{r + l} \\ 0 & \frac{2}{b + t} & 0 & -\frac{b - t}{b + t} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{b - \left( -t \right)} & 0 & -\frac{b + \left( -t \right)}{b - \left( -t \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \end{align*}

(275)

therefore

M^{Vulkan}_{orth, lh \rightarrow rh} = \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{b - \left( -t \right)} & 0 & -\frac{b + \left( -t \right)}{b - \left( -t \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}.

(276)

Finally, we calculate the matrix for the symmetric vertical field of view perspective projection

\begin{align*} M^{Vulkan}_{per,vfov,lh \rightarrow rh} &= \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} M^{C,Vulkan}_{per,vfov} \Omega_{lh \rightarrow lh} Q^{Vulkan}_{lh} \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per,vfov} \left( \Omega_{lh \rightarrow lh} Q^{Vulkan}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per,vfov} \left( I Q^{Vulkan}_{lh} \right) \\ &= \left( \Omega_{lh \rightarrow rh} \left(Q^{Vulkan}_{lh}\right)^{-1} \right) M^{C,Vulkan}_{per,vfov} Q^{Vulkan}_{lh} \\ &= \left( \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \right) M^{C,Vulkan}_{per,vfov} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} M^{C,Vulkan}_{per,vfov} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & \frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & -\frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \end{align*}

(277)

therefore

M^{Vulkan}_{per,vfov,lh \rightarrow rh} = \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix}.

(278)

This completes the derivation of the matrices for the left-handed Vulkan view coordinates.

Vulkan Left-Handed Projection Matrix Box

View Coordinate Frame (Left-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, -\mathbf{\hat{y}}, -\mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $-\mathbf{\hat{y}}$ points down, $-\mathbf{\hat{z}}$ points out of the view volume towards the viewer
Clip Coordinate Frame (Right-Handed Orientation)
- Origin: $\mathbf{0} = [0, 0, 0]^{T}$
- Basis: $(\mathbf{\hat{x}}, -\mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ where $\mathbf{\hat{x}}$ points right, $-\mathbf{\hat{y}}$ points down, $\mathbf{\hat{z}}$ points into the view volume away from the viewer
Canonical View Volume: $[-1, 1] \times [-1, 1] \times [0, 1]$

\begin{align*} M^{Vulkan}_{per, lh \rightarrow rh} &= \begin{bmatrix} \frac{ 2 n }{r - \left( -l \right)} & 0 & \frac{r + \left( -l \right)}{r - \left( -l \right)} & 0 \\ 0 & \frac{ 2 n }{b - \left( -t \right)} & \frac{b + \left( -t \right)}{b - \left( -t \right)} & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ M^{Vulkan}_{orth, lh \rightarrow rh} &= \begin{bmatrix} \frac{2}{r - \left( -l \right)} & 0 & 0 & -\frac{r + \left( -l \right)}{r - \left( -l \right)} \\ 0 & \frac{2}{b - \left( -t \right)} & 0 & -\frac{b + \left( -t \right)}{b - \left( -t \right)} \\ 0 & 0 & -\frac{1}{f - n} & -\frac{n}{f - n} \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} \\ M^{Vulkan}_{per,vfov,lh \rightarrow rh} &= \begin{bmatrix} \frac{1}{\text{aspect} \cdot \tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 & 0 \\ 0 & \frac{1}{\tan\left( \frac{\theta_{vfov}}{2} \right)} & 0 & 0 \\ 0 & 0 & -\frac{f}{f - n} & -\frac{ f n }{f - n} \\ 0 & 0 & -1 & 0 \\ \end{bmatrix} \\ \end{align*}

(279)

Comparing Coordinate System Orientations¶

Let us illustrate why the coordinate systems in this section have a left-handed orientation or right-handed orientation. By convention, the standard basis on $\mathbb{R}^{3}$ has a right-handed orientation. Recall that the standard basis on $\mathbb{R}^{3}$ is the tuple of vectors $(\mathbf{\hat{x}}, \mathbf{\hat{y}}, \mathbf{\hat{z}})$ where $\mathbf{\hat{x}} = [1, 0, 0]^{T}$ , $\mathbf{\hat{y}} = [0, 1, 0]^{T}$ , and $\mathbf{\hat{z}} = [0, 0, 1]^{T}$ are the standard basis vectors. In this basis, Vulkans’s left-handed view coordinate system has the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & -\mathbf{\hat{y}} & \mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \\ \end{bmatrix} = 1 \cdot -1 \cdot 1 = -1

(280)

and Vulkan’s right-handed view coordinates and clip coordinates have the determinant

\det \begin{bmatrix} \mathbf{\hat{x}} & -\mathbf{\hat{y}} & -\mathbf{\hat{z}} \end{bmatrix} = \det \begin{bmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & -1 \\ \end{bmatrix} = 1 \cdot -1 \cdot -1 = 1

(281)

so the right-handed Vulkan view coordinate system and clip coordinate system indeed have a right-handed orientation, and the left-handed view coordinate system has a left-handed orientation.

Summary¶

We develop the manifold structure of real projective space $\mathbb{RP}^{3}$ from scratch, then use that information to show how we can represent linear, affine, and projective transformations as matrices. We demonstrate why real projective space $\mathbb{RP}^{3}$ is a convenient manifold for solving problems in computer graphics, geometric modeling, robotics, computer vision, and other spatial computing domains formulated in the setting $\mathbb{E}^{3}$ . We use this knowledge to construct homogeneous projection matrices.

We construct a set of projection matrices using a canonically chosen set of coordinates which makes it easy to derive any other projection matrix using coordinate transformations in conjunction with the relevant coordinate transformations to create the final result. We chose a view coordinate system where the horizontal axis points to the right, the vertical axis points up, and the depth axis points into the view volume. This has the benefit of keeping all of the computations in the same orthonormal frame, which makes the behavior of the projection more obvious. Operating in real projective space allows us to express our rendering problems in a coordinate system and scale independent way, such that the choice of coordinate system is a degree of freedom for the problem at hand. Moreover, the coordinate system independent formulation is a low-key form of separation of concerns in software engineering for spatial computing domains.

We show how to construct perspective and orthographic projection transformations in $\mathbb{RP}^{3}$ from any view space orthonormal frame to any clip coordinate frame by first defining the matrix in a specially chosen coordinate chart, and then show that one can construct any other one by using the appropriate orthogonal transformations and changes of orientation to map from the desired view coordinate system to the canonical one on one side, and mapping from the canonical clip coordinate system to the desired clip coordinate system using the same process. This result shows that perspective and orthographic projections are indeed coordinate independent transformations.

References¶

Lee, John M. 2011. Introduction to Topological Manifolds. (2nd ed.). Springer Science+Business Media, LLC. Lee (2011).
Marschner, Steve, et al. 2021. Fundamentals of Computer Graphics. 5th ed., A K Peters/CRC Press. Marschner et al. (2021).
Munkres, James R. 2000. Topology. (2nd ed.). Pearson, Upper Saddle River, NJ.

References¶

Lee, J. M. (2011). Introduction to Topological Manifolds. In Graduate Texts in Mathematics. Springer New York. 10.1007/978-1-4419-7940-7
Marschner, S., Shirley, P., Ashikhmin, M., Gleicher, M., Hoffman, N., Johnson, G., Munzner, T., Reinhard, E., Thompson, W. B., Willemsen, P., & Wyvill, B. (2021). Fundamentals of Computer Graphics. A K Peters/CRC Press. 10.1201/9781003050339

The House Of LambdaXymox

BOOK REVIEW: Vector Calculus, Linear Algebra, And Differential Forms A Unified Approach: 5th Edition