Cone Beam Iterative Reconstruction

This example demonstrates gradient-based iterative reconstruction for 3D cone beam CT using the differentiable ConeProjectorFunction from diffct.

Overview

3D cone beam iterative reconstruction extends optimization methods to full volumetric reconstruction. This example shows how to:

Formulate 3D cone beam CT reconstruction as a large-scale optimization problem
Handle the computational complexity of 3D forward and backward projections
Apply memory-efficient optimization strategies for volumetric data
Monitor convergence in high-dimensional parameter space

Mathematical Background

3D Cone Beam Iterative Formulation

The 3D reconstruction problem is formulated as:

\[\hat{f} = \arg\min_f \|A_{\text{cone}}(f) - p\|_2^2 + \lambda R(f)\]

where: - \(f(x,y,z)\) is the unknown 3D volume - \(A_{\text{cone}}\) is the cone beam forward projection operator - \(p(\phi, u, v)\) is the measured 2D projection data - \(R(f)\) is an optional 3D regularization term

3D Forward Projection

The cone beam forward projection integrates along rays through the 3D volume:

\[p(\phi, u, v) = \int_0^{\infty} f\left(\vec{r}_s(\phi) + t \cdot \vec{d}(\phi, u, v)\right) dt\]

where \(\vec{r}_s(\phi)\) is the source position and \(\vec{d}(\phi, u, v)\) is the ray direction vector.

3D Gradient Computation

The gradient of the 3D loss function uses the cone beam backprojection operator:

\[\frac{\partial L}{\partial f} = 2A_{\text{cone}}^T(A_{\text{cone}}(f) - p_{\text{measured}})\]

where \(A_{\text{cone}}^T\) is the 3D cone beam backprojection operator (adjoint).

Computational Complexity

3D reconstruction presents significant computational challenges:

Memory Requirements: \(O(N^3)\) for volume storage vs \(O(N^2)\) for 2D images
Projection Data: \(O(N_{\phi} \times N_u \times N_v)\) 2D projections
Forward/Backward Operations: \(O(N^3 \times N_{\phi})\) computational complexity
Gradient Storage: Additional memory for automatic differentiation

Implementation Steps

3D Problem Setup: Define parameterized 3D volume as learnable tensor
Cone Beam Forward Model: Use ConeProjectorFunction for 2D projection prediction
Loss Computation: Calculate L2 distance between predicted and measured projections
3D Gradient Computation: Use automatic differentiation through cone beam operators
Memory-Efficient Optimization: Apply strategies to handle large 3D parameter space
Convergence Monitoring: Track loss and 3D reconstruction quality

Model Architecture

The 3D reconstruction model consists of:

Parameterized Volume: Learnable 3D tensor representing the unknown volume
Cone Beam Forward Model: ConeProjectorFunction with 3D geometry parameters
Loss Function: Mean squared error between predicted and measured 2D projections

3D Regularization Options

Common 3D regularization terms:

3D Total Variation: \(R_{\text{TV}}(f) = \sum_{x,y,z} \|\nabla f(x,y,z)\|_2\)
3D Smoothness: \(R_{\text{smooth}}(f) = \sum_{x,y,z} \|\nabla f(x,y,z)\|_2^2\)
L1 Sparsity: \(R_{\text{L1}}(f) = \sum_{x,y,z} |f(x,y,z)|\)

Memory Management Strategies

3D reconstruction requires careful memory management:

Gradient Checkpointing: Trade computation for memory in backpropagation
Mixed Precision: Use float16 when possible to reduce memory usage
Batch Processing: Process volume slices when memory is extremely limited
Efficient Data Layout: Optimize tensor storage and access patterns

Convergence Characteristics

3D cone beam reconstruction typically exhibits:

Initial Convergence (0-100 iterations): Rapid loss decrease, basic 3D structure emerges
Detail Refinement (100-500 iterations): Fine 3D features develop progressively
Final Convergence (500+ iterations): Slow improvement, potential overfitting risk

Challenges in 3D Reconstruction

Cone Beam Artifacts: Increased artifacts for large cone angles in 3D
Incomplete Sampling: Missing data in certain regions of 3D Fourier space
Computational Cost: Orders of magnitude higher than 2D reconstruction
Memory Limitations: Large volumes may exceed available GPU memory
Convergence Complexity: Higher-dimensional optimization landscape

Applications

3D cone beam iterative reconstruction is essential for:

Medical CBCT: Dental, orthopedic, and interventional imaging
Industrial CT: Non-destructive testing and quality control
Micro-CT: High-resolution imaging of small specimens and materials
Security Screening: Advanced baggage and cargo inspection systems

Code Example3D Cone Beam Iterative Example
import math
import torch
import numpy as np
import matplotlib.pyplot as plt
import torch.nn as nn
import torch.optim as optim
from diffct.differentiable import ConeProjectorFunction

def shepp_logan_3d(shape):
    zz, yy, xx = np.mgrid[:shape[0], :shape[1], :shape[2]]
    xx = (xx - (shape[2] - 1) / 2) / ((shape[2] - 1) / 2)
    yy = (yy - (shape[1] - 1) / 2) / ((shape[1] - 1) / 2)
    zz = (zz - (shape[0] - 1) / 2) / ((shape[0] - 1) / 2)
    el_params = np.array([
        [0, 0, 0, 0.69, 0.92, 0.81, 0, 0, 0, 1],
        [0, -0.0184, 0, 0.6624, 0.874, 0.78, 0, 0, 0, -0.8],
        [0.22, 0, 0, 0.11, 0.31, 0.22, -np.pi/10.0, 0, 0, -0.2],
        [-0.22, 0, 0, 0.16, 0.41, 0.28, np.pi/10.0, 0, 0, -0.2],
        [0, 0.35, -0.15, 0.21, 0.25, 0.41, 0, 0, 0, 0.1],
        [0, 0.1, 0.25, 0.046, 0.046, 0.05, 0, 0, 0, 0.1],
        [0, -0.1, 0.25, 0.046, 0.046, 0.05, 0, 0, 0, 0.1],
        [-0.08, -0.605, 0, 0.046, 0.023, 0.05, 0, 0, 0, 0.1],
        [0, -0.605, 0, 0.023, 0.023, 0.02, 0, 0, 0, 0.1],
        [0.06, -0.605, 0, 0.023, 0.046, 0.02, 0, 0, 0, 0.1],
    ], dtype=np.float32)

    # Extract parameters for vectorization
    x_pos = el_params[:, 0][:, None, None, None]
    y_pos = el_params[:, 1][:, None, None, None]
    z_pos = el_params[:, 2][:, None, None, None]
    a_axis = el_params[:, 3][:, None, None, None]
    b_axis = el_params[:, 4][:, None, None, None]
    c_axis = el_params[:, 5][:, None, None, None]
    phi = el_params[:, 6][:, None, None, None]
    val = el_params[:, 9][:, None, None, None]

    # Broadcast grid to ellipsoid axis
    xc = xx[None, ...] - x_pos
    yc = yy[None, ...] - y_pos
    zc = zz[None, ...] - z_pos

    c = np.cos(phi)
    s = np.sin(phi)

    # Only rotation around z, so can vectorize:
    xp = c * xc - s * yc
    yp = s * xc + c * yc
    zp = zc

    mask = (
        (xp ** 2) / (a_axis ** 2)
        + (yp ** 2) / (b_axis ** 2)
        + (zp ** 2) / (c_axis ** 2)
        <= 1.0
    )

    # Use broadcasting to sum all ellipsoid contributions
    shepp_logan = np.sum(mask * val, axis=0)
    shepp_logan = np.clip(shepp_logan, 0, 1)
    return shepp_logan

class IterativeRecoModel(nn.Module):
    def __init__(self, volume_shape, angles, det_u, det_v, du, dv, sdd, sid, voxel_spacing):
        super().__init__()
        self.reco = nn.Parameter(torch.zeros(volume_shape))
        self.angles = angles
        self.det_u = det_u
        self.det_v = det_v
        self.du = du
        self.dv = dv
        self.sdd = sdd
        self.sid = sid
        self.relu = nn.ReLU() # non negative constraint
        self.voxel_spacing = voxel_spacing

    def forward(self, x):
        updated_reco = x + self.reco
        current_sino = ConeProjectorFunction.apply(updated_reco, 
                                                   self.angles, 
                                                   self.det_u, self.det_v, 
                                                   self.du, self.dv, 
                                                   self.sdd, self.sid, self.voxel_spacing)
        return current_sino, self.relu(updated_reco)

class Pipeline:
    def __init__(self, lr, volume_shape, angles, 
                 det_u, det_v, du, dv, 
                 sdd, sid, voxel_spacing,
                 device, epoches=1000):
        
        self.epoches = epoches
        self.model = IterativeRecoModel(volume_shape, angles,
                                        det_u, det_v, du, dv, 
                                        sdd, sid, voxel_spacing).to(device)
        
        self.optimizer = optim.AdamW(list(self.model.parameters()), lr=lr)
        self.loss = nn.MSELoss()

    def train(self, input, label):
        loss_values = []
        for epoch in range(self.epoches):
            self.optimizer.zero_grad()
            predictions, current_reco = self.model(input)
            loss_value = self.loss(predictions, label)
            loss_value.backward()
            self.optimizer.step()
            loss_values.append(loss_value.item())

            if epoch % 10 == 0:
                print(f"Epoch {epoch}, Loss: {loss_value.item()}")

        return loss_values, self.model

def main():
    Nx, Ny, Nz = 64, 64, 64
    phantom_cpu = shepp_logan_3d((Nz, Ny, Nx))

    num_views = 180
    angles_np = np.linspace(0, 2 * math.pi, num_views, endpoint=False).astype(np.float32)

    det_u, det_v = 128, 128
    du, dv = 1.0, 1.0
    voxel_spacing = 1.0
    sdd = 600.0
    sid = 400.0

    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    phantom_torch = torch.tensor(phantom_cpu, device=device, dtype=torch.float32).contiguous()

    # Generate the "real" sinogram
    angles_torch = torch.tensor(angles_np, device=device, dtype=torch.float32)
    real_sinogram = ConeProjectorFunction.apply(phantom_torch, angles_torch,
                                               det_u, det_v, du, dv,
                                               sdd, sid, voxel_spacing)

    pipeline_instance = Pipeline(lr=1e-1, 
                                 volume_shape=(Nz,Ny,Nx),
                                 angles=angles_torch,
                                 det_u=det_u, det_v=det_v,
                                 du=du, dv=dv, voxel_spacing=voxel_spacing,
                                 sdd=sdd,
                                 sid=sid,
                                 device=device, epoches=1000)
    
    ini_guess = torch.zeros_like(phantom_torch)
    
    loss_values, trained_model = pipeline_instance.train(ini_guess, real_sinogram)
    
    reco = trained_model(ini_guess)[1].squeeze().cpu().detach().numpy()

    plt.figure()
    plt.plot(loss_values)
    plt.title("Loss Curve")
    plt.xlabel("Epoch")
    plt.ylabel("Loss")
    plt.show()

    mid_slice = Nz // 2
    plt.figure(figsize=(12, 6))
    plt.subplot(1, 2, 1)
    plt.imshow(phantom_cpu[mid_slice, :, :], cmap="gray")
    plt.title("Original Phantom Mid-Slice")
    plt.axis("off")

    plt.subplot(1, 2, 2)
    plt.imshow(reco[mid_slice, :, :], cmap="gray")
    plt.title("Reconstructed Mid-Slice")
    plt.axis("off")
    plt.show()

if __name__ == "__main__":
    main()