Abstract
Objective:
Computed tomography (CT) is a widely used medical imaging modality, but its radiation exposure poses potential risks to human health. Sparse-view scanning has emerged as an effective approach to reduce radiation dose; however, images reconstructed using the filtered back-projection (FBP) algorithm from sparse-view projections often suffer from severe streak artifacts. Achieving high-quality CT image reconstructed from sparse-view projections remains a challenging task.
Methods:
Building on compressed sensing (CS), the total variation (TV) algorithm is applied for high-quality sparse-view reconstruction. We further propose a relative total variation (RTV) algorithm to enhance the accuracy of sparse-view reconstruction. Experimental results indicate that while the RTV algorithm improves accuracy, it has limitations in edge preservation. To address this, inspired by the success of directional TV (DTV) in limited-angle reconstruction, we develop a directional relative TV (DRTV) model. This model applies the RTV technique in both x and y directions independently, and we derive its adaptive steepest descent projection onto convex set (ASD-POCS) solution algorithm.
Results:
Experiments conducted on simulated phantoms and real CT images demonstrate the correctness, convergence, and superior performance of the DRTV algorithm in sparse-view reconstruction. Compared with the TV, DTV, and RTV algorithm, the DRTV algorithm exhibits superior preservation of structural features and texture details.
Significance:
The DRTV algorithm represents an advanced method for high-precision sparse-view CT reconstruction, providing stable and accurate results. Moreover, the approach is applicable to other medical imaging modalities.
Keywords
Introduction
With the advantages of high resolution, fast imaging speed and versatility, computed tomography (CT) has become a widely used imaging modality in clinical diagnosis and treatment nowadays. However, the health risks of radiation exposure from CT should not be underestimated. Sparse-view scanning is a configuration that effectively reduces CT radiation dose. But images reconstructed using the filtered back-projection (FBP) algorithm, 1 a traditional and commercially available reconstruction method, from sparse-view projections often suffer from severe streak artifacts. 2 How to achieve high-precision sparse-view CT reconstruction is a challenging problem.
With the aid of compressive sensing (CS) theory, 3 many ill-posed inverse problems, including sparse-view reconstruction, can now be effectively solved. Among the most representative solutions is the development of total variation (TV) type algorithms. In 2006, the model problem of reconstructing an object from incomplete frequency samples was studied by Candes et al., where it was shown that function can be reconstructed exactly as the solution to the minimization problem. 4 That is to say that exact recovery may be obtained by solving a convex optimization problem. On this basis, Sidky et al. develop and investigate an iterative image reconstruction algorithm based on the minimization of the image TV that applies to divergent-beam CT, which can realize sparse reconstruction with high accuracy. 5 Subsequently, in 2008, Sidky et al. further optimized the algorithm and proposed the adaptive steepest descent projection onto convex set (ASD-POCS) algorithm. 6
Several variants of TV have since emerged to address the shortcomings of TV algorithms for specific problems. In 2010, Synho Do et al. present an improved regularization technique by incorporating higher-order (HOTV) derivatives to reduce staircase artifacts without sacrificing edge sharpness. 7 Later people also applied HOTV in other scenarios.8,9 In 2011, since the low-contrast structures tend to be smoothed out by the TV regularization, therefore Tian et al. developed an iterative CT reconstruction algorithm with edge-preserving TV (EPTV) regularization to reconstruct CT images from highly under-sampled data obtained at low mAs levels. 10 In 2012, since a conventional TV minimization algorithm often suffers from over-smoothness on the edges of the resulting image, Liu et al. proposed an adaptive-weighted TV (AwTV) minimization algorithm derived by considering the anisotropic edge property among neighboring image voxels. 11 In 2012, Xu et al. proposed the relative total variation (RTV) model, which can extract meaningful structures under the complication of texture patterns. 12 In 2013, since the TV minimization process is isotropic, suggesting that it is unfit for limited-angle CT. Therefore Chen et al. proposed an anisotropic TV (ATV) minimization method. 13 In the same year, Ning et al. proposed a new image reconstruction algorithm based on lp norm compressive sensing by combining the penalty function and revised Hesse sequence quadratic programming, and using block compressive sensing. 14 In 2014, Sidky et al. proposed the constrained total p variation (TpV) model and solved the model using the Chambolle-Pock (CP)15–17 optimization algorithm. 18 In 2015, Rigie et al. proposed the total nuclear variation (TnV) model, which better preserves the boundary information by encouraging the different channels to have the same edge structure and the gradient vectors to point to the same direction. 19 In 2017, Wang et al. proposed a new iteratively reweighted ATV method, in which a reweighted technique is incorporated into the idea of ATV. 20 In 2021, Zhang et al. proposed the directional total variation (DTV) model, which introduces the direction of the gradient to constrain the variation of the image, thus preserving the structural features and texture details of the image. 21 In the same year, in ASD-POCS algorithm, the existing gradient expression of the TV-type norm appears too complicated in the implementation code and reduces image reconstruction speed. To address this issue, Qiao developed a simple and fast ASD-POCS algorithm. 22 In the same year, Qiao et al. proposed the balanced total variation (bTV) model, which can guarantee convergence and achieve fast convergence in 3D electron paramagnetic resonance imaging (EPRI)23–25 applications. 26 In 2023, Qiao et al. proposed a data divergence constrained, total nuclear variation minimization model and its CP solving algorithm. 27 In 2024, Liu et al. proposed an edge-preserving total nuclear variation (EPTVN) minimization algorithm for sparse reconstruction in EPRI. 28 In 2024, Qiao et al. proposed to use the DTV algorithm in sparse reconstruction for EPR imaging, which outperforms existing FBP and TV-like algorithms as well as deep learning-based methods. 29 In the same year, Fang et al. used the DTV algorithm for fast EPR imaging, which achieves a 10-fold speedup compared to the standard FBP algorithm. 30
The TV algorithm is not conducive to preserving the image structure during image reconstruction as it penalizes all the gradients in the reconstructed image and may also produce streak artifacts. The RTV model can be used to penalize different sizes of image gradients, which can adaptively protect the image structure in sparse projection CT reconstruction and help to suppress the blocky artifacts to a certain extent.31–33 The DTV model imposes directional TV constraints on the image and accurately recovers the phantoms from data generated over a significantly reduced angular range, and that it considerably diminishes artifacts observed otherwise in reconstructions of existing algorithms.34,35 Since RTV is weak in edge-preserving ability and DTV has edge-protecting advantages, in this paper, we propose a directional relative total variation (DRTV) reconstruction model, which is a fusion of RTV and DTV rather than a simple combination of the two. It can be understood as a directional treatment of relative TV, or it can be seen as a relativized improvement of directional TV. From the point of view of sparse optimization, DRTV is able to utilize more accurate a priori information corresponding to DRTV regular term compared to RTV, thus exerting stronger constraint ability on the solution. Through this fusion, the advantages of both are fully combined, and it is expected to realize high-precision sparse reconstruction.
In summary, the main contributions of this work are listed as follows:
We combine the artifact reduction advantage of RTV and the edge protection advantage of DTV to propose a DRTV model for sparse CT reconstruction. We derived the DRTV-ASD-POCS solution algorithm in the ASD-POCS framework. We design simulation experiments and real CT experiments to verify the correctness, stability and superior performance of the proposed algorithm in sparse reconstruction.
The rest of the paper is organized as follows. In Section II, we design the DRTV model and derive the DRTV-ASD-POCS solving algorithm in the framework of ASD-POCS algorithm. In Section III, we organize experiments to verify the correctness and convergence of the algorithm as well as the superiority it shows on sparse reconstruction. The discussion and conclusions are given in Section IV.
Methods
Imaging system model
In this paper, two-dimensional (2D) parallel-beam CT reconstruction is used as a research object. The discrete to discrete (D2D) imaging model for 2D CT, can be expressed as:
Typically, the system matrix is large-scale, ill-posed and under-conditioned, so direct matrix inversion is computationally intractable. In order to improve the reconstruction accuracy, the model needs to be transformed into an optimization problem and solved using the appropriate method.
RTV model
RTV model shown as
DTV model
DTV model shown as
They are the
DRTV model
In this paper, we incorporate the idea of DTV into the RTV, which constrains the x and y directions, respectively. The proposed DRTV model defined as
Equations (14) and (15) can be written from equations (4), (5), (7) and (8) as
By reorganizing the terms and grouping elements containing
The second line in equation (18) is an approximation obtained due to the numerical stability of the introduced
Equation (19) indicates that
Similarly, we can express equation (17) as
Equations (14) and (15) written in matrix form can be obtained
Then its gradient is calculated as follows:
DRTV-ASD-POCS algorithm
The pseudo-code for the proposed DRTV algorithm is presented in Algorithm 1. Specifically, in Algorithm 1, Lines 5 and 6 implement the POCS component, which ensures data consistency and enforces non-negativity constraints. The data consistency is achieved through algebraic reconstruction technique (ART) 37 operations combined with forward projection. Lines 13 to 17 correspond to the ASD component, which minimizes the DRTV norm. The gradient computation of the DRTV norm in Lines 14 and 15 is derived based on Equations (12) and (13), respectively.
Pseudocodes of the DRTV-ASD-POCS Algorithm
Reconstruction parameters
The complete specification of the DRTV algorithm consists of model parameters and algorithm parameters. The model parameters determine the solution of the optimization model, including the system matrix
Results
Correctness and convergence analysis of algorithm
In this section, the correctness of the DRTV algorithm is verified by using the FORBILD phantom. The size of the phantom is 256 × 256, the rotation center is located at the position of [128,128] in the center of the image, the length of the detector bin element is 1, the number of detector bin is the same as the length of the phantom, and the projection data are uniformly collected in the range of [0, π] for 360 angles for reconstruction. The Siddon ray-driven method is used to find the system matrix A and generate projection data of size 256 × 360.Each pixel of the grayscale image can represent 256 different gray levels ranging from 0 to 255.When the difference in the grayscale of the image is small enough, the human eye cannot perceive the difference between the two images. In the case of sufficient and ideal experimental data, the difference between the reconstructed image and the real image can be measured by root mean square error (RMSE). In general, the monitor is no longer able to distinguish the reconstructed image from the true image, and thus the proposed model, solution algorithm, and computer implementation are correct when
Figure 1 (a) shows the true image of the FORBILD phantom, and (b) shows the reconstructed image. It can be seen that (a) and (b) are almost identical and it is difficult to distinguish them with the naked eye. (c) shows a comparison of the vertical centerline profile between the true image and reconstructed image, and (d) shows a comparison of the horizontal centerline profiles between the true image and reconstructed image. It can be seen from the figure that the centerline profile almost completely overlaps. It is shown that the algorithm achieves a highly accurate reconstruction and meets the correctness metric for qualitative observations in this experiment.

The algorithm correctness verification for FORBILD phantom (a) True image of FORBILD phantom; (b) Reconstructed image; (c) Comparison of vertical centerline profile between the true image and the reconstructed image: the location of the blue line in (a); (d) Comparison of horizontal centerline profile between the true image and the reconstructed image: the location of the red line in (a).
We introduce three metrics to evaluate the convergence behavior of the algorithm by observing the changes in these metrics. They are defined as follows:
Figure 2 (a) shows the iterative trend of RMSE for FORBILD phantom. Above 100 iterations, their RMSEs are less than 10−4, which meets the metric for quantitatively analyzing the experimental correctness. In summary, the qualitative observation and quantitative analysis of the reconstruction results can prove that the verification of the correctness of the algorithm is successful. (b) shows the iterative trend of the data error, and it can be seen that the trend is still decreasing. (c) shows the iterative trend of the relative error of their TV values, respectively. Since the TV value is the sum of the gradient magnitude variations of all pixel points of the image, assigning the TV relative error to each pixel point, the relative error of each pixel point is less than 10−4, the monitor has been unable to distinguish the changes of the image. But there is an upward dithering during the reconstruction process. This phenomenon is a characteristic of the TV algorithms in the iterative process, which often appears in the iterative process. It does not affect the convergence of this algorithm. After the vibration, its relative error will continue to decrease, which indicates that the solution of the optimization model still tends to converge. Therefore, the three image quality metrics all have reached the allowed state of convergence.

Convergence analysis of reconstruction results for the FORBILD phantom (a)
Evaluation of sparse reconstruction ability
In order to evaluate the sparse reconstruction ability of the DRTV algorithm, in this section, the experiments are performed at 20, 30, 40, and 50 projection angles for FORBILD phantom and real CT image by using TV, DTV, RTV and DRTV algorithm. The sizes of FORBILD phantom and real CT image are 256 × 256. In the experimental analysis, RMSE and structural similarity (SSIM) are used as the metrics of image reconstruction quality.
Figure 3 (a) and (b) shows the comparison of the RMSE of the FORBILD phantom and the real CT image by using TV, DTV, RTV and DRTV algorithm for 20 projection angles, respectively. The DRTV algorithm achieves superior reconstruction accuracy after 500 iterations. Figure 4 shows the reconstruction results of the four algorithms for FORBILD phantom. At 20 projection angles, the reconstruction results of the TV and DTV algorithm have obvious artifacts and noise, and several image details are also smoothed out. From 20 to 50 projection angles, the accuracy of the reconstructed image increases. However, the reconstructed image still has some obvious artifacts compared with the original image at 50 projection angles. The reconstructed images by using the RTV and DRTV algorithms are almost indistinguishable from the original images with the naked eye at 20 projection angles. As the projection angle increases, the reconstruction results are still consistent with the original image. Figure 5 shows the reconstructed images on the real CT image by using the four algorithms. Compared to the DRTV algorithm, the TV and RTV algorithm clearly smooth out more image details, while the DTV algorithm has artifacts and noise, especially in the experimental results at 20 angles. The DRTV algorithm exhibits superior capability in preserving structural integrity and fine details of the reconstructed images.

The comparison of RMSE at 20 sparse projection angles by using TV, DTV, RTV and DRTV algorithm for different phantom (a) FORBILD phantom; (b) real CT image.

The comparison of the TV, DTV, RTV and DRTV algorithm for reconstruction of the FORBILD phantom: the number above the image indicates the number of projections; the text on the left indicates the algorithm used.

The comparison of TV, DTV, RTV and DRTV algorithm for reconstruction of real CT image: the number above the image indicates the number of projections; the text on the left indicates the algorithm used.
Table 1 and Table 2 show the RMSE and SSIM of the reconstruction of the FORBILD phantom and real CT image using the TV, DTV, RTV and DRTV algorithm for 20 to 50 projection angles, respectively. DRTV algorithm's final convergence accuracy is 0.0116 and 0.0849, respectively. However, the final convergence accuracies of TV, DTV and RTV algorithm are only 0.0504, 0.0187, 0.0179 and 0.0903, 0.0880, 0.0890, respectively. It is clear that the DRTV algorithm provides higher reconstruction accuracy and structural similarity for the reconstruction.
The comparison of RMSE and SSIM for the FORBILD phantom by the TV, DTV, RTV and DRTV algorithm.
The comparison of RMSE and SSIM for real CT image by the TV, DTV, RTV and DRTV algorithm.
Stability analysis of algorithm
To evaluate the stability of the DRTV algorithm, the experiments are performed at 50 projection angles with different levels of Gaussian noise added to each projection data by using the TV, DTV, RTV and DRTV algorithm for FORBILD phantom and real CT image, respectively. The mean value of Gaussian noise is 0, and the variance is 0.01, 0.02, 0.03, 0.04 and 0.05, respectively. In the experimental analysis, RMSE and SSIM are used as the metrics of image reconstruction quality.
We select the reconstruction results of the FORBILD phantom and real CT image at 50 projection angles and add Gaussian white noise with variance of 0.05 to the projection data for elaboration. As shown in Figure 6 (a) to (d) and Figure 7 (a) to (d)are the reconstruction images of TV, DTV, RTV and DRTV algorithm, respectively, and (e) to (h) are the local area magnification images of the respective reconstruction results. Figure 8 shows the comparison of the RMSE by using TV, DTV, RTV and DRTV algorithm for the FORBILD phantom and the real CT image at 50 projection angles, and with the addition of Gaussian white noise with variance of 0.05 to the projection data, respectively. It can be seen that the TV, DTV, RTV and DRTV algorithm all show noise suppression properties. However, the comparison reveals that the reconstruction results of the DRTV algorithm are more stable and clearer. In Table 3, the reconstruction accuracies of these four algorithms on the FORBILD phantom are 0.0145, 0.0122, 0.0111, and 0.0044, respectively. In Table 4, the reconstruction accuracies of these four algorithms on the real CT image are 0.0764, 0.0657, 0.0636, and 0.0573, respectively. The quantitative analysis shows that the DRTV algorithm is superior in terms of stability and protection of image structure.

(a)∼(d) The reconstruction of the FORBILD phantom results using TV, DTV, RTV, and DRTV algorithm under the condition of adding Gaussian white noise with variance of 0.05 to the projection data at 50 projection angles; (e)∼(h) Local area magnification images of the reconstruction results.

(a)∼(d) The reconstruction of real CT image results using TV, DTV, RTV, and DRTV algorithm under the condition of adding Gaussian white noise with variance of 0.05 to the projection data at 50 projection angles; (e)∼(h) Local area magnification images of the reconstruction results.

The comparison of the RMSE using TV, DTV, RTV and DRTV algorithm for different phantom at 50 projection angles and adding Gaussian white noise with a variance of 0.05 to the projection data (a) FORBILD phantom; (b) real CT image.
The comparison of RMSE and SSIM for FORBILD phantom under different levels of noise by TV, DTV, RTV and DRTV algorithm.
The comparison of RMSE and SSIM for real CT image under different levels of noise by TV, DTV, RTV and DRTV algorithm.
Discussions
The selection of the parameter b has a significant effect on the reconstruction results. As an adjustment factor, b is used to balance the weights in the x and y directions during the reconstruction process. A suitable value of b can ensure that the details of the image in different directions are reasonably preserved, thus improving the reconstruction quality. If b is not properly selected, it may lead to the loss of information or artifacts in a certain direction, thus affecting the overall reconstruction effect. When the value of b is small, which prefers the x direction, the reconstruction process may focus more on keeping the details of the image in the horizontal direction, resulting in sharper horizontal edges. However, if the value of b is too small, it may trigger excessive preservation in the x direction, introducing noise or artifacts. When the value of b is big, which prefers the y direction. For images with distinct directional features, the selection of b value can help to highlight features in a particular direction. Adjusting b can target the enhancement of structures in a certain direction, thus improving the performance of the reconstruction results. Therefore, optimizing the selection of b is crucial for improving reconstruction accuracy and stability.
To better understand the effect of parameter b, we performed additional experiments across a continuous range of b values. Figure 9 illustrates the reconstruction of a real CT image at 50 projection angles for different values of b (0.7, 0.8, 0.9, 1.0, and 1.1). It can be observed that the reconstruction quality improves with b = 0.9, which retains the most image details and provides sharper contours. The values of RMSE and SSIM in Table 5 further confirm that the image reconstruction quality is optimal when b = 0.9, with minimal error and high similarity to the original image. As b moves away from 0.9, either towards smaller or larger values, the reconstruction quality starts to degrade.

(a)∼(e) The comparison of different b value (0.7, 0.8, 0.9, 1.0, 1.1) for reconstruction of real CT image. (f)∼(j) Local area magnification images of the reconstruction results.
The comparison of RMSE and SSIM for real CT image by with different b value.
In summary, optimizing b is crucial for enhancing reconstruction accuracy and stability. The optimal value of b is context-dependent and varies based on the structure of the object and the noise characteristics in the measurement data. For isotropic objects, b≈1 is typically sufficient, while for anisotropic objects, tuning b to emphasize the dominant direction can significantly improve reconstruction results. We recommend that the value of b be carefully selected based on the specific features of the object to achieve the best possible reconstruction outcome.
Conclusions
In this paper, a directional relative TV algorithm for sparse-view CT reconstruction is proposed, which combines the advantages of RTV with those of DTV, designed under the ASD-POCS framework. The algorithm effectively protects the structural features and texture details of the image during the reconstruction process. Experiments conducted on simulated phantoms and real CT images demonstrate the correctness, convergence and superior performance of the DRTV algorithm in sparse-view reconstruction. Compared with the TV, DTV, and RTV algorithm, the DRTV algorithm exhibits superior preservation of structural features and texture details. The qualitative and quantitative results show that the algorithm can realize high-precision image reconstruction while suppressing artifacts. Moreover, the approach is applicable to other medical imaging modalities.
Footnotes
Acknowledgements
This work was supported in part by National Natural Science Foundation of China under grant 62071281, and by Local Science and Technology Development Fund Project Guided by the Central Government under grant YDZJSX2021A003.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
