11email: zhangzhixing@stu.pku.edu.cn 22institutetext: National Key Laboratory of General Artificial Intelligence, School of Intelligence Science and Technology, Peking University, Beijing, China
22email: wanglw@pku.edu.cn 33institutetext: Yizhun Medical AI Co., Ltd, Beijing, China
44institutetext: Peking University First Hospital, Beijing, China
55institutetext: Center for Machine Learning Research, Peking University, Beijing, China
66institutetext: Pazhou Lab, Guangzhou, China
Topology-Preserving Automatic Labeling of Coronary Arteries via Anatomy-aware Connection Classifier
Abstract
Automatic labeling of coronary arteries is an essential task in the practical diagnosis process of cardiovascular diseases. For experienced radiologists, the anatomically predetermined connections are important for labeling the artery segments accurately, while this prior knowledge is barely explored in previous studies. In this paper, we present a new framework called TopoLab which incorporates the anatomical connections into the network design explicitly. Specifically, the strategies of intra-segment feature aggregation and inter-segment feature interaction are introduced for hierarchical segment feature extraction. Moreover, we propose the anatomy-aware connection classifier to enable classification for each connected segment pair, which effectively exploits the prior topology among the arteries with different categories. To validate the effectiveness of our method, we contribute high-quality annotations of artery labeling to the public orCaScore dataset. The experimental results on both the orCaScore dataset and an in-house dataset show that our TopoLab has achieved state-of-the-art performance.
Keywords:
Automatic Labeling Coronary Arteries Topology-Preserving.1 Introduction
Coronary Computerized Tomography Angiography (CCTA) is a commonly used non-invasive approach for the diagnosis of potential coronary artery diseases [11]. In clinical practice, accurate labeling of coronary artery segments (see Figure 1(a)) is a crucial step toward the subsequent diagnosis and analysis of the image. However, the vast variability of coronary artery anatomy across individuals makes it challenging to achieve precise and automatic labeling.

Previous studies on labeling coronary arteries using deep learning-based methods [20, 22, 23, 24] have shown promising results by introducing graph convolutional networks and point cloud analysis. However, these approaches have overlooked the essential prior knowledge that different categories of coronary arteries have anatomically predetermined connections [3]. For instance, LAD, LCX, and RI originate from LM, while S and D arise from LAD. All of the anatomical connections form a tree structure with prior topology, as shown in Figure 1(b). We call this structure category topological trees. Due to lacking the utilization of the category topological trees, existing methods [20, 22, 23, 24] often make some obvious mistakes that violate the prior topology as illustrated in Figure 1(c). We argue that incorporating the category topological trees into the network explicitly is the key to improving automatic labeling performance, especially in reducing topology-violation labeling errors.
In this paper, we propose a novel framework called TopoLab to perform topology-preserving automatic labeling of coronary arteries. Our model mainly contains two components: the hierarchical feature extraction module and the anatomy-aware connection classifier. The hierarchical feature extraction module introduces the segment query to achieve intra-segment feature aggregation via Transformer [17] and relies on graph convolutional network [8] to establish inter-segment feature interactions. Moreover, to incorporate the category topological trees into the network explicitly, we further propose the anatomy-aware connection classifier (AC-Classifier). Unlike previous methods that classify each segment independently, AC-Classifier performs classification for every connected segment pair. Specifically, all of the connections derived from the category topological trees are used to construct the ground truth connection templates, and each connected segment pair is categorized into one of these templates. Since the connection templates inherently conform to the topology, AC-Classifier has effectively prioritized the anatomically predetermined connections and the network is enabled to preserve the topology by design.
To the best of our knowledge, there is currently no publicly available dataset with annotations for artery labeling. In this work, we contribute high-quality annotations to the orCaScore dataset [19]. The experimental results on the public dataset orCaScore and an in-house dataset have demonstrated that our TopoLab outperforms previous state-of-the-art methods, especially in the topology-related metrics.
Our contributions can be summarized as follows. (1) We are the first to incorporate the category topological trees into the deep learning models for automatic labeling of coronary arteries by introducing AC-Classifier. (2) We propose a novel hierarchical feature extraction module to achieve intra- and inter-segment feature aggregations. (3) Our approach achieves state-of-the-art performance on both public and in-house datasets. (4) We provide high-quality annotations of artery labeling for the public dataset orCaScore, which is available at https://github.com/zutsusemi/MICCAI2023-TopoLab-Labels/.
2 Related Work
Traditional methods [1, 21] for automatic labeling of coronary arteries usually align the extracted artery trees with a 3D coronary artery tree model which provides the anatomical connections as prior knowledge. However, these works rely heavily on logical rules, which can not always capture the complexities of the anatomical structure. To overcome the limitations, deep learning has been introduced in this area [20, 22, 23, 24] with its great success in medical imaging [14, 7, 15, 16, 18]. For instance, TreeLab-Net [20] uses bidirectional tree-structural LSTM [6] to model the coronary artery trees, while CPR-GCN [22] constructs a vessel graph by treating each segment as a node and leverages GCN [8] to aggregate segment features. CorLab-Net [24] regards spatial and anatomical dependencies as the explicit guidance for artery labeling based on point cloud networks [13]. Nevertheless, all of these deep learning based methods ignore the important priority about the predetermined anatomical structure – category topological trees. In this paper, we aim to incorporate this priority into the network design explicitly for developing the topology-preserving models.
3 Methodology
3.1 Overview
We start by extracting centerlines from the vessel segmentation annotations in a CCTA image, using a traditional 3D thinning algorithm [9]. Next, we use the minimum spanning tree algorithm [5] to construct two coronary artery trees, one for the left domain (LD) and one for the right domain (RD). Following the branch bifurcation rules in [22] (details can be found in supplementary materials), the coronary artery trees are split into several segments, denoted by , where is the number of segments, denotes the -th segment comprised of the 3D positions of centerline points. Our model takes as input the vessel segments and their connections , where indicates that the -th segment is connected with the -th segment. The objective is to predict the classes of the vessel segments.
As illustrated in Figure 2, we design a novel framework named TopoLab for the automatic labeling of coronary arteries. In Sec. 3.2, we will introduce the hierarchical feature extraction module comprised of intra-segment feature aggregation and inter-segment feature interaction. In Sec. 3.3, we will elaborate on the details of AC-Classifier which exploits the category topological trees effectively.

3.2 Hierarchical Feature Extraction
We first feed the CCTA image into an image encoder (e.g. U-Net [2]) to obtain the downscaled feature map , where is the channel size.
For each segment , trilinear interpolation in the downsampled feature map is adopted for the centerline point to obtain the corresponding point features The segment features are denoted as , simplified as .
Intra-Segment Feature Aggregation. To extract the segment-level features for labeling, aggregation of sequential point features belonging to the same segment is required. Previous studies [20, 22] employ Bidirectional LSTM [6] to summarize the tubular sequential features. However, due to the weak representative ability of LSTM for the vessel segments with huge length variability, the performance of these methods is limited.
We introduce Transformer [17] as our intra-segment feature aggregator for its strong capability to model the relationships among the sequences with varying lengths. Concretely, we set a learnable embedding called segment query to aggregate intra-segment point features. The segment query is concatenated with the segment features to obtain the new tensor . Following [4], we feed augmented by the learnable 3D positional encodings into a Transformer encoder containing several standard sub-blocks. Each standard sub-block has the same architecture as in [17], which consists of multi-head attention, feed-forward network, layer normalization and ReLU activation. The state of the segment query at the output of the Transformer encoder serves as the aggregated segment representation .
Inter-Segment Feature Interaction. The branching structure of coronary arteries is inherently graph-like, with each segment serving as a node and the connections between segments serving as edges. Thus, we leverage graph convolutional network [8] (GCN) to capture the interactions among different segments.
Specifically, let denotes the aggregated segment features where the -th item of is , and is the adjacency matrix for the vessel segment graph derived from the segment connections . The process of GCN layers is as follows:
(1) |
where is the ReLU activation, is the learnable parameters for the -th GCN layer, the input for the first layer is .
Finally, we fuse the input segment features and the output of the final GCN layer to obtain with the parameters . The enhanced segment features are forwarded to the classifier for segment labeling, where is the -th item of .
3.3 Anatomy-aware Connection Classifier
The direct approach for labeling the coronary arteries is to use a linear layer to classify each segment independently as in previous methods. To incorporate the category topological trees into the classifier design, we propose to conduct the classification task for every connected segment pair.
We begin by defining the ground truth segment connections which are composed of the topology-conforming connections derived from the category topological trees (like LMLAD, LCXOM, etc.). Note that the self-connections (e.g. RCARCA) are also considered as the ground truth connections. The ground truth segment connections have the corresponding template embeddings for classification, which are represented by , where is the number of ground truth segment connections. Denote as the -th item of , where and stand for the segment classes indexed by the -th ground truth connection, Enc denotes the sinusoidal encoding as in [17]. The connection templates are used to enable classification for the connected segment pairs.
Then, given the segment features , and all of the connected segment pairs , the connection features are obtained by rearrangement of the segment features, where the -th item of is . We use an MLP layer to further fuse the features .
Training Loss. The loss function can be written as:
(2) |
(3) |
where is the ground truth of the -th segment connection, denotes the -th item of . sim in Eq. 3 stand for the cosine similarity, , and the temperature is a hyperparameter which is set as by default.
Inference. During the inference stage, for each segment, we first select the connection with the largest confidence score among all segment connections that have covered the given segment. And then the corresponding category indexed by the selected connection serves as the prediction of the specific segment.
4 Experiments
4.1 Setup
Datasets. We train and evaluate our method on two datasets. The orCaScore [19] MICCAI 2014 Challenge contains 72 contrast-enhanced CTA images and non-contrast enhanced CT scans. As the original dataset only contains the labels of calcifications, we have annotated vessel segmentation and anatomical categories for coronary arteries with experienced radiologists. Considering the small amount of data, we randomly split the dataset into five folds to perform cross-validation. The mean values of cross-validation results are reported. We also collect an In-house Dataset containing 1200 CTA scans which have been annotated by at least two experts. The dataset is collected in compliance with the terms of the licensing agreement and ethical certification. We randomly split the dataset into train, validation and test set with 800, 200 and 200 scans respectively. The annotations for both datasets adhere to the same standard, which includes 14 classes of coronary artery segments (see Figure 1(b)). It is a challenging task, surpassing the scope of studies like TreeLab-Net[20] and CPR-GCN[22], which only consider 10 and 11 categories, respectively.
Evaluation Metrics. Following previous methods [22, 20, 24], we adopt the mean metrics of all categories of the segments including recall, precision and F1. Note that the mean metric is the weighted average based on the number of segments of different categories. To further evaluate the topological accuracy of connected segments, we propose two new metrics: viola and , which reflect the segment-level topological accuracy and the case-level topological accuracy, respectively. Specifically, viola is calculated as the ratio of the number of connections violating the topology to the total number of connections, while is calculated as the ratio of the number of test cases containing any topology-violating connection to the total number of test cases.
4.2 Implementation Details
3D ResUNet [25] is employed as the image encoder for feature extraction with channel dimension . The transformer encoder has 3 standard blocks and the number of graph convolution layers is set to 4. We train the network using AdamW optimizer [10] with a base learning rate of 5e-4 and the cosine learning rate schedule during the training stage. The batch size is set to 4. All networks are implemented by Pytorch [12] and trained on four NVIDIA GeForce RTX 3090 GPUs. We train our model on the orCaScore dataset for 3.5k iterations and in-house dataset for 12.5k iterations.
4.3 Comparison with Other Methods
Quantitative Results. We compare TopoLab with other deep learning based approaches including TaG-Net [23], CPR-GCN [22], TreeLab-Net [20], and CorLab-Net [24] which are implemented by ourselves with the same training configurations for a fair comparison. Note that for the point cloud-based methods [24, 23], we use intra-segment voting to transform the point-level predictions into segment-level predictions. From the results on the orCaScore dataset in Table 1 and the in-house dataset in Table 2, we can conclude that the proposed TopoLab outperforms all existing methods by a large margin. The performance gains are more significant in the topology-related metrics, which demonstrates the effectiveness of the utilization of prior knowledge. More detailed results on each category of coronary arteries can be found in the supplementary materials.
Qualitative Results. In Figure 3, we present a qualitative comparison of TopoLab with other methods. Consider the first case as an example, where our approach successfully avoids topology-violating errors, whereas other methods incorrectly classify RI as D, leading to the RID, DRI, or LMD connections that violate topology. These visualizations demonstrate the effectiveness of the usage of category topological trees. More qualitative results can be found in supplementary materials.
4.4 Ablation Study
In this subsection, we explore the effectiveness of different components in TopoLab on orCaScore dataset, as shown in Table 3.
Intra-segment Feature Aggregation (IFA). Transformer [17] is leveraged to achieve the intra-segment feature aggregation in our model. To validate its benefits, we replace it with the Bi-LSTM used in CPR-GCN. From the results in the first line of Table 3, our method surpasses Bi-LSTM by 7.94% in F1 score and 26.57% in .
Inter-segment Feature Interaction (IFI). We use GCN to establish inter-segment feature interactions due to the natural graph structure of coronary arteries. When directly removing this module, the performance drops by 1.57% in F1 score and 19.14% in as illustrated in the second line of Table 3.
Anatomy-aware Connection Classifier (ACC). AC-Classifier which exploits the prior knowledge from category topological trees is adopted to classify each connected segment pair. We replace it with a commonly used linear layer to enable classification for single segments as in previous methods, and the performance drops by 1.33% in Viola and 15.42% in , which effectively demonstrates the superiority of the proposed method.

Method | Recall | Precision | F1 | Viola | |
w/o IFA | |||||
w/o IFI | |||||
w/o ACC | |||||
TopoLab |
5 Conclusion
In this study, we review the essential task of coronary artery labeling and exploit the prior knowledge of the predetermined anatomical connections. The proposed strategies of intra- and inter-segment feature aggregation guarantee effective feature extraction, while the AC-Classifier preserves the clinical logic in the network design. The extensive experiments on orCaScore dataset and in-house dataset reveal that the proposed TopoLab has achieved new state-of-the-art performance. We hope our paper could encourage the community to explore the use of clinical priority to facilitate the design of more effective algorithms.
Acknowledgements.
We would like to thank Dr. Nianxi Liao for valuable discussions. This work is supported by National Key R&D Program of China (2022ZD0114900) and National Science Foundation of China (NSFC62276005).
References
- [1] Cao, Q., Broersen, A., de Graaf, M.A., Kitslaar, P.H., Yang, G., Scholte, A.J., Lelieveldt, B.P., Reiber, J.H., Dijkstra, J.: Automatic identification of coronary tree anatomy in coronary computed tomography angiography. The international journal of cardiovascular imaging 33, 1809–1819 (2017)
- [2] Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O.: 3d u-net: learning dense volumetric segmentation from sparse annotation. In: International conference on medical image computing and computer-assisted intervention. pp. 424–432. Springer (2016)
- [3] Dodge Jr, J.T., Brown, B.G., Bolson, E.L., Dodge, H.T.: Intrathoracic spatial location of specified coronary segments on the normal human heart. applications in quantitative arteriography, assessment of regional risk and contraction, and anatomic display. Circulation 78(5), 1167–1180 (1988)
- [4] Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
- [5] Graham, R.L., Hell, P.: On the history of the minimum spanning tree problem. Annals of the History of Computing 7(1), 43–57 (1985)
- [6] Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural networks 18(5-6), 602–610 (2005)
- [7] Isensee, F., Jaeger, P.F., Kohl, S.A., Petersen, J., Maier-Hein, K.H.: nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods 18(2), 203–211 (2021)
- [8] Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)
- [9] Lee, T.C., Kashyap, R.L., Chu, C.N.: Building skeleton models via 3-d medial surface axis thinning algorithms. CVGIP: graphical models and image processing 56(6), 462–478 (1994)
- [10] Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)
- [11] Mowatt, G., Cook, J.A., Hillis, G.S., Walker, S., Fraser, C., Jia, X., Waugh, N.: 64-slice computed tomography angiography in the diagnosis and assessment of coronary artery disease: systematic review and meta-analysis. Heart 94(11), 1386–1393 (2008)
- [12] Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Zeming Lin, e.a.: Pytorch: An imperative style, high-performance deep learning library. In: Wallach, H.M., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E.B., Garnett, R. (eds.) NeurIPS (2019)
- [13] Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems 30 (2017)
- [14] Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. pp. 234–241. Springer (2015)
- [15] Shen, D., Wu, G., Suk, H.I.: Deep learning in medical image analysis. Annual review of biomedical engineering 19, 221–248 (2017)
- [16] Shit, S., Paetzold, J.C., Sekuboyina, A., Ezhov, I., Unger, A., Zhylka, A., Pluim, J.P., Bauer, U., Menze, B.H.: cldice-a novel topology-preserving loss function for tubular structure segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 16560–16569 (2021)
- [17] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. Advances in neural information processing systems 30 (2017)
- [18] Wang, D., Zhang, Z., Zhao, Z., Liu, Y., Chen, Y., Wang, L.: Pointscatter: Point set representation for tubular structure extraction. In: Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXI. pp. 366–383. Springer (2022)
- [19] Wolterink, J.M., Leiner, T., De Vos, B.D., Coatrieux, J.L., Kelm, B.M., Kondo, S., Salgado, R.A., Shahzad, R., Shu, H., Snoeren, M., et al.: An evaluation of automatic coronary artery calcium scoring methods with cardiac ct using the orcascore framework. Medical physics 43(5), 2361–2373 (2016)
- [20] Wu, D., Wang, X., Bai, J., Xu, X., Ouyang, B., Li, Y., Zhang, H., Song, Q., Cao, K., Yin, Y.: Automated anatomical labeling of coronary arteries via bidirectional tree lstms. International journal of computer assisted radiology and surgery 14, 271–280 (2019)
- [21] Yang, G., Broersen, A., Petr, R., Kitslaar, P., de Graaf, M.A., Bax, J.J., Reiber, J.H., Dijkstra, J.: Automatic coronary artery tree labeling in coronary computed tomographic angiography datasets. In: 2011 Computing in Cardiology. pp. 109–112. IEEE (2011)
- [22] Yang, H., Zhen, X., Chi, Y., Zhang, L., Hua, X.S.: Cpr-gcn: Conditional partial-residual graph convolutional network in automated anatomical labeling of coronary arteries. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 3803–3811 (2020)
- [23] Yao, L., Shi, F., Wang, S., Zhang, X., Xue, Z., Cao, X., Zhan, Y., Chen, L., Chen, Y., Song, B., et al.: Tag-net: Topology-aware graph network for centerline-based vessel labeling. IEEE Transactions on Medical Imaging (2023)
- [24] Zhang, X., Cui, Z., Feng, J., Song, Y., Wu, D., Shen, D.: Corlab-net: anatomical dependency-aware point-cloud learning for automatic labeling of coronary arteries. In: Machine Learning in Medical Imaging: 12th International Workshop, MLMI 2021, Held in Conjunction with MICCAI 2021, Strasbourg, France, September 27, 2021, Proceedings 12. pp. 576–585. Springer (2021)
- [25] Zhang, Z., Liu, Q., Wang, Y.: Road extraction by deep residual u-net. IEEE Geoscience and Remote Sensing Letters 15(5), 749–753 (2018)