Skip to main content

The burden of cancers associated with HIV in the South African public health sector, 2004–2014: a record linkage study



The impact of South Africa’s high human immunodeficiency virus (HIV) burden on cancer risk is not fully understood, particularly in the context of antiretroviral treatment (ART) availability. We examined national cancer trends and excess cancer risk in people living with HIV (PLHIV) compared to those who are HIV-negative.


We used probabilistic record linkage to match cancer records provided by the National Cancer Registry to HIV data provided by the National Health Laboratory Service (NHLS). We also used text search of specific HIV terms from the clinical section of pathology reports to determine HIV status of cancer patients. We used logistic and Joinpoint regression models to evaluate the risk and trends in cancers in PLHIV compared to HIV-negative patients from 2004 to 2014. In sensitivity analysis, we used inverse probability weighting (IPW) to correct for possible selection bias.


A total of 329,208 cancer cases from public sector laboratories were reported to the NCR from 2004 to 2014 with the HIV status known for 95,279 (28.9%) cancer cases. About 50% of all the female cancer cases (n = 30,486) with a known status were HIV-positive. PLHIV were at higher risk of AIDS-defining cancers (Kaposi sarcoma [adjusted OR:134, 95% CI:111–162], non-Hodgkin lymphoma [adjusted OR:2.73, 95% CI:2.56–2.91] and, cervix [adjusted OR:1.70, 95% CI:1.63–1.77], conjunctival cancer [adjusted OR:21.5, 95% CI:16.3–28.4] and human papilloma virus (HPV) related cancers (including; penis [adjusted OR:2.35, 95% CI:1.85–2.99], and vulva [adjusted OR:1.94, 95% CI:1.67–2.25]) compared to HIV-negative patients. Analysis using the IPW population yielded comparable results.


There is need for improved awareness and screening of conjunctival cancer and HPV-associated cancers at HIV care centres. Further research and discussion is warranted on inclusive HPV vaccination in PLHIV.


In Africa, 25.7 million people currently live with the Human Immunodeficiency Virus (HIV) as of 2017 [1]. In South Africa, approximately 14% of the population was living with HIV in 2017 [2]. Since the introduction of antiretroviral treatment (ART) in 2004, there has been an increase in longevity amongst people living with HIV (PLHIV) in South Africa [3]. With this increase in longevity and the known association between cancer and HIV, the risk for cancer amongst PLHIV has increased. However, the additional risk of cancer that PLHIV in South Africa have compared to those who are HIV negative in the ART era is not fully documented.

Studies in developed countries have shown a higher burden of non-AIDS-defining cancers (NADCs) amongst PLHIV in the ART era particularly, anal, skin, liver and lung cancer [4, 5]. Associated with this is age, race, unavailability of ART in some cases, HIV transmission route, lifestyle related factors and immunosuppression [4, 6,7,8]. However, not all NADCs have exhibited differential rates before and after ART. For example PLHIV have remained at low risk of colon, breast and prostate cancers, leading to the possibility that not all cancers are associated with immunosuppression [9]. In contrast, developing countries still have a higher burden of AIDS-defining cancers (ADCs), namely Kaposi Sarcoma (KS), cervical cancer (CC), and non-Hodgkin lymphoma (NHL). This is largely due to co-infections with oncogenic viruses and possibly, poor access to HIV care including ART [10,11,12].

Studies on HIV and cancer done in South Africa have involved HIV cohorts or case control studies which have limited generalization to the general population [13, 14]. The cancer data provided by the National Cancer Registry (NCR) lacks information on HIV status amongst cancer patients as HIV status is not routinely collected in the cancer registry. The South African HIV Cancer Match (SAM) study is a probabilistic record linkage study. It consists of a national HIV cohort created from National Health Laboratory Service (NHLS) HIV laboratory data (CD4 counts, viral load, HIV tests), linked to the NCR data, in order to study cancer risk in HIV positive people [15]. The current study is nested within the SAM study. We aimed to determine the impact of HIV on cancer burden and the cancer risk in PLHIV compared to HIV negative people or the general South African population.


Study setting and design

The NHLS is the largest diagnostic pathology service in South Africa. It provides laboratory and public health services to over 80% of the South African population [16]. This is achieved through a national network of laboratories in all the nine provinces of South Africa. The NHLS’ Corporate Data Warehouse (CDW) is an electronic data repository for all public sector laboratory data. The NCR’s main mandate is pathology-based cancer surveillance with both private and public laboratories legislated to report all cancer cases to the institution. This was a cross sectional study of all cancers diagnosed in public sector laboratories from 2004 to 2014 with HIV data being obtained from the NHLS’ CDW.

Study population, variables and data sources

We included all records of patients diagnosed with cancer in public healthcare laboratories from 2004 to 2014. Cancer diagnosis was coded according to International Classification of Diseases for Oncology (ICD-O-3) excluding all cancer pre-cursor lesions. Since the source of our HIV data was the NHLS, which services the public sector, we excluded cancer records from the private sector. Our rationale was, if a patient accessed cancer care at a private facility, they were more likely to access HIV care at a private facility as well [17]. From our linkage out of the 335,589 cancer records that were reported from the private sector only 1122 had a known HIV result thus supporting our hypothesis.

An individual was considered HIV positive or negative if the HIV diagnostic test result was positive or negative respectively. If the result was indeterminate or neither positive nor negative, the HIV result was regarded as unknown. In addition, HIV monitoring tests such as HIV viral load and CD4 counts were used to assume an HIV positive status. To supplement the NHLS HIV dataset, repeated text mining was done to extract more HIV results from the clinical section of pathology reports on confirmed cases of cancer reported to the NCR. By definition, text mining refers to the drawing out of important and specific information from a block of text [18]. The text mining process involved the use of key terms used to refer or infer HIV status. The key words used included, “HIV”, “HIV+” “HIV positive” “AIDS”, “haart”, “ART”, “ARV”, “antiretroviral”, anti-retroviral”, “RVD”, “RVD positive”, “retroviral disease”, “immune suppression”, “immunosuppression”, “immuno-suppression”, “acquired immune-deficiency”, “retroreactive”, “immunocompromised”, “HIV reactive”, “CD4”, “regimen 1 treatment”, “reg 1 treatment “Retroviral disease”, “RVD”, “HIV”, “HAART” and “ARV”. From the extracted records a series of samples were taken and reviewed to refine the search terms. Demographic characteristics and potential confounders such as age, gender and race were extracted from the NCR database.

Data management

The HIV and cancer datasets were linked using the in house CDW probabilistic record linkage algorithm. This algorithm is used to link all the laboratory records that belong to the same individual within the entire NHLS database. The linkage variables include name, surname and date of birth. For records to be considered a match, the first letter of the first names should match and two components of the date of birth must also match. First names and surnames are given the same linkage weights (40% each) and the date of birth contributes 20% of the overall weight. For records with a recorded national identity number, exact matching is done and this is used to validate the probabilistic record linkage. Records that attain a score of 90% and above are considered a match. After linkage, duplicates were removed and private sector cancer records were excluded and a final sample of 329,280 records remained.

Cervical cancer, KS and NHL were classified as ADCs and the rest of the malignancies as NADCs. We also looked at NHL subtypes namely, Burkitt lymphoma, Diffuse large B-cell lymphoma (DLBCL), Diffuse immunoblastic large B-cell lymphoma (DILBCL), follicular lymphoma Not otherwise Specified (NOS) and NHL NOS. The NADCs were grouped into virus-related and virus unrelated cancers. The following were classified as virus-related cancers according to the IARC Monograph Working group assessment; liver cancer (hepatitis viruses), penis, vulva, vagina, anal, oropharynx, larynx and tonsil (Human Papilloma Virus (HPV) other than cervix) and Hodgkin’s lymphoma and nasopharyngeal cancer (Epstein Barr Virus (EBV)) [19]. Although all the ADCs are associated with viruses they were not included in the virus related NADCs category. For descriptive purposes, age was classified as 0–14, 15–19, 20–24, 25–29, 30–34, 35–39, 40–44, 45–49, 50–54, 55–59 and 60 + .

Data analysis

We determined the characteristics of cancer patients (age, gender, race, cancer type (NADC or ADC) and cancer diagnosis year) by HIV status (positive, negative or unknown) with 95% confidence intervals. To determine the additional risk that PLHIV had of developing specific cancers as per ICD-O-3 coding, logistic regression models were fitted adjusting for age (as a continuous variable), gender (males and females), race (Asian, Black, Coloured and White) and cancer diagnosis year (modelled as a continuous variable).

We assessed trends in cancer risk for selected cancers by plotting yearly crude odds ratios using Joinpoint regression models (Joinpoint Regression Program, Version April, 2018 Statistical Research and Applications Branch, National Cancer Institute). The Joinpoint program allows one to determine if the trend observed is statistically significant or not. In most cases the independent variable is the calendar year. Observed odds ratios (or other parameters such as incidence rate or counts) are joined in straight lines at each time point hence the term joinpoint. The model goes to identify at which time point a significant change in trend is observed as well as the magnitude of the change (Annual Percentage Change (APC)). Permutation tests are then used to select the final model that better describes the change in trends. To determine the contribution of HIV to the cancer burden in South Africa, we calculated Attributable Risk Fractions (ARFs) using adjusted odds ratios as demonstrated by Newson [20].

Sensitivity analysis

Clinicians are more likely to request an HIV test if the patient is symptomatic, hence creating a selection bias. With high number of missing HIV status, inverse probability weighting (IPW) methods were used as a post-hoc sensitivity analysis to correct for possible selection bias. We created the weights using age, gender, cancer diagnosis year and cancer type similar to the method used by Dryden-Petersen et al. [10].

Analysis was done using Stata version 15 (College Station, TX: StataCorp LP). P-values of less than 0.05 were considered to be statistically significant.


From 2004 until 2014, a total of 329,208 cancers were reported to the NCR by the public sector laboratories. Probabilistic record linkage identified 90,796 HIV results and through text mining of cancer pathology reports an additional 4483 HIV results were found. Of the 95,279 (28.9%) cancer patients with a known HIV status, 46,951 (14.3%) were HIV positive. Amongst PLHIV, cancer proportions were highest between the ages of 25 and 49 (Table 1 below). In contrast, 37% (n = 17,890) of all HIV negative individuals were in the over 60 age group. Across all the HIV status subgroups, the greater proportion of cancers was observed in the Black population at 62.6% (n = 206,286). A general increase in cancer proportions was observed for all cancers irrespective of the HIV status by calendar year. Compared to the HIV negative individuals and those with an unknown status, more ADCs were observed in PLHIV. Throughout the study period, ADCs remained constantly higher than NADCs in HIV positive individuals, (Fig. 1 below).

Table 1 Characteristics and distribution of public sector cancer cases by HIV status, 2004–2014
Fig. 1
figure 1

Percentage contribution of ADCs and NADCs to the total cancer burden amongst PLHIV in South Africa, 2004–2014. A comparison of incident cancers by cancer type in PLHIV. Given in the graph is a percentage of the total cancers in PLHIV each year

Correcting for age, gender, race, and year of cancer diagnosis, cancer risk was highest in the HIV positive population for all ADCs (Kaposi sarcoma, NHL, and cervical cancer) with an overall adjusted odds ratio of 4.5 (95% CI =4.35–4.65). The NHL subtypes Burkitt’s lymphoma (adjusted OR: 6.48, 95% CI (5.21–8.07)), Diffuse large B-cell lymphoma (DLBCL) (adjusted OR 2.93 95% CI (2.67–3.22)) and Diffuse immunoblastic large B-cell lymphoma (DILBCL) (adjusted OR 12.1 95% CI (9.02–16.3)). Compared to HIV negative individuals, PLHIV were 0.74 times less likely to develop NADCs (adjusted OR: 0.26, 95% CI (0·25–0.26). As a group, virus-related NADCs were not significantly associated with HIV but most of the HPV-associated cancers such as anal, penile, vulva and lip and Hodgkin’s lymphoma (EBV-associated), were high risk in HIV positive individuals [p < 0.0001]. Liver cancer, which is associated with hepatitis viruses, was not significantly associated with HIV. People living with HIV were at a higher risk for Squamous Cell Carcinoma (SCC) of the skin, Basal Cell Carcinoma (BCC), eye, and conjunctival cancers (p < 0.0001). Non-virus related NADCs were also not associated with HIV. The weighted analysis produced results that were comparable to the complete case analysis (Table 2).

Table 2 Odds ratios for cancer in PLHIV by complete case analysis and by weighted analysis

Trends in cancer risk for selected individual cancers varied, with significant increases observed for cervix, anus, vulva, conjunctiva and penis from 2004 to 2014 in PLHIV (Fig. 2 below). Although the APC was not significant for Kaposi sarcoma, there was a substantial decrease in risk between 2004 and 2006 with no changes observed thereafter. Prior to 2011, there was no significant difference in risk for anal, vulva and penile cancers between those who were HIV negative and PLHIV but significant increases were observed after 2011. For Burkitt’s lymphoma and NHL, whilst the risk was higher in PLHIV, there was relatively no change over the study period. Although insignificant, the trend line for Hodgkin’s lymphoma was suggestive of an increase in cancer risk.

Fig. 2
figure 2

Trends in cancer risk for selected cancers amongst PLHIV in the South African public health sector, 2004–2014. The line graphs were fitted in Joinpoint using crude odds ratios (dots). The annual percentage change in odds ratios was significant (p-value < 0.05) for all cancers selected for in-depth analysis of trends except for Kaposi sarcoma, Burkitt’s lymphoma, NHL and Hodgkin’s lymphoma

There was no shift in burden between ADCs and NADCs observed amongst incident cancers in PLHIV (Fig. 1). Using weighted estimates of the odds ratio, 41% of all ADCs reported between 2004 and 2014 were attributable to HIV. The contribution of HIV on ADCs increased by 22% within the study period (Fig. 3). No particular contribution by HIV towards NADCs as a whole was noted, given the negative ARFs. The same was true for the category virus-related NADCs (Fig. 3), HIV did not seem to contribute to the burden of virus related NADCs amongst PLHIV in the public sector. However, the “protective” effect of HIV has been waning overtime.

Fig. 3
figure 3

Trend in Attributable risk fractions amongst PLHIV in the South African public health sector, 2004-2014. Using adjusted odds ratios adjusting for age, gender, race, year of cancer diagnosis, and Province. ARF = Attributable Risk Fraction. ADC = AIDS defining cancer (includes Kaposi sarcoma, non-Hodgkin’s lymphoma and cervical cancer). NADCs = Non-AIDS Defining Cancers. Virus-related NADCs = liver cancer (hepatitis viruses), penis, vulva, vagina, anal, lip, mouth, gum, salivary gland and tonsil (Human Papilloma Virus (HPV) associated malignancies other than cervix), and Hodgkin’s lymphoma and nasopharyngeal cancer (Epstein Barr Virus (EBV)


Over the period 2004–2014 (ART era), the risk of all ADCs and some virus-related NADCs was higher amongst HIV positive individuals compared to those who were HIV negative. The strongest association was observed between KS [adjusted OR: 134, 95% CI 111–161], conjunctival cancer [adjusted OR: 21.5, 95% CI 16.3–28.4] as well as Burkitt’s lymphoma [adjusted OR: 6.48, 95% CI 5.21–8.07]. Amongst the virus-related NADCs, HPV-associated cancers such as lip, anal, penile and vulva cancer had the strongest associations with HIV. Compared to those who were HIV negative, squamous cell carcinoma of the skin (SCC), basal cell carcinoma (BCC) and conjunctival cancer were the only virus-unrelated NADCs that were significantly associated with HIV. Over time, amongst PLHIV a significant upward trend in risk was observed for cancer of the conjunctiva and anogenital cancers, including cervix.

The spectrum of cancers observed in this study was comparable to what has been observed in other African countries. Similar to a case control study by Stein et al. conducted before ART was available, the risk of KS, cervical cancer, NHL, anogenital cancers other than cervix and SCC skin was elevated amongst PLHIV in our present study [21]. For KS, the risk was higher compared to the one reported by Stein (adjusted OR: 50.4, 95% CI 34.2–74.3) [21]. In both our study and the Stein pre-ART study, the odds ratios were adjusted for age, gender, race and year of diagnosis, which allowed for comparability. Possible explanations for the higher risk in our study is that, until 2016 when the universal test and treat policy was adopted in South Africa, treatment initiation was dependent on CD4 count [22]. In 2004, ART became freely available in the public sector with patients who had CD4 counts of less than 200 cell/μl or in the WHO stage IV of disease being eligible for treatment [23]. Patients were also evaluated to determine if they were psychological fit to receive the treatment. In 2010 in addition to the 2004 recommendations, those who had a co-infection with TB were also automatically eligible for the free ART [24]. In 2011, the criteria were then expanded to include all patients who had a CD4 count of less than 350 cells/μl [24]. These CD4 count thresholds led to a high proportion of immunosuppressed individuals with a high burden of disease, a risk factor for ADCs [14, 22]. As a result, the risk of KS remained elevated even after ART introduction. Moreover, it is possible that the pick-up rate of KS at HIV clinics improved with the expansion of ART and improvements in HIV treatment policies in South Africa hence the greater strength of association observed. Despite the high risk reported for KS in our study, it was lower than reported in other studies particularly those done in the developed countries [4, 25]. In South Africa, the prevalence of Human Herpes Virus (HHV8) was high even before the HIV era, therefore creating a high KS background risk [26]. In addition to this, clinical diagnosis of KS is quite prevalent in the African context with no biopsies or other samples being sent to the laboratory [27]. Therefore, under-reporting of KS to the pathology-based cancer registry may have been possible.

In contrast, the risk reported in our study for NHL (adjusted OR: 2.73, 95% CI 2.56–2.91) was lower than the one reported by Stein (adjusted OR: 6.1, 95% CI 4.4–8.4) which points to a possible reduction in risk of NHL after the introduction of ART [21]. There was no change noted before and after ART in overall cervical cancer risk although an upward trend was observed in the ART era. This is in line with other reports from Africa with various reasons being put forth to account for the increase in cervical cancer risk even with the introduction of ART. These include advanced disease upon ART initiation and older age [28, 29]. Another theory that has been put forward is the lack of a relationship between cervical cancer risk and immunosuppression. Some studies have demonstrated that low CD4 counts do not necessarily amount to increased risk of cervical cancer and other HPV-related cancers [29]. As such, restoration of immunity with ART will not necessarily lead to a reduced risk of cervical cancer. In addition, the prevalence of HPV (a known risk factor for cervical cancer) is higher amongst women living with HIV [29, 30]. Possible co-infection with HPV has also been highlighted in this study with increased risk amongst PLHIV observed for HPV associated cancers such as vulva, anus, penis and lip.

Besides the ADCs and HPV related cancers, we observed other additional cancers were strongly associated with HIV in the ART era. Compared to HIV negative individuals, the risk of conjunctival cancer, Hodgkin’s lymphoma and BCC was also higher in PLHIV in our study. Before ART, there were no reports of conjunctival cancer and BCC as being high risk amongst PLHIV in South Africa [21]. The association between conjunctival cancer and HIV has been reported in Africa [29, 31]. High rates of solar radiation and unproved associations with HPV have been cited as possible reasons why this cancer is common in Sub-Saharan Africa compared to other parts of the world [29]. Like SCC skin, we observed stronger associations between HIV and BCC. Reports have linked age and white race to higher BCC risk in PLHIV with immunosuppression and increased viral loads only being linked to SCC skin [32, 33]. On the other hand PLHIV were less likely to develop virus-unrelated cancers such as breast and prostate which is in line with the literature [4, 7, 25]. Lower risks were also observed for lung and liver cancers in PLHIV consistent with the results reported by Stein et al. but contrary to other reports especially those done in resource rich areas [4, 7, 9]. In the resource rich countries, there is a higher prevalence of lifestyle related factors such as smoking which results in lung cancer and increased alcohol intake which results in liver cancer in HIV cohorts [7]. In our study, it is still uncertain why the liver and lung cancer risk was lower in PLHIV compared to HIV negative individuals.

In the ART era, different cancer trends have been observed, with ADCs decreasing upon ART introduction in other settings [6, 9, 25]. In particular, KS has declined with the introduction and expansion of ART hence supporting the association between this cancer and immunosuppression [6, 25, 34]. In our study following the initial drop in KS risk after ART introduction in 2004, there has not been a significant change in risk amongst PLHIV in the ART era [10]. This is similar to what was reported in a recent study done in Botswana which demonstrated a decrease in KS risk with ART introduction but no significant change with increased roll out of ART [10]. The arguments for this are similar to the reasons why KS risk was reported as higher in our study compared to the pre-ART era, which include HIV treatment policies and improved pick-up rate. The trend in NHL risk exhibited a slight but insignificant decrease over the 11-year period. Whilst some studies have shown decreasing trend in NHL in the ART era in PLHIV others have shown stable trends even with the increased rollout of ART [10, 25]. This has largely been because of Burkitt’s lymphoma as its incidence has remained constant even in the ART era.

Also showing increasing trends in the ART era were most HPV related anogenital cancers (cervix, anus, penis and vulva). Although anal cancer is on the rise, the risk reported is not as high as observed in developed countries. This is possibly due to the difference in HIV epidemiology between South Africa and developed countries. In the latter, the main mode of HIV transmission is men who have sex with men (MSM) through receptive anal sex where as in South Africa, HIV transmission is mainly heterosexual [4, 5]. Co-infection with HPV is higher amongst people living with HIV with the routes of transmission being similar to HIV [35]. Both anal and cervical cancer are associated with HPV, but the different transmission routes will result in more cervical cancer in the African context and more anal cancer in developed countries.

This was the first nationwide study to compare cancer risk amongst the HIV positive and HIV negative people in the ART era. Laboratory confirmation of both cancer and HIV allowed for high specificity of HIV and cancer diagnosis. Although a greater proportion of the HIV status was unknown, the methods used to ascertain HIV status such as probabilistic record linkage and text search ensured that we extracted and matched most of the available HIV records. In addition, probabilistic record linkage allowed us to identify records belonging to the same individual even in the absence of a unique identifier. The greater percentage of black population with HIV and cancer was reflective of the HIV epidemic in South Africa as well as patterns of access to public health services. In addition to this, the use of IPW allowed for assessment of the risk estimates given the possible selection bias due to the high proportion of missing HIV status. The conclusions from the weighted analysis (IPW) were comparable with the complete case analysis. Moreover, women were well represented with enough numbers for cancers that are common in females to be fully analysed.

Despite all these strengths, our study had limitations. Due to its laboratory-based surveillance system, the NCR underreports some cancers that are diagnosed clinically or radiologically like lung and liver cancers. This might potentially result in misrepresentation of association between HIV and these cancers. Although probabilistic record linkage allowed for matching, in the absence of a unique identifier there is still room for some false matches. The national unique identifier remains the gold standard. Another limitation of our study was overrepresentation of the HIV positive individuals. Doctors are more likely to note down the HIV status of a patient if the patient is tested positive. In addition to this, specific cancers such as KS and other symptoms that are known to be associated with HIV are more likely to prompt a clinician to request an HIV test to be done on the patient [36]. This will result in a higher HIV testing and subsequently higher HIV prevalence compared to the general population. Therefore, with the text mining of doctors’ clinical notes in pathology-reports, we were more likely to pick up those that were tested positive than those that were tested negative or never tested. As such, our study also shares the same limitations as proportionate incidence ratio studies. The increased risk observed may be a reflection of a higher HIV prevalence resulting in more cancer cases that are associated with HIV in our study population compared to that in the general population. The evaluation of cancer risk in PLHIV as a function of time was not possible in this study. However, through the SAM study determination of cancer risk with a person-time denominator will be possible. Data on other potential confounders such as lifestyle patterns (smoking, alcohol intake, diet and exercise) and other opportunistic infections was also not available. Access to this information would have possibly made the results more robust.


PLHIV have a higher risk for all ADCs and most virus-related NADCs. The risk of anogenital cancers and conjunctival cancer continues to rise in the ART era and suggests that, ART alone is inadequate in reducing cancer in PLHIV. Most of these cancers are HPV-related. Targeted public health interventions for HPV such as screening and expansion of HPV vaccination (for cervical cancer) amongst PLHIV are essential in reducing the burden. To consolidate these efforts, ART expansion and availability as well as retention in care should be strengthened. With the introduction of universal ART treatment in 2016, further decreases in ADCs are expected provided individuals report to health care centres before the HIV disease has advanced.



AIDS defining cancer


Antiretroviral treatment


Basal cell carcinoma


Human immunodeficiency virus


Human Papilloma Virus


Inverse probability weighting


Kaposi sarcoma


Non-AIDS defining cancer


National Health Laboratory Service


People living with HIV


South African HIV Cancer Match


Squamous cell carcinoma


  1. WHO. World Health Organization. HIV/AIDS - WHO fact sheet. 2018. Accessed 15 Mar 2019.

  2. Human Sciences Research Council. South African Nationa HIV Prevalence, Incidence, Behaviour and Communication Survey, 2017. 2018; c:1–4.

  3. Shisana O, Rhele T, Simbayi LC, Zuma K, Jooste S, Zungu N, et al. South African national HIV prevalence, incidence and behaviour survey, vol. 2014; 2012.

    Google Scholar 

  4. Dal Maso L, Polesel J, Serraino D, Lise M, Piselli P, Falcini F, et al. Pattern of cancer risk in persons with AIDS in Italy in the HAART era. Br J Cancer. 2009;100:840–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Silverberg MJ, Chao C, Leyden WA, Xu L, Horberg MA, Klein D, et al. HIV infection, immunodeficiency, viral replication, and the risk of cancer. Cancer Epidemiol Biomark Prev. 2011;20:2551–9.

    Article  CAS  Google Scholar 

  6. Crum-Cianflone N, Hullsiek KH, Marconi V, Weintrob A, Ganesan A, Barthel RV, et al. Trends in the incidence of cancers among HIV-infected persons and the impact of antiretroviral therapy: a 20-year cohort study. AIDS. 2009;23:41–50.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Albini L, Calabresi A, Gotti D, Ferraresi A, Festa A, Donato F, et al. Burden of non-AIDS-defining and non-virus-related cancers among HIV-infected patients in the combined antiretroviral therapy era. AIDS Res Hum Retrovir. 2013;29:1097–104.

    Article  CAS  PubMed  Google Scholar 

  8. Grulich AE, van Leeuwen MT, Falster MO, Vajdic CM. Incidence of cancers in people with HIV/AIDS compared with immunosuppressed transplant recipients: a meta-analysis. Lancet. 2007;370:59–67.

    Article  PubMed  Google Scholar 

  9. Bedimo RJ, McGinnis KA, Dunlap M, Rodriguez-Barradas MC, Justice AC. Incidence of non-AIDS-defining malignancies in HIV-infected versus noninfected patients in the HAART era: impact of immunosuppression. J Acquir Immune Defic Syndr. 2009;52:203–8.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Dryden-Peterson S, Medhin H, Kebabonye-Pusoentsi M, Seage GR, Suneja G, Kayembe MKA, et al. Cancer incidence following expansion of HIV treatment in Botswana. PLoS One. 2015;10:1–13.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Rohner E, Valeri F, Maskew M, Prozesky H, Rabie H, Garone D, et al. Incidence rate of Kaposi sarcoma in HIV-infected patients on antiretroviral therapy in southern Africa. JAIDS J Acquir Immune Defic Syndr. 2014;67:547–54.

    Article  CAS  PubMed  Google Scholar 

  12. Mbulaiteye SM, Katabira ET, Wabinga H, Parkin DM, Virgo P, Ochai R, et al. Spectrum of cancers among HIV-infected persons in Africa: the Uganda AIDS-Cancer registry match study. Int J Cancer. 2006;118:985–90.

    Article  Google Scholar 

  13. Sengayi M, Babb C, Egger M, Urban MI. HIV testing and burden of HIV infection in black cancer patients in Johannesburg, South Africa: a cross-sectional study. BMC Cancer. 2015;15:144.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Sengayi MM, Kielkowski D, Egger M, Dreosti L, Bohlius J. Survival of patients with Kaposi’s sarcoma in the South African antiretroviral treatment era: a retrospective cohort study. South African Med J. 2017;107:871.

    Article  CAS  Google Scholar 

  15. Sengayi M, Chen W, Spoerri A, Singh E, Egger M. South African HIV cancer match study: a pilot study towards precision public health. Top Antivir Med. 2018;26(Supplement 1):281s–2s.

    Google Scholar 

  16. National Health Laboratory Service. National Health Laboratory Service | Who Are We? Accessed 4 Sep 2018.

  17. Blecher M, Kollipara A, DeJager P, Zulu N. Health Financing. In: Padarath A, English R, editors. South African Health Review. 10th edition. Health Systems Trust; 2011. Accessed 17 Feb 2018.

  18. Harpaz R, Callahan A, Tamang S, Low Y, Odgers D, Finlayson S, et al. Text Mining for Adverse Drug Events: the promise, challenges, and state of the art. Drug Saf. 2014;37:777–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Bouvard V, Baan R, Straif K, Grosse Y, Secretan B, Ghissassi F. El, et al. a review of human carcinogens—part B: biological agents. Lancet Oncol. 2009;10:321–2.

    Article  PubMed  Google Scholar 

  20. Newson RB. Attributable and unattributable risks and fractions and other scenario comparisons. Stata J. 2013;13:672–98. Accessed 4 Sep 2018.

    Article  Google Scholar 

  21. Stein L, Urban MI, O’Connell D, Yu XQ, Beral V, Newton R, et al. The spectrum of human immunodeficiency virus-associated cancers in a South African black population: results from a case-control study, 1995-2004. Int J Cancer. 2008;122:2260–5.

    Article  CAS  PubMed  Google Scholar 

  22. Southern African HIV Clinicians Society. Guidelines for adherence to antiretroviral therapy in adolescents and young adults (expanded version): Recommendations, resources and references. Johannesburg, South Africa; 2017. Accessed 6 Apr 2018.

  23. South Africa National Department of Health. South Africa Antiretroviral Treatment Guidelines. 2004. Accessed 28 Jan 2019.

  24. Meyer-Rath G, Johnson LF, Pillay Y, Blecher M, Brennan AT, Long L, et al. Changing the south African national antiretroviral therapy guidelines: the role of cost modelling. PLoS One. 2017;12:e0186557.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Hernández-Ramírez RU, Shiels MS, Dubrow R, Engels EA. Cancer risk in HIV-infected people in the USA from 1996 to 2012: a population-based, registry-linkage study. Lancet HIV. 2017;4:e495–504.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Sitas F, Newton R. Kaposi’s sarcoma in South Africa. J Natl Cancer Inst Monogr. 2001:1–4.

    Article  Google Scholar 

  27. Amerson E, Woodruff CM, Forrestel A, Wenger M, McCalmont T, LeBoit P, et al. Accuracy of clinical suspicion and pathologic diagnosis of Kaposi sarcoma in East Africa. J Acquir Immune Defic Syndr. 2016;71:295–301.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Shiels MS, Pfeiffer RM, Gail MH, Hall HI, Li J, Chaturvedi AK, et al. Cancer burden in the HIV-infected population in the United States. J Natl Cancer Inst. 2011;103:753–62.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Sasco AJ, Jaquet A, Boidin E, Ekouevi DK, Thouillot F, Lemabec T, et al. The challenge of AIDS-related malignancies in sub-Saharan Africa. PLoS One. 2010;5:e8621.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Adler DH, Wallace M, Bennie T, Mrubata M, Abar B, Meiring TL, et al. Cervical dysplasia and high-risk human papillomavirus infections among HIV-infected and HIV-uninfected adolescent females in South Africa. Infect Dis Obstet Gynecol. 2014;2014:498048.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Chokunonga E, Borok MZ, Chirenje ZM, Nyakabau AM, Parkin DM. Trends in the incidence of cancer in the black population of Harare, Zimbabwe 1991-2010. Int J Cancer. 2013;133:721–9.

    Article  CAS  PubMed  Google Scholar 

  32. Crum-Cianflone N, Hullsiek KH, Satter E, Marconi V, Weintrob A, Ganesan A, et al. Cutaneous malignancies among HIV-infected persons. Arch Intern Med. 2009;169:1130.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Lanoy E, Costagliola D, Engels EA. Skin cancers associated with HIV infection and solid-organ transplantation among elderly adults. Int J Cancer. 2010;126:1724–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Bohlius J, Valeri F, Maskew M, Prozesky H, Garone D, Sengayi M, et al. Kaposi’s sarcoma in HIV-infected patients in South Africa: multicohort study in the antiretroviral therapy era. Int J Cancer. 2014;135:2644–52.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Dandapani SV, Eaton M, Thomas CR, Pagnini PG. HIV- positive anal cancer: an update for the clinician. J Gastrointest Oncol. 2010;1:34–44.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Wong EY, Jordan WC, Malebranche DJ, DeLaitsch LL, Abravanel R, Bermudez A, et al. HIV testing practices among black primary care physicians in the United States. BMC Public Health. 2013;13:96.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


The authors would like to thank all funders, the University of the Witwatersrand, the National Health Laboratory Service and the National Cancer Registry. This work forms part of a Master’s degree for one of the authors (Tafadzwa Dhokotera), with the University of the Witwatersrand.


This work was supported by grants from the U.S. Civilian Research & Development Foundation (CRDF Global), the NIH administrative supplement to Existing NIH Grants and Cooperative Agreements (Parent Admin Supplement) (The South African HIV Cancer Match Study; U01AI069924–09, PI Matthias Egger, co-PI Julia Bohlius) PEPFAR supplement (PI Matthias Egger) and the Swiss National Science Foundation (The South African HIV cancer Match Study, 320030_169967, PI Julia Bohlius). The contents are solely the responsibility of the authors and do not necessarily reflect the views of the funding bodies.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author upon reasonable request.

Author information

Authors and Affiliations



MS, ES and JB contributed towards the study design. TD contributed towards literature search, data analysis and drafting of first version of manuscript. ES and MS contributed towards data acquisition. AS contributed towards data linkage. VS contributed towards text mining of cancer pathology reports to assign HIV status. All authors contributed towards data interpretation and critical comments on the first and subsequent drafts of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Tafadzwa Dhokotera.

Ethics declarations

Ethics approval and consent to participate

Permission to use the routinely collected NHLS and NCR data was sought from the relevant authorities. Ethical approval to conduct the study was granted by the University of the Witwatersrand Human Research Ethics Committee [Ethics certificate numbers (SAM: M160944) and (BCAH: M171083)].

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dhokotera, T., Bohlius, J., Spoerri, A. et al. The burden of cancers associated with HIV in the South African public health sector, 2004–2014: a record linkage study. Infect Agents Cancer 14, 12 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: