Interview: Xia Wang, AstraZeneca on Unraveling Patient Treatment Journey by NLP on Clinical Notes
We discuss Analytics at AstraZeneca, prominent use cases, how NLP helped understanding patient treatment journey in diabetes, data sources, insights, and more.

Prior to stepping into the clinical domain, Xia was with the AstraZeneca the innovational medicines unit, focused on the areas of informatics and computational modeling to accelerate candidate drug identification and optimization in the early discovery phases. Xia holds a Ph.D. in computational chemistry and has extensive training in broad areas of Informatics.
Here is my interview with her:
Anmol Rajpurohit Q1. Can you describe your role at AstraZeneca? What are the prominent use cases of Analytics in Medicine Development?

My role as a Principal Informatician always starts with understanding the key challenges in the life cycle of the development of a medicine. This involves focusing on identifying the most relevant data and implementing the most appropriate analytical methodologies, in order to reveal the insights and make the decisions that can meet those challenges.
In my opinion, the most prominent use cases of Analytics here are those that show that new medicines are effective, safe and prescribed to the right patient. For example, the analytics of clinical trials data to obtain the approval of a new medicine, the comparative effectiveness analytics to compare benefit of existing treatment in clinical practice versus a newly approved medicine, and the analytics of patient populations to identify the common characteristics of a group of patients who can benefit the most from a certain medicine.
AR: Q2. What was the motivation behind the pilot work on utilizing Natural Language Processing (NLP) to explore the understanding of patient treatment journey in diabetes?

We believe the newly available and de-identified digital patient charts which contain clinical documentation directly from physicians are promising to help address these challenges. This pilot focused to address two fundamental questions: 1) whether we can identify the events of interest along the diabetic patient treatment journey from the clinical note sources and 2) the evaluation of NLP technology in retrieving this meaningful information.
AR: Q3. What were the key sources of data for your study? Were there any data sets that were desired but not accessible?

Ideally for this pilot we would like to have included the de-identified electronic health records (EHR) as a more structured data format that can be linked with the clinical documentation. However, the longitudinal coverage that such structured data provides was less desirable at the time this study was conducted.
AR: Q4. What were the key insights from this research? Any recommendations or future directions?

At the same time, the limitations of the type and extent of information captured in clinical notes was observed. So in the future the longitudinal coverage of patient life plus integration of structured data from clinical notes are essential to provide a comprehensive picture of treatment journey for diabetic patients.
Second part of the interview
Related: