Talking Medicines Academic Paper 1 : Classifying Patient Voices

Solutions

DrugVoice unlocks the authentic voice of Healthcare Professionals and Patients

Find Out More Button

PeopleVoice turns unstructured Employee data into strategic intelligence

Find Out More Button

DrugVoice

PeopleVoice

About Us

TMLabs is our in-house Centre of Excellence for Data Science for Life Sciences – where we train, test, and refine proprietary models purpose-built to decode real-world health dialogue at scale

About Us Button

Articles & Scientific Publications

Our Articles & Scientific Publications showcase the rigorous methodologies and validated outcomes behind our Data Science – demonstrating the impact of Talking Medicines Predictive Intelligence in peer-reviewed research

See Publications Button

About TMLabs

Scientific Publications

Resources

Blogs

Our Blogs share insights at the intersection of data science, life sciences, and real-world health, covering trends, thought leadership, and innovation from the TM team

The Talking Room

Discover how The Talking Room demystifies AI, LLMs, and Machine Learning, showcasing data stories and expert insights that transform Patient and HCP conversations into actionable intelligence

Compliance Hub

The Compliance Hub outlines our commitment to data integrity, ethical AI, and regulatory standards, ensuring our intelligence is accurate, safe, and fully compliant

ESG

Our ESG principles guide how we operate, driving responsible innovation, and reducing environmental impact through ethical operating and data practices

Talking Medicines Academic Paper 1 : Classifying Patient Voices

The team at Talking Medicines are delighted to have published an academic paper on the methodologies behind the classification of the patient voice by Alex et al. BMC Med Inform Decis Mak (2021) 21:244. This is part of the R&D work that has been completed in the development of the cutting edge commercial platform PatientMetRx® offering Pharma the opportunity to access patient confidence score by medicine brand.

Abstract Background:
Patient-based analysis of social media is a growing research field with the aim of delivering precision medicine but it requires accurate classification of posts relating to patients’ experiences. We motivate the need for this type of classification as a pre-processing step for further analysis of social media data in the context of related work in this area. In this paper we present experiments for a three-way document classification by patient voice, professional voice or other. We present results for a convolutional neural network classifier trained on English data from two different data sources (Reddit and Twitter) and two domains (cardiovascular and skin diseases). Results: We found that document classification by patient voice, professional voice or other can be done consistently manually (0.92 accuracy). Annotators agreed roughly equally for each domain (cardiovascular and skin) but they agreed more when annotating Reddit posts compared to Twitter posts. Best classification performance was obtained when training two separate classifiers for each data source, one for Reddit and one for Twitter posts, when evaluating on in-source test data for both test sets combined with an overall accuracy of 0.95 (and macro-average F1 of 0.92) and an F1-score of 0.95 for patient voice only. Conclusion: The main conclusion resulting from this work is that combining social media data from platforms with different characteristics for training a patient and professional voice classifier does not result in best possible performance. We showed that it is best to train separate models per data source (Reddit and Twitter) instead of a model using the combined training data from both sources. We also found that it is preferable to train separate models per domain (cardiovascular and skin) while showing that the difference to the combined model is only minor (0.01 accuracy). Our highest overall F1-score (0.95) obtained for classifying posts as patient voice is a very good starting point for further analysis of social media data reflecting the experience of patients. Keywords: Patient voice, Professional voice, Social media, Classification, Reddit, Twitter.

To read the full article please go to the online publication

Sign Up to Stay Ahead of Message Impact

Discover how Pharma marketeers are finally measuring which messages change HCP behavior. Our newsletter shares evidence-led insights powered by DrugVoice and the Message Resonance Score™ so you can predict and prove message impact—before prescriptions are written.

Subscribe on LinkedIn