Solutions

DrugVoice unlocks the authentic voice of Healthcare Professionals and Patients

Find Out More Button

PeopleVoice turns unstructured Employee data into strategic intelligence

Find Out More Button

TMLabs

About TMLabs

TMLabs is our in-house Centre of Excellence for Data Science for Life Sciences – where we train, test, and refine proprietary models purpose-built to decode real-world health dialogue at scale

About Us Button

Scientific Publications

Our Scientific Publications showcase the rigorous methodologies and validated outcomes behind our Data Science – demonstrating the impact of Talking Medicines Predictive Intelligence in peer-reviewed research

See Publications Button

Resources

Blogs

Our Blogs share insights at the intersection of data science, life sciences, and real-world health, covering trends, thought leadership, and innovation from the TM team

$

The Talking Room

Discover how The Talking Room demystifies AI, LLMs, and Machine Learning, showcasing data stories and expert insights that transform Patient and HCP conversations into actionable intelligence

$

Compliance Hub

The Compliance Hub outlines our commitment to data integrity, ethical AI, and regulatory standards, ensuring our intelligence is accurate, safe, and fully compliant

$

ESG

Our ESG principles guide how we operate, driving responsible innovation, and reducing environmental impact through ethical operating and data practices

$
Classifying Patient and Professional Voice in Social Media Health Posts

Sign Up to our Newsletter

Stay ahead of the curve. Get expert insights, practical updates, and cutting-edge perspectives on how Advanced Data Science is transforming HCP and Patient intelligence in Pharma Marketing.

Subscribe on LinkedIn
We continue to contribute toward the scientific validation of analysing social media to listen to the Voice of the Patient. We are advancing knowledge in this field through AI. Read a summary of the publication below put together by Talking Medicines.

Background

Patient-based analysis of social media is a growing research field with the aim of delivering precision medicine but it requires accurate classification of posts relating to patients’ experiences. We motivate the need for this type of classification as a pre-processing step for further analysis of social media data in the context of related work in this area. In this paper we present experiments for a three-way document classification by patient voice, professional voice or other. We present results for a Convolutional Neural Network classifier trained on English data from two different data sources (Reddit and Twitter) and two domains (cardiovascular and skin diseases).

Conclusion

The main conclusion resulting from this work is that using more data for training a classifier does not necessarily result in best possible performance. In the context of classifying social media posts by patient and professional voice, we showed that it is best to train separate models per data source (Reddit and Twitter) instead of a model using the combined training data from both sources. We also found that it is preferable to train separate models per domain (cardiovascular and skin) while showing that the difference to the combined model is only minor (0.01 accuracy). Our highest overall F1-score (0.95) obtained for classifying posts as patient voice is a very good starting point for further analysis of social media data reflecting the experience of patients.

Acknowledgements

We would like to thank the Talking Medicines Limited annotators who worked extremely hard to create the data needed for model training and validation.

To read the full article follow the link below:

Research Square Article

 

Authors: Beatrice Alex, Donald Whyte, Daniel Duma, Roma English Owen and Elizabeth A.L. Fairley.

 

Read More

#
$