Published Date: 18/08/2025
The use of artificial intelligence (AI) in healthcare has been on the rise, particularly in providing reliable and accessible information to patients. A recent study aimed to evaluate the accuracy of two AI chatbots, DeepSeek and ChatGPT, in answering frequently asked questions (FAQs) about cervical cancer. The study, conducted at Hunan Cancer Hospital, Xiangya School of Medicine, Central South University, Changsha, China, provides valuable insights into the potential of AI in public health education and patient support.
To compile a list of FAQs about cervical cancer, the researchers searched social media and community platforms. The answer keys for all selected questions were based on the cervical cancer guidelines of the National Comprehensive Cancer Network (NCCN), the International Federation of Gynecology and Obstetrics (FIGO), and the World Health Organization (WHO). The answers given by DeepSeek-R1 and ChatGPT o1 were scored according to the Global Quality Score (GQS), a five-point scale on which five is the highest rating.
The study included a total of 74 FAQs, covering a diverse range of topics related to cervical cancer. These topics were categorized into four main areas: diagnosis (16 questions), risk factors and epidemiology (19 questions), treatment (20 questions), and prevention (19 questions). Each question was carefully selected to ensure a comprehensive evaluation of the AI chatbots' performance.
When DeepSeek's answers to the FAQs about cervical cancer were evaluated according to the GQS, 68 answers received a score of five, 4 answers a score of four, and 2 answers a score of three. Of ChatGPT's responses to the same set of FAQs, 67 received a score of five, 6 a score of four, and 1 a score of three. The difference between the two chatbots was not statistically significant (P > 0.05).
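The comparison can be checked from the reported score counts alone. The article does not name the statistical test the authors used, so the sketch below assumes a Pearson chi-square test on the 2 x 3 table of counts. It is an illustration, not the study's method. Several expected cell counts are below five, so the chi-square approximation is rough here and an exact test may be more appropriate. The names `mean_score` and `chi_square_2xk` are hypothetical helpers introduced for this example.

```python
import math

# Score counts reported in the article, keyed by GQS score.
SCORES = (5, 4, 3)
deepseek = {5: 68, 4: 4, 3: 2}
chatgpt = {5: 67, 4: 6, 3: 1}


def mean_score(counts):
    """Average GQS score over all answers in one group."""
    total = sum(counts.values())
    return sum(score * n for score, n in counts.items()) / total


def chi_square_2xk(row_a, row_b, keys):
    """Pearson chi-square statistic for a 2 x k contingency table."""
    total_a = sum(row_a[k] for k in keys)
    total_b = sum(row_b[k] for k in keys)
    grand = total_a + total_b
    stat = 0.0
    for k in keys:
        col = row_a[k] + row_b[k]
        for observed, row_total in ((row_a[k], total_a), (row_b[k], total_b)):
            expected = row_total * col / grand
            stat += (observed - expected) ** 2 / expected
    return stat


chi2 = chi_square_2xk(deepseek, chatgpt, SCORES)

# Assumption: chi-square test with (2 - 1) * (3 - 1) = 2 degrees of freedom.
# For exactly 2 degrees of freedom the upper-tail probability is exp(-x / 2).
p_value = math.exp(-chi2 / 2)

print(f"mean GQS: DeepSeek {mean_score(deepseek):.3f}, ChatGPT {mean_score(chatgpt):.3f}")
print(f"chi-square = {chi2:.3f}, p = {p_value:.3f}")
```

Under these assumptions the p-value is well above 0.05, which agrees with the reported lack of a significant difference. The two mean scores also come out identical, because both sets of counts sum to the same total score.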
The results indicate that both DeepSeek and ChatGPT gave accurate and satisfactory responses to FAQs about cervical cancer when evaluated according to the GQS. However, the study also highlighted the need for caution, particularly on treatment questions. Although both chatbots performed well, treatment options are complex and vary from patient to patient, so chatbot answers should not be relied upon as the sole source of medical advice.
One notable advantage of DeepSeek over ChatGPT is its free availability. This makes DeepSeek more accessible in resource-limited scenarios, enhancing its utility for public health education and patient support. The free availability of DeepSeek can significantly benefit individuals who may not have access to premium AI services, thereby promoting equitable access to healthcare information.
In conclusion, the study provides valuable insights into the potential of AI chatbots in answering frequently asked questions about cervical cancer. Both DeepSeek and ChatGPT performed satisfactorily, with DeepSeek showing a slight edge in accessibility. However, it is crucial to maintain a cautious approach, especially when it comes to medical treatment advice. Further research and continuous improvement in AI chatbot technology can help enhance their accuracy and reliability in healthcare applications.
Hunan Cancer Hospital, where the study was conducted, is a leading institution in cancer research and treatment, committed to advancing healthcare through innovation and collaboration.
Q: What is the main objective of the study?
A: The main objective of the study was to compare the accuracy of DeepSeek and ChatGPT in answering frequently asked questions (FAQs) about cervical cancer.
Q: How were the FAQs for the study compiled?
A: The FAQs were compiled through a comprehensive search on social media and community platforms, and the answer keys were created based on guidelines from the NCCN, FIGO, and WHO.
Q: What were the main categories of questions in the study?
A: The main categories of questions were diagnosis, risk factors and epidemiology, treatment, and prevention.
Q: What were the key findings of the study?
A: Both DeepSeek and ChatGPT demonstrated accurate and satisfactory responses to FAQs about cervical cancer, with no statistically significant difference between the two. DeepSeek showed a slight edge in accessibility due to its free availability.
Q: What is the significance of the study's findings for healthcare?
A: The findings suggest that AI chatbots can be valuable tools in public health education and patient support, especially in resource-limited settings. However, a cautious approach is needed, particularly for medical treatment advice.