A Health-Focused Risk Taxonomy for AI: Assessing Unsafe Content Detection with Small Language Models (SLMs)

Hewitt, L; Tamimi, AKA; Copeland, Robert; Moore, R; Jhanji, S

A Health-Focused Risk Taxonomy for AI: Assessing Unsafe Content Detection with Small Language Models (SLMs)

Tools

HEWITT, L, TAMIMI, AKA, COPELAND, Robert, MOORE, R and JHANJI, S (2025). A Health-Focused Risk Taxonomy for AI: Assessing Unsafe Content Detection with Small Language Models (SLMs). Ceur Workshop Proceedings, 3985, 1-9. [Article]

[+][-]

Documents

36001:1001950

[+][-]

36001:1001950

[thumbnail of Copeland-AHealth-FocusedRisk(VoR).pdf]

Preview

PDF
Copeland-AHealth-FocusedRisk(VoR).pdf - Published Version
Available under License Creative Commons Attribution.

Download (321kB) | Preview

Abstract

Large Language Models (LLMs) show promise in healthcare. To make the most of this technology, there is a need to address concerns about computational demands and privacy. Small Language Models (SLMs) offer a privacy-preserving alternative for specialised medical applications due to their lower resource needs and potential for local deployment. This paper examines existing LLM safeguarding frameworks and introduces a novel, health-focused risk taxonomy developed through literature review and co-design with healthcare professionals. Furthermore, the ability of 6 SLMs to detect unsafe content using 2 additional risk taxonomies is evaluated and compared. The 8b-parameter Granite Guardian model showed superior adaptation to the novel risk taxonomy (75% accuracy) even without fine-tuning, representing a promising direction for safe and reliable applications of SLMs in clinical settings.

More Information

Official URL:

https://ceur-ws.org/Vol-3985/

Open Access URL:

https://ceur-ws.org/Vol-3985/paper1.pdf

Open Access Version:

Published version

Additional Information:

SLM4Health 2025 Improving Healthcare with Small Language Models Proceedings of the First International Workshop on Improving Healthcare with Small Language Models, SLM4Health 2025 co-located with 23rd International Conference on Artificial Intelligence in Medicine (AIME 2025)

Uncontrolled Keywords:

4609 Information systems

Page Range:

1-9

Identifiers

ORCID for L Hewitt:

orcid.org/0009-0000-6459-8695

ORCID for AKA Tamimi:

orcid.org/0000-0003-2459-0298

ORCID for Robert Copeland:

orcid.org/0000-0002-4147-5876

ORCID for R Moore:

orcid.org/0000-0002-8865-6746

ORCID for S Jhanji:

orcid.org/0000-0002-1116-628X

Library

Item Type:

Article

SWORD Depositor:

Symplectic Elements

Depositing User:

Symplectic Elements

Date record made live:

04 Aug 2025 10:38

Last Modified:

04 Aug 2025 11:00

Date of first compliant deposit:

4 August 2025

Date of first compliant Open Access:

4 August 2025

Version of first compliant deposit:

Version of Record

URI:

https://shura.shu.ac.uk/id/eprint/36001

Statistics

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

View Item

Sheffield Hallam University Research Archive

A Health-Focused Risk Taxonomy for AI: Assessing Unsafe Content Detection with Small Language Models (SLMs)

Downloads

Actions (login required)

Sheffield Hallam University

City Campus, Howard Street

Sheffield S1 1WB

Sheffield Hallam University Research Archive

Contact us: shura@shu.ac.uk

Research at SHU

SHU Library