-
Notifications
You must be signed in to change notification settings - Fork 9
Add design history post: Image reading part 1 #472
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 6 commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
d186738
Create index.md
Laboni1Paul 453075d
Add image
Laboni1Paul be29ad3
Update index.md
Laboni1Paul 7fd2a19
Update index.md
Laboni1Paul 855d8b7
Rename Example performance factors.png to Example-performance-factors…
Laboni1Paul e0554b9
Updated index.md with new links
Laboni1Paul 997130e
Update app/breast-screening-reporting/2026/04/image-reading-part-1-da…
Laboni1Paul 35757ae
Update app/breast-screening-reporting/2026/04/image-reading-part-1-da…
Laboni1Paul 85f84b8
Update app/breast-screening-reporting/2026/04/image-reading-part-1-da…
Laboni1Paul 2d12eaf
Update app/breast-screening-reporting/2026/04/image-reading-part-1-da…
Laboni1Paul File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
Binary file added
BIN
+151 KB
...26/04/image-reading-part-1-data-and-performance/Example-performance-factors.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
120 changes: 120 additions & 0 deletions
120
...-screening-reporting/2026/04/image-reading-part-1-data-and-performance/index.md
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,120 @@ | ||
| --- | ||
| title: "Image reading part 1: what we learned about data and performance" | ||
| date: 2026-04-21 | ||
| description: "Before thinking about the future of image reading reporting, we looked at how it works today." | ||
| tags: | ||
| - breast screening | ||
| - image reading | ||
| - FRQA | ||
| - data | ||
| author: Alex Crowfoot | ||
| --- | ||
|
|
||
| For more information on the design sprint and our data infrastructure choices, read [this introduction](https://design-history.prevention-services.nhs.uk/breast-screening-reporting/2026/04/re-designing-image-reading-data-reporting-to-stress-test-the-reporting-roadmap/). | ||
|
|
||
| ## What we did | ||
|
|
||
| As part of our goal to replace the Breast Screening Information System (BSIS), we ran a design sprint to look at the Film Reading Quality Assurance (FRQA) report. Rather than creating a like-for-like replacement, we want to add insight and speed up data feedback where possible. | ||
|
|
||
| A range of stakeholders helped us understand the current state of image reading performance data, the context in which it is used, and the gaps and opportunities. | ||
|
|
||
| ## What is FRQA and what is it for? | ||
|
|
||
| Trained clinicians closely examine mammograms with the intention of identifying any cancer present, a process known as image reading. Data from this process is recorded in the National Breast Screening System (NBSS) to form the FRQA report. | ||
|
|
||
| FRQA is central to understanding how well breast screening units and individual readers are performing, and it is used to identify where improvement is needed. | ||
|
|
||
| This report is mostly used by Directors of Breast Screening, image readers and the Screening Quality Assurance Service (SQAS). | ||
|
|
||
| > [!NOTE] | ||
| > Use of the word ‘film’ dates from when mammograms were printed on film. They are now 100% digital, so we use the term ‘image reading’. We use the word ‘film’ only when referring to the existing FRQA report. | ||
|
|
||
| ## Where the data comes from and what BSIS does with it | ||
|
|
||
| The only way to create a national-level dataset is for each of the 75 Breast Screening Offices (BSOs) to run a report that extracts an Excel file from NBSS. These are then collected into a single dataset, either by an individual or by a software tool. In the case of annual FRQA reporting, each BSO uploads an Excel file into BSIS once a year. | ||
|
|
||
| BSIS then creates a series of tables, charts and graphs that cover a rolling 3 years of data. | ||
|
|
||
| In future there should be no need to continue manually extracting data from 75 separate systems, but until a new system is in place this is how we get the data we need. | ||
|
|
||
| ## Highlights from what we learned | ||
|
|
||
| ### Understanding performance can be like detective work | ||
|
|
||
| “How well are we reading images?” is a complex question to answer because there are so many variables at play. It is a bit like detective work, putting together clues to understand why a metric might look unusual. | ||
|
|
||
| Here are just a few of the variables that someone interpreting the data must consider: | ||
|
|
||
| - differences in the population screened, including possible skews in age, BMI and ethnicity | ||
| - what stage the reader is at in their career | ||
| - the skills mix in their team | ||
| - whether they have seen the other reader’s decision before recording their own | ||
| - how quickly they are reading cases | ||
|
|
||
|  | ||
|
|
||
| > [!NOTE] | ||
| > **Example** | ||
| > | ||
| > A reader may have a high recall rate. This could be because they are over-calling borderline cases, which is problematic because it risks causing unnecessary stress for participants. But a high recall rate could also be because of the physical realities of the population being screened. There may be a skew that increases recall rate in age, ethnicity, BMI or all 3. | ||
|
|
||
| Interpreting the data therefore requires knowledge of the programme and of the specific BSO context. | ||
|
|
||
| ### Performance is a team sport | ||
|
|
||
| Readers have different strengths, so to be sure that cancer is identified to a satisfactory standard, at least 2 individuals will read an image. It is the unit’s decision, not the individual reader’s, that affects the participant. Because the performance of others in the team may influence an individual’s metrics, we have to look at performance in the context of the unit. | ||
|
|
||
| > [!NOTE] | ||
| > **Example** | ||
| > | ||
| > Reader A’s data shows a higher-than-average discrepant cancer rate. This metric shows when Reader A did not identify a cancer but another reader did. However, Reader A’s performance may be perfectly acceptable. The reason for the high discrepancy is that Reader B always spots the cancer that no one else in the team can, thereby driving up Reader A’s discrepant cancer rate. | ||
|
|
||
| ### More timely data is not always usable data | ||
|
|
||
| There is a tension between the idea of getting an earlier heads-up on performance and the reality that some data is only statistically meaningful at higher volumes, and that takes time to accrue. Earlier data is not always an option. | ||
|
|
||
| ### We can speed up some data feedback | ||
|
|
||
| While some data cannot be used earlier, we may be able to find a reliable proxy for some metrics. | ||
|
|
||
| One example of this is the cancer detection rate. We currently wait until cancer surgery data is finalised to confirm whether a participant who was recalled had cancer. | ||
|
|
||
| This gives us certainty, but it also means a delay of between 6 and 30 months between screening and outcome data, limiting a reader’s ability to take timely action if needed. | ||
|
|
||
| Alternatively, core needle biopsies predict cancer with a reported 97% accuracy. Therefore, if we can source this data in the interim, it would give an earlier indication of how well cancer is being identified and enable more prompt action. | ||
|
|
||
| ### Saving time and adding insight for QA | ||
|
|
||
| There are several enhancements for QA that would: | ||
|
|
||
| - save the team time | ||
| - offer new insight | ||
| - improve usability | ||
|
|
||
| We expect to be able to speed up their work and, if we can source the data, deepen their insight. | ||
|
|
||
| ### We can make it easier to monitor, compare and manage performance | ||
|
|
||
| As well as improving on the slow data feedback already mentioned, Directors of Breast Screening would be supported by the ability to more easily monitor, compare and cross-check the performance of their team. | ||
|
|
||
| Also important is the role Directors of Breast Screening play in helping readers interpret their performance data and put it into context. New data tools need to support these conversations, not replace them. | ||
|
Laboni1Paul marked this conversation as resolved.
Outdated
|
||
|
|
||
| ### Image readers rely on others for data | ||
|
|
||
| When it comes to performance data, readers currently rely on their Director of Breast Screening to download the report from BSIS annually, share it with them, and help them interpret it and take the right action. | ||
|
Laboni1Paul marked this conversation as resolved.
Outdated
|
||
|
|
||
| They also do not have easy access to useful operational data such as progress towards reading targets. This is important because, to retain accreditation, they must read a minimum of 5,000 images per year, of which 1,500 must be first reads. Currently, they keep a rough tally in their heads because we cannot get the data out of the system in a timely way using current methods. | ||
|
|
||
| ## What’s next? | ||
|
|
||
| The current programme already works well in terms of identifying cancer, but there are barriers to overcome that would enable teams to work smarter. We believe we can: | ||
|
|
||
| - support teams to identify issues earlier, reliably | ||
| - speed up the ability to take corrective action | ||
| - remove unnecessary manual labour | ||
| - offer better insight | ||
| - offer easier oversight | ||
|
|
||
| Instead of using a one-size-fits-all FRQA for readers, Directors of Breast Screening and SQAS, we will consider different solutions to meet these different needs. | ||
|
Laboni1Paul marked this conversation as resolved.
Outdated
|
||
|
|
||
| We’ll explore these options in our next posts, which will focus on distinct user groups. | ||
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.