Support for DEXA with ImageComments tag containing XML #7

howff · 2024-05-22T21:33:16Z

ImageComments needs to be extracted so that it can be redacted.

Resolves #6, but please see all the comments in that issue to understand the implications.
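As a rough illustration of the kind of redaction involved, here is a minimal sketch that blanks out an `EXAM_DATE` value inside an XML string. It is not the code in this PR: the sample XML, the assumption that `EXAM_DATE` is an element rather than an attribute, and the helper name `redact_xml_elements` are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; the real XML written into ImageComments will differ.
SAMPLE = "<SCAN><EXAM_DATE>2024-05-22</EXAM_DATE><REGION>Spine</REGION></SCAN>"


def redact_xml_elements(xml_text, names=("EXAM_DATE",), fill="X"):
    """Replace the text of each named element with fill characters.

    The replacement has the same length as the original text, so character
    offsets of the surrounding content are unchanged.
    """
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        if elem.tag in names and elem.text:
            elem.text = fill * len(elem.text)
    # encoding="unicode" returns a str without an XML declaration
    return ET.tostring(root, encoding="unicode")


redacted = redact_xml_elements(SAMPLE)
print(redacted)
```

Keeping the redacted value the same length as the original is a design choice in this sketch, made so that any offsets computed against the original text would still line up.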

rkm

Hi, apologies for not reviewing this until now.

This is a necessary change to support DEXA extractions, and the PR looks good overall. My only suggested change is that we make this configurable somehow. It would be helpful to have the ability to enable/disable this if we decide this is a "safe" change for some SR extractions only.

howff · 2024-09-30T14:03:01Z

It is now optional, and the default is off, so ImageComments is not extracted by default.

The option is not exposed in the command line tools (yet); I'm not quite sure how you want to change configurable things like this on a per-extraction basis. (It's also a little more complex because the tools can read from MongoDB as well as from DICOM files, and the two have different internal representations of the SR structure, so they need different code to read.)
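To make the "off by default, opt in per extraction" idea concrete, here is a minimal sketch. The class `TagExtractionConfig`, its method names, and the dict-based dataset are hypothetical and are not the SmiServices API; the commit list mentions an `enableTag()` in DicomText.py, but its signature is not shown here and may well differ.

```python
class TagExtractionConfig:
    """Hypothetical holder for the set of tags to extract (illustration only)."""

    DEFAULT_TAGS = frozenset({"TextValue"})       # extracted unless changed
    OPTIONAL_TAGS = frozenset({"ImageComments"})  # off until explicitly enabled

    def __init__(self):
        self._enabled = set(self.DEFAULT_TAGS)

    def enable_tag(self, name):
        """Turn on extraction of an optional tag; reject unknown names."""
        if name not in self.OPTIONAL_TAGS:
            raise ValueError(f"unknown optional tag: {name}")
        self._enabled.add(name)

    def is_enabled(self, name):
        return name in self._enabled


def extract_text(dataset, config):
    """Return (tag, value) pairs for enabled tags in a dict-like dataset."""
    return [(tag, value) for tag, value in dataset.items() if config.is_enabled(tag)]


dataset = {"TextValue": "Report text", "ImageComments": "<SCAN/>"}

config = TagExtractionConfig()
default_result = extract_text(dataset, config)   # ImageComments is skipped

config.enable_tag("ImageComments")
opted_in_result = extract_text(dataset, config)  # ImageComments is included
```

A command line flag or per-extraction setting could call something like `enable_tag` before extraction starts, which would keep the default behaviour unchanged for existing extractions.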

howff added 2 commits May 22, 2024 22:28

CTP_SRAnonTool.sh - more verbose if required

590dcf0

SmiServices library - extract/redact ImageComments for DEXA scans,

6c42fef

because Lunar iDXA writes XML into that field which may include EXAM_DATE that needs to be redacted.

howff self-assigned this May 22, 2024

howff added 5 commits May 29, 2024 09:19

DicomText.py - don't hard-code TextValue and ImageComments,

d443090

use the 'redact' keys from sr_keys_to_extract

DicomText.py - better debug messages

2d60ae3

DicomText.py - don't hard-code a 32-char window to search each way,

4d9d606

use a variable so that it can be changed in future (should never need to be more than 8 I think, depends on how many extra linefeeds appear between sections)

DicomText.py - make text parsed during redaction the same as the way …

96de3d6

…SemEHR reconstructs the text from its working_fields so that its offsets match ours better

Merge branch 'main' into dexa

29083a0

howff requested a review from rkm May 29, 2024 14:29

rkm requested changes Jun 17, 2024

howff added 4 commits September 30, 2024 12:23

DicomText - use inline DICOM instead of external file for tests

ab8605f

StructuredReport.py - no ImageComments by default,

68c0391

and also add other Comments tags, also off by default

DicomText.py - add enableTag() to allow ImageComments to be enabled,

22ee9d5

and enhance all the tests to test ImageComments

Update doc

fa5bd0a
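Commit 4d9d606 above replaces a hard-coded 32-character search window with a variable. A minimal sketch of what such a windowed search might look like follows; the function `find_near`, its parameters, and the sample text are hypothetical and are not taken from DicomText.py.

```python
def find_near(text, needle, expected, window):
    """Return the index of the occurrence of needle closest to expected.

    Only occurrences that start within `window` characters either side of
    `expected` are considered. Returns -1 if there is none.
    """
    start = max(0, expected - window)
    # The end bound leaves room for a match that starts at expected + window.
    end = min(len(text), expected + window + len(needle))
    best = -1
    pos = text.find(needle, start, end)
    while pos != -1:
        if best == -1 or abs(pos - expected) < abs(best - expected):
            best = pos
        pos = text.find(needle, pos + 1, end)
    return best


# Hypothetical case: extra linefeeds shift the real position (12) away from
# the offset reported by an upstream annotator (9).
text = "Exam\n\n\nDate 01/02/2003 end"
wide = find_near(text, "01/02/2003", expected=9, window=8)
narrow = find_near(text, "01/02/2003", expected=9, window=2)
```

With the wider window the date is located despite the offset drift; with the narrow one it is missed, which is why the window size is worth keeping adjustable.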
