This needs to integrate into how visits are written, so that each visit has an original and a clean version with metadata, in preparation for distributing the data to downstream consumers via magnet
In other words, connecting magnet today would work, but we would only have visits data for non-cleaned files
This also creates an opportunity to add things like OCR or image captioning and to flesh out hygiene so it truly handles all common media (SEC filings often contain images such as corporate logos or brochures), but that is a separate project discussion
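A rough sketch of what "original and clean version with metadata" could look like at write time. Everything here is an assumption for discussion: the `write_visit` function, the `VisitMetadata` fields, the on-disk layout, and the `collapse_whitespace` stand-in for a real hygiene step are all hypothetical, and nothing below reflects how visits or magnet actually work today.

```python
import hashlib
import json
import tempfile
from dataclasses import asdict, dataclass
from pathlib import Path
from typing import Callable, List


@dataclass
class VisitMetadata:
    # Hypothetical fields; the real metadata schema is still to be decided.
    visit_id: str
    original_sha256: str
    clean_sha256: str
    hygiene_steps: List[str]


def collapse_whitespace(data: bytes) -> bytes:
    # Stand-in for a real hygiene step: collapse runs of ASCII whitespace.
    return b" ".join(data.split())


def write_visit(
    root: Path,
    visit_id: str,
    original: bytes,
    clean_fn: Callable[[bytes], bytes],
    steps: List[str],
) -> VisitMetadata:
    """Write the original and cleaned payloads side by side, plus metadata."""
    visit_dir = root / visit_id
    visit_dir.mkdir(parents=True, exist_ok=True)

    cleaned = clean_fn(original)
    (visit_dir / "original").write_bytes(original)  # untouched copy
    (visit_dir / "clean").write_bytes(cleaned)      # hygiene output

    meta = VisitMetadata(
        visit_id=visit_id,
        original_sha256=hashlib.sha256(original).hexdigest(),
        clean_sha256=hashlib.sha256(cleaned).hexdigest(),
        hygiene_steps=steps,
    )
    (visit_dir / "metadata.json").write_text(json.dumps(asdict(meta), indent=2))
    return meta


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        meta = write_visit(
            Path(tmp), "visit-001", b"Form  10-K\n\n filing",
            collapse_whitespace, ["collapse_whitespace"],
        )
        print(meta)
```

Keeping both copies next to a metadata record means a consumer could pick either version and see which hygiene steps produced the clean one; later steps such as OCR would just be additional entries in `hygiene_steps`.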