Releases · wfondrie/depthcharge

10 May 08:13

wfondrie

v0.4.8

486221c

Depthcharge v0.4.8 Latest

Latest

[v0.4.8]

Changed

Tokenizer.detokenize() now truncates the output to the first stop token it finds, if trim_stop_token=True.

Assets 2

09 May 23:20

wfondrie

v0.4.7

1b53a35

Depthcharge v0.4.7

[v0.4.7]

Fixed

Add stop and start tokens for AnnotatedSpectrumDataset, when available.
When reverse is used for the PeptideTokenizer, automatically reverse the decoded peptide.

Assets 2

08 May 06:00

wfondrie

v0.4.6

3ca2297

Depthcharge v0.4.6

[v0.4.6]

Added

Added support for unsigned modification masses that don't quite conform to the Proforma standard.

Assets 2

30 Apr 18:08

wfondrie

v0.4.5

8519369

Depthcharge v0.4.5

Changed

The scan_id column for parsed spectra is not a sting instead of an integer. This is less space efficient, but we ran into issues with Sciex indexing when trying to use only an integer.

Assets 2

29 Apr 22:19

wfondrie

v0.4.4

b8be2e2

Depthcharge v0.4.4

Changed

Partially revert length changes to SpectrumDataset and AnnotatedSpectrumDataset. We removed __len__ from both due to problems with PyTorch Lightning compatibility.
Simplify dataset code by removing redundancy with lance.pytorch.LanceDatset.
Improved warning message for skipped spectra.

Assets 2

26 Apr 06:28

wfondrie

v0.4.3

15d52f4

Depthcharge v0.4.3

Changed

Length of the SpectrumDataset and AnnotatedSpectrumDataset now reflect the samples parameter of the lance.pytorch.LanceDataset parent class.

Assets 2

25 Apr 06:27

wfondrie

v0.4.2

35bf3e7

Depthcharge v0.4.2

Changed

The length of SpectrumDataset and AnnotatedSpectrumDataset is now the number of batches, not the number of spectra. This let's tools like PyTorch Lighting create their progress bars properly.
Parsing a dataset now no longer requires reading essentially the whole first file. Now the schema is inferred from the first 128 spectra.

Assets 2

19 Apr 22:23

wfondrie

v0.4.1

d46adf1

Depthcharge v0.4.1

Added

Significant updates to documentation. Add how to model mass spectra.
Reading and writing from cloud storage on everything!

Changed

Migrated to Mike for mkdocs to manage multiple versions.
Moved test GitHub Action from pip to uv.

Assets 2

17 Apr 20:22

wfondrie

v0.4.0

98035ec

Depthcharge v0.4.0

We have completely reworked of the data module.
Depthcharge now uses Apache Arrow-based formats instead of HDF5; spectra are converted either Parquet or streamed with PyArrow, optionally into Lance datasets.

We now also have full support for small molecules, with the MoleculeTokenizer,
AnalyteTransformerEncoder, and AnalyteTransformerDecoder classes.

Breaking Changes

PeptideTransformer* are now AnalyteTransformer*, providing full support for small molecule analytes. Additionally the interface has been completely reworked.
Mass spectrometry data parsers now function as iterators, yielding batches of spectra as pyarrow.RecordBatch objects.
Parsers can now be told to read arbitrary fields from their respective file formats with the custom_fields parameter.
The parsing functionality of SpctrumDataset and its subclasses have been moved to the spectra_to_* functions in the data module.
SpectrumDataset and its subclasses now return dictionaries of data rather than a tuple of data. This allows us to incorporate arbitrary additional data
SpectrumDataset and its subclasses are now lance.torch.data.LanceDataset subclasses, providing native PyTorch integration.
All dataset classes now do not have a loader() method.

Added

Support for small molecules.
Added the StreamingSpectrumDataset for fast inference.
Added spectra_to_df, spectra_to_df, spectra_to_stream to the depthcharge.data module.

Changed

Determining the mass spectrometry data file format is now less fragile.
It now looks for known line contents, rather than relying on the extension.

Assets 2

19 Aug 04:02

wfondrie

v0.3.1

c18fa1c

depthcharge v0.3.1

[v0.3.1] - 2023-08-18

Added

Support for fine-tuning the wavelengths used for encoding floating point numbers like m/z and intensity to the FloatEncoder and PeakEncoder.

Fixed

The tgt_mask in the PeptideTransformerDecoder was the incorrect type.
Now it is bool as it should be.
Thanks @justin-a-sanders!

Contributors

justin-a-sanders

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v0.4.8]

Changed

[v0.4.7]

Fixed

[v0.4.6]

Added

Changed

Changed

Changed

Changed

Added

Changed

Breaking Changes

Added

Changed

[v0.3.1] - 2023-08-18

Added

Fixed

Contributors

Releases: wfondrie/depthcharge

Depthcharge v0.4.8

[v0.4.8]

Changed

Depthcharge v0.4.7

[v0.4.7]

Fixed

Depthcharge v0.4.6

[v0.4.6]

Added

Depthcharge v0.4.5

Changed

Depthcharge v0.4.4

Changed

Depthcharge v0.4.3

Changed

Depthcharge v0.4.2

Changed

Depthcharge v0.4.1

Added

Changed

Depthcharge v0.4.0

Breaking Changes

Added

Changed

depthcharge v0.3.1

[v0.3.1] - 2023-08-18

Added

Fixed

Contributors