Is qsv dedup
stable?
#80
-
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 6 replies
-
@jqnatividad Is the behaviour of |
Beta Was this translation helpful? Give feedback.
-
It is now stable again with the release of 0.88.2. It became unstable when we parallelized the sort using With 0.88.2, dedup still does parallelized sort, but in a stable way using |
Beta Was this translation helpful? Give feedback.
@jqnatividad Is the behaviour of
qsv dedup
stable in terms of which rows get removed? My use case is that I have an ID, a bunch of other columns, and a timestamp, and I'd like to remove all but the latest entry for each ID. At the moment it seems that if I ensure that the CSV is ordered by timestamp (ascending) before adedup -s id
, this works — i.e. it's always keeping the last entry it sees for the ID. This doesn't seem to be documented, though, so I'm not sure if I can rely on it.