How to handle huge trace files #16

bobbingwide · 2015-12-21T09:31:20Z

Whilst it's not advisable to trace in a production system, it's possible to forget to disable tracing.
If a trace file is not being reset for each request, ie. if the trace file is being appended to, then the file can get very large indeed.

Something needs to be done about this since it can lead to the server returning HTTP error code 500.

Note: There could be a problem with the trace file being written to even when tracing is only configured for a specific IP address. I'm looking into it.

bobbingwide · 2018-01-29T10:35:35Z

One possible solution to this is to implement some cycling logic using generations and sets of trace files.
Where

a generation is a finite number of files with a particular file mask.
sets are used to create multiple file masks.

The current logic for file naming is:

bwtrace.loh is the default for 'normal' requests
The trace file name and extension can be specified. e.g. bwtraces.loh
AJAX requests can be traced to a different file e.g. bwtrace.ajax
batch requests may be traced to a different file name.

The new file mask would be of the form: path/filename.set.generation
where

path is optional and relative to ABSPATH
filename defaults to bwtrace
set defaults to the request type, but can be set per request type
generation is an integer from 1 to limit.
limit defaults to 100

The generation logic will depend on the value given for limit.

limit	Generation logic	Generation
blank/null	Not used.	null
0	Unlimited	.timestamp - from REQUEST_TIME_FLOAT
>0	Cycling	.generation

The cycling logic is intended to ensure that concurrent requests do not use the same trace file.

The generation number picked is the first missing from the series 1 ... limit.
If all generations are used then we pick the oldest file, based on file modification time.
The logic to determine the oldest file needs to be fairly be efficient.
The maximum value of limit should be a reasonable number.

Concurrently executed transactions may still choose the same file.
If this happens a lot then use the limit =0 option.
Trace file reset logic will apply to the selected trace file, except when the limit is 0.

reset	generation	Results in	Supports concurrency
n	n	One big file . i.e. the current situation.	Output mixed
n	y	Lots of ever increasing files	Should do
y	n	Trace file reset every transaction	Possible loss of data
y	y	Selected trace file reset	Should do

bobbingwide · 2018-01-29T10:46:00Z

This change should also take in account and/or help address.

bobbingwide · 2018-01-31T16:27:06Z

The trace admin page should be updated to reflect the generation logic

The trace file name will be wrong when the value of limit is set.
The reset capability of the [bwtrace] shortcode needs to be revisited.

…selector class

…ttings

…ST separately. Also, implement generation logic based on the value of limit.

…e generations

…ctored code

bobbingwide · 2018-04-21T17:48:20Z

Using the generation limit of 0 for REST calls allows concurrent tracing of the REST requests issued by the new block editor ( Gutenberg ). But it's not necessary for the standard browser, AJAX or batch ( CLI ) requests. So, I'm going to make 4 versions of the "limit" setting; one per request type. The Trace options page will be updated. screenshot-1.jpg reflects the latest version.

https://github.com/bobbingwide/oik-bwtrace/blob/23b773c69d108fee586cd05b6c6be9334e34a6fd/screenshot-1.jpg

Note: I haven't updated the PHPUnit tests yet.

bobbingwide · 2018-05-14T11:17:01Z

I noticed that trace reset is not working when the generation limit is not set.
The logic in trace_file-selector::reset_as_required() is incorrect.
The solution is to change the switch statement to a simple if statement.

bobbingwide · 2019-12-12T08:43:35Z

There is still the possibility that the trace output files could use up a lot of disk space. The oik trace admin section for files shows the total space used for the files. And there’s logic to purge the files. This is documented, so I believe it’s safe to close the issue.

bobbingwide mentioned this issue Dec 21, 2015

Tracing being performed when the specific ip does not match. #17

Closed

bobbingwide added the enhancement label Nov 1, 2016

bobbingwide self-assigned this Nov 1, 2016

bobbingwide added the Priority: A label Jan 29, 2018

bobbingwide mentioned this issue Jan 31, 2018

Add logic to limit tracing to CLI processing #58

Closed

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - start developing trace file generation logic

fa26759

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - switch to bw_trace_file2() which uses the new trace_file_…

09fb4c1

…selector class

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - Start improving solution and tests for different limit se…

1e2f4c5

…ttings

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - cater for a reduced generation limit

fa10bb2

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - support Batch trace file selection

6a1290a

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #52 - pre-detect REST API calls, issue #16 - support tracing RE…

cf88c87

…ST separately. Also, implement generation logic based on the value of limit.

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - update admin tests for new fields, en_GB and bb_BB

68f4cb0

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - start refactoring

c1bda02

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - continue refactoring. Implement reset logic for trace fil…

b701bda

…e generations

bobbingwide added a commit that referenced this issue Feb 1, 2018

Issue #16 - more refactoring. Ensure globals are still set for unrefa…

89ef463

…ctored code

bobbingwide added a commit that referenced this issue Feb 6, 2018

Issue #16 - correct initialisation sequence

7c602c0

bobbingwide added a commit that referenced this issue Feb 6, 2018

Issue #16 - when trace limit's 0 use REQUEST_TIME_FLOAT.

a17ef30

bobbingwide added a commit that referenced this issue Feb 7, 2018

Issue #16 - when getting trace_url cater for null values

7b06174

bobbingwide added a commit that referenced this issue Apr 21, 2018

Issue #16 - Allow different trace limits for each request type

23b773c

bobbingwide added a commit that referenced this issue May 14, 2018

Issue #16 - fix trace file reset when $this->limit is null

9b6d1a4

bobbingwide added a commit that referenced this issue Aug 17, 2019

Issue #16 - correct class trace_file_selector tests

fb455db

bobbingwide mentioned this issue Nov 29, 2019

Improve bw_trace_trace_startup() #61

Closed

bobbingwide closed this as completed Dec 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to handle huge trace files #16

How to handle huge trace files #16

bobbingwide commented Dec 21, 2015 •

edited

Loading

bobbingwide commented Jan 29, 2018 •

edited

Loading

bobbingwide commented Jan 29, 2018

bobbingwide commented Jan 31, 2018

bobbingwide commented Apr 21, 2018 •

edited

Loading

bobbingwide commented May 14, 2018

bobbingwide commented Dec 12, 2019

How to handle huge trace files #16

How to handle huge trace files #16

Comments

bobbingwide commented Dec 21, 2015 • edited Loading

bobbingwide commented Jan 29, 2018 • edited Loading

bobbingwide commented Jan 29, 2018

bobbingwide commented Jan 31, 2018

bobbingwide commented Apr 21, 2018 • edited Loading

bobbingwide commented May 14, 2018

bobbingwide commented Dec 12, 2019

bobbingwide commented Dec 21, 2015 •

edited

Loading

bobbingwide commented Jan 29, 2018 •

edited

Loading

bobbingwide commented Apr 21, 2018 •

edited

Loading