Design choice for timeless debugging #2730

Tim--- · 2021-02-05T20:27:43Z

Tim---
Feb 5, 2021

Hi !

I am looking to implement a Qira-like functionality for Ghidra to allow for "timeless" debugging.
The idea would be to open a Qira trace (or another format), and allow to go forward/backward in time to see the memory and registers (like the Ghidra Trace does for recorded sessions).
It should also be able to add annotations similar to XREF in the listing view, to see for each address where and when it was read/written/executed.

I hesitate between several ways to implement this:

create an external GADP server that reads the Qira trace, and can be used like any other debugger from Ghidra.
For now I have a PoC in Python that works, but it lacks the timeless-specific functionalities that I would need to implement in a new GADP Interface I guess (snap selection, XREF search).
try to convert the Qira traces to Ghidra traces.
I didn't look much into how Ghidra traces work, but I understand that it only takes snapshots of memory/registers at breakpoints.
Also, I guess it doesn't register incremental changes, but takes the content of whole pages.
This means that for each instruction, I would have to take a snapshot of all the memory/registers, instead of the list of changes. This would lead to huge files, and probably not allow an efficient XREF research.
create a new implementation of the Ghidra trace reader, that reads Qira traces.
This would allow me to reuse interesting components like the snap timeline.

So my question:
Has the subject of timeless debugging been tackled while designing the Ghidra debugger/GADP ?
What do you think would be the best approach for this ?

d-millar · 2021-02-05T22:54:59Z

d-millar
Feb 5, 2021
Collaborator

Hey Tim,

So, short answer to your question is yes. We've had numerous discussions regarding the topic and are very interested in efforts in this direction. So far, our principal focus has been on the Microsoft TTD extensions to WIndbg, available in the Preview editions. Right now, you can do a few things in this vein. You can load Windbg-generated traces into the debugger and traverse them as if you were debugging a live target. You can display various aspects of the trace in the different providers, e.g. the Memview Provider, via loader scripts. And you can populate the Ghidra trace database with the TTD data. The latter has proved very expensive, although we haven't tested in after several performance improvements to the main code.

Our intention was to investigate RR with the same goals. RR would be a natural choice as the RR traces would be readily available through the normal gdb interfaces using RR as the wrapped client in the Linux agent. That said, I think Qira would also be an obvious choice.

I haven't played with Qira as much as I have with QEMU. As the traces typically accessed using the gdb machine interface or extensions to the gdb command set. That would be ideal, as you could avoid writing a parser for the Qira trace format in Java or wrapping existing code with JNI or the like. Let us know what your use cases might be and how you envision the two trace sets (Ghidra/Qira) interacting, and we can give you specifics on best approach and some of the underlying infrastructure that might simplify development.

0 replies

Tim--- · 2021-02-06T19:24:40Z

Tim---
Feb 6, 2021
Author

Thank you for your response.

Owww, I stumbled upon TTD while doing some research, and I skipped it because it was on Windows.
But I didn't see that there was an implementation in Ghidra -- oops. I'll take a look at it to see what it looks like and how it works.

I looked a bit at RR and from what I understand, it basically records the syscall results, and replays the program deterministically using these results to get the same execution. It's a clever approach, but it doesn't offer a "must-have" feature in my opinion: getting the list of memory accesses for an address.
If you have to set a watchpoint and use continue/reverse-continue several times to find where an address was written... it's kind of sad.

That's why I focused more on Qira. The approach is crude (record all the read/writes for each instruction !), the traces are slower to record and heavier on disk, but I think it better fits reverse-engineering purposes. As you said, I'd like to avoid implementing it in Java. The traces are made of really small entries, and once parsed the whole thing is kept in a unordered_map<Address, map<SnapNum, Value>> that keeps track of each value read/written for all addresses. For large traces, that's two things that the JVM would probably not like.

Oh, I didn't think about interfacing at the GDB level. Adding a few custom GDB commands to get the extent of the trace, jump to a specific "snap", and optionally get the list of accesses at some memory address... This would probably be a small addition to RR, and possible to make an implementation based on the Qira traces. I'm probably missing a lot of gotchas, but is that what you had in mind ?

1 reply

d-millar Feb 7, 2021
Collaborator

Hmmmm, interesting - that helps. So, you might start by looking at the PopulateTraceLocal script in Debug/Debugger-agent-dbgmodel-traceloader. The scripts reads a TTD trace file and pulls select records into the Ghidra trace database. The commented-out code is actually most similar to your use case. I might also look at the PopulateMemviewLocal script which populates a time vs address-style display with similar info.

Both of these, obviously, use the TTD trace format and worse, in some ways, access the file through the API which is already implemented in the dbgmodel agent, but, if the records you're trying to access are simple enough and if parsing the file does not require parsing every record type, implementing similar functionality might be straightforward.

Also possibly relevant are the scripts written for Ghidra in Python/Jython. I think Qira has a Python interface (?) and using that in combination with a script might save you from writing a parser in Java. That said, keep in mind, the default Python interface is really Jython, not Python, so there may be issues if the Qira interface does things will C-types or more current features in Python. There are options for purely Python interaction but I'm not really the guy to talk to on that front and would have to do some asking around if you had issues.

Tim--- · 2021-02-09T21:43:21Z

Tim---
Feb 9, 2021
Author

Thanks again for your response, this is very helpful !

I took a quick look at the scripts you gave, and I confirm that this is probably what I was looking for. I will definitely start from here.

I tried to test the scripts on Windows to see the actual result but unfortunately had some trouble. I built Ghidra on Windows, made a TTD trace of notepad with WinDbg preview (notepad01.run), and tried to feed it into the scripts. But each time, I got an error (OpenDumpFileWide returns E_INVALIDARG). Idk if I should report a bug, or if it's just me being dumb ?

In qira, the parser is written in C++, and there is a Cython interface to expose it to the main application. But the file format is kind of dumb, so I guess I can just reimplement it in Java.

I'll give it a try and keep you updated.

0 replies

d-millar · 2021-02-09T23:51:44Z

d-millar
Feb 9, 2021
Collaborator

Admittedly, this is somewhat buried in the help files, but did you copy the Windbg Preview amd64 directory contents into the bin directory in the JDK? There's a path issue with hardcoded precedence in the JDK for its version of dbgeng and possibly some of the other files that's, well, truly annoying and for which we haven't figured out a workaround.

1 reply

Tim--- Feb 19, 2021
Author

Thanks for your workaround, it's not pretty but it worked ! I was able to play with the different dbgmodel scripts to see how the traces work.

So I started to implement an import script for the qira trace based on these. I made some progress: I can load the executable/libraries in memory, and parse the register/memory writes for each tick.
So now I can travel back and forth between the ticks and see the register and memory changes. Neat :).
I thought this would take more effort than it did, but the APIs are pretty easy to use !

However, the import is quite slow (~10 minutes for a trace of bash just starting), and leads to big files (~3GB for the same trace). I know that there's some things that I need to fix, but is it expected ? (I filled the tree with ~500k nodes)

10 minutes is not much but it looks like the time gets exponentially slow as the trace gets bigger. However, once it is imported, moving through the ticks is blazing fast, and reopening the same trace takes only a few seconds.

I've put the code here for now, if you want to take a look.
I have more work to do but it's a good start I think.

d-millar · 2021-02-19T23:11:49Z

d-millar
Feb 19, 2021
Collaborator

Oh, wow - that's very exciting. Very happy you made progress so quickly. End of my day, but will definitely take a look at your code when next I get a chance.

Re speed, good question. Right now, we're hashing through a bunch of performance issues in the debugger proper. Whether fixes to these will help you out depends, I guess, on where the bottle necks are. The MemoryViewer could definitely have exponential issues as it reparses the sets on adds. It sort of has to because it's using relative position rather than absolute time or address to pack the display. That said, I am not an algorithm guru - am sure we can find some folks with ideas for increasing the speed.

0 replies

westurner · 2024-08-23T00:35:11Z

westurner
Aug 23, 2024

Additional time traveling debuggers w/ QEMU, Bochs, Hypervisors FWIW: - rr - https://news.ycombinator.com/item?id=41285518 - https://github.com/gamozolabs/applepie - https://github.com/gamozolabs/orange_slice - https://github.com/MarginResearch/cannoli : It consists of a small patch to QEMU to expose locations to inject some code directly into the JIT, a shared library which is loaded into QEMU to decide what and how to instrument, and a final library which consumes the stream produced by QEMU in another process, where analysis can be done on the trace. Cannoli is designed to record this information with minimum interference of QEMU's execution. In practice, this means that QEMU needs to produce a stream of events, and hand them off (very quickly) to another process to handle more complex analysis of them. Doing the analysis during execution of the QEMU JIT itself would dramatically slow down execution. Cannoli can handle billions of target instructions per second, can handle multi-threaded qemu-user applications, and allows multiple threads to consume the data from a single QEMU thread to parallelize processing of traces. - Qira: https://github.com/geohot/qira

…

On Thu, Aug 22, 2024, 10:38 AM Dan ***@***.***> wrote: Closed #2730 <#2730> as resolved. — Reply to this email directly, view it on GitHub <#2730>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAMNS74VCNOWLR6IWMR7ZTZSXZW7AVCNFSM6AAAAABM6JT7DCVHI2DSMVQWIX3LMV45UABFIRUXGY3VONZWS33OIV3GK3TUHI5E433UNFTGSY3BORUW63R3GE2DMMRVGM4Q> . You are receiving this because you are subscribed to this thread.Message ID: <NationalSecurityAgency/ghidra/repo-discussions/2730/discussion_event/1462539 @github.com>

1 reply

nsadeveloper789 Aug 23, 2024
Maintainer

Thanks for the additional information. I had closed this discussion as it was aging. I left a similar discussion open, where I'd like to keep these things consolidated: #4051.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Design choice for timeless debugging #2730

{{title}}

Replies: 6 comments 3 replies

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Design choice for timeless debugging #2730

Tim--- Feb 5, 2021

Replies: 6 comments · 3 replies

d-millar Feb 5, 2021 Collaborator

Tim--- Feb 6, 2021 Author

d-millar Feb 7, 2021 Collaborator

Tim--- Feb 9, 2021 Author

d-millar Feb 9, 2021 Collaborator

Tim--- Feb 19, 2021 Author

d-millar Feb 19, 2021 Collaborator

westurner Aug 23, 2024

nsadeveloper789 Aug 23, 2024 Maintainer

Tim---
Feb 5, 2021

Replies: 6 comments 3 replies

d-millar
Feb 5, 2021
Collaborator

Tim---
Feb 6, 2021
Author

d-millar Feb 7, 2021
Collaborator

Tim---
Feb 9, 2021
Author

d-millar
Feb 9, 2021
Collaborator

Tim--- Feb 19, 2021
Author

d-millar
Feb 19, 2021
Collaborator

westurner
Aug 23, 2024

nsadeveloper789 Aug 23, 2024
Maintainer