You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
pe-parse is a principled, lightweight parser for windows portable executable files. It was created to assist in compiled program analysis, potentially of programs of unknown origins. This means that it should be resistant to malformed or maliciously crafted PE files, and it should support questions that analysis software would ask of an executable program container. For example, listing relocations, describing imports and exports, and supporting byte reads from virtual addresses as well as file offsets.
6
+
pe-parse is a principled, lightweight parser for windows portable executable files.
7
+
It was created to assist in compiled program analysis, potentially of programs of unknown origins.
8
+
This means that it should be resistant to malformed or maliciously crafted PE files, and it should
9
+
support questions that analysis software would ask of an executable program container.
10
+
For example, listing relocations, describing imports and exports, and supporting byte reads from
11
+
virtual addresses as well as file offsets.
7
12
8
13
pe-parse supports these use cases via a minimal API that provides methods for
9
14
* Opening and closing a PE file
@@ -15,20 +20,34 @@ pe-parse supports these use cases via a minimal API that provides methods for
15
20
* Reading bytes from specified virtual addresses
16
21
* Retrieving the program entry point
17
22
18
-
The interface is defined in `parser-library/parse.h`. The program in `dump-prog/dump.cpp` is an example of using the parser-library API to dump information about a PE file.
23
+
The interface is defined in `parser-library/parse.h`.
19
24
20
-
Internally, the parser-library uses a bounded buffer abstraction to access information stored in the PE file. This should help in constructing a sane parser that allows for detection of the use of bogus values in the PE that would result in out of bounds accesses of the input buffer. Once data is read from the file it is sanitized and placed in C++ STL containers of internal types.
25
+
The program in `dump-prog/dump.cpp` is an example of using the parser-library API to dump
26
+
information about a PE file.
27
+
28
+
Internally, the parser-library uses a bounded buffer abstraction to access information stored in
29
+
the PE file. This should help in constructing a sane parser that allows for detection of the use
30
+
of bogus values in the PE that would result in out of bounds accesses of the input buffer.
31
+
Once data is read from the file it is sanitized and placed in C++ STL containers of internal types.
32
+
33
+
pe-parse includes Python bindings via `pepy`, which can be installed via `pip`:
34
+
35
+
```bash
36
+
$ pip3 install pepy
37
+
```
38
+
39
+
More information about `pepy` can be found in its [README](./pepy/README.md).
40
+
41
+
## Dependencies
21
42
22
-
Dependencies
23
-
========
24
43
### CMake
25
44
* Debian/Ubuntu: `sudo apt-get install cmake`
26
45
* RedHat/Fedora: `sudo yum install cmake`
27
46
* OSX: `brew install cmake`
28
47
* Windows: Download the installer from the [CMake page](https://cmake.org/download/)
PE files that have a Resource section with strings for the Type are encoded in UTF-16, but that `std::string` expects UTF-8. Some cross-platform solution
48
-
is desired. You can let cmake choose one it finds in your build environment or you can choose one from the following options yourself and specify it with
49
-
the `-DUNICODE_LIBRARY` argument when generating the project files with cmake:
50
-
*`icu` (preferred) - "[ICU](http://site.icu-project.org/) is a mature, widely used set of C/C++ and Java libraries providing Unicode and Globalization support for software applications"
51
-
*`codecvt` - A C++ library header file ([now deprecated](http://open-std.org/JTC1/SC22/WG21/docs/papers/2017/p0618r0.html)) supported by some C++ runtimes
66
+
PE files that have a Resource section with strings for the Type are encoded in UTF-16, but that
67
+
`std::string` expects UTF-8. Some cross-platform solution is desired.
68
+
69
+
You can let `cmake` choose one it finds in your build environment or you can choose one from the
70
+
following options yourself and specify it with the `-DUNICODE_LIBRARY` argument when generating the
71
+
project files with `cmake`:
72
+
73
+
*`icu` (preferred) - "[ICU](http://site.icu-project.org/) is a mature, widely used set of C/C++
74
+
and Java libraries providing Unicode and Globalization support for software applications"
You can see a full example in the examples/peaddrconv folder.
82
112
83
-
Authors
84
-
=======
85
-
pe-parse was designed and implemented by Andrew Ruef ([email protected]), with significant contributions from [Wesley Shields](https://github.com/wxsBSD).
113
+
## Authors
114
+
115
+
pe-parse was designed and implemented by Andrew Ruef ([email protected]), with significant
116
+
contributions from [Wesley Shields](https://github.com/wxsBSD).
0 commit comments