Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do you take struct into consideration? #8

Open
XinyuShe opened this issue Mar 21, 2024 · 3 comments
Open

Do you take struct into consideration? #8

XinyuShe opened this issue Mar 21, 2024 · 3 comments

Comments

@XinyuShe
Copy link

XinyuShe commented Mar 21, 2024

Do you take struct into consideration?
And how do you handle the issue of excessively long functions in assembly code?

@albertan017
Copy link
Owner

No, currently we only consider a single function.

Gathering data and developing a workable approach for decompiling complex files with multiple functions and structures is quite demanding. Therefore, this initial version of LLM4Decompile is limited to decompilation of individual functions.

Addressing the complexities posed by external functions and struct definitions is a primary focus of our future decompilation efforts. Our team is actively working on strategies to address these issues. While the nature of the problem maybe ill-posed, a larger and more varied training dataset will allow the model to make statistical guesses about the potential functions and types that correspond to the missing pieces. We'll report the results asap!

@XinyuShe
Copy link
Author

@albertan017 Thanks for your reply!
I am also wondering where did you find those c file datasets without structs and long function?

@albertan017
Copy link
Owner

@albertan017 Thanks for your reply! I am also wondering where did you find those c file datasets without structs and long function?

We remove those parts in Anghabench for simplification. The original dataset is available here. But the dataset is only compilable, not linkable. Therefore, we are looking for other benchmarks and collecting our own data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants