Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

will there for assembly soon? #21

Open
mbrmaker opened this issue Jul 28, 2024 · 3 comments
Open

will there for assembly soon? #21

mbrmaker opened this issue Jul 28, 2024 · 3 comments

Comments

@mbrmaker
Copy link

I would like to know if it will soon be possible to convert a .bin (from NASM) to .asm source code with your AI.

@albertan017
Copy link
Owner

Thanks for your interest. However, our work exclusively concentrates on decompilation on top of asm. Future projects will also proceed on this basis, utilizing assembly code that has been disassembled by tools like objdump or other disassemblers.

@mbrmaker
Copy link
Author

but is it really possible?

@albertan017
Copy link
Owner

albertan017 commented Aug 12, 2024

Compared to decompiling, disassembling is a more established field. Therefore, we only focus on decompilation. The current LLM4Decompile has demonstrated impressive performance on the HumanEval benchmark and shows promise with real data. Our objective is to enhance its practical application using methods such as scaling up training, retrieval-augmented generation, and additional techniques.

For obfuscated or protected binaries, we do not consider language models like LLM to be an effective solution because the costs associated with training typically outweigh the benefits derived from decompiling or disassembling the code, considering the wide range of possible obfuscation techniques.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants