Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BinExport wrongly identifies some functions as IMPORTED #90

Open
patacca opened this issue Jun 22, 2022 · 11 comments
Open

BinExport wrongly identifies some functions as IMPORTED #90

patacca opened this issue Jun 22, 2022 · 11 comments
Labels
bug Something isn't working

Comments

@patacca
Copy link

patacca commented Jun 22, 2022

Consider the executable ntoskrnl.exe[1] (official version of ntoskrnl.exe)
After exporting it with BinExport on IDA you will find that the function FsRtlMdlReadCompleteDevEx at address 0x14032E010 is erroneously identified as IMPORTED while in reality the function code is present in the binary (IDA shows it without problem).

After looking a little bit into the sources it seems that the problem lies in this check:

      if (function.GetBasicBlocks().empty()) {
        function.SetType(Function::TYPE_IMPORTED);
      }

It seems that BinExport doesn't extract the basic blocks for that particular function.

[1] Here you can find the executable as well with the pdb file associated

@cblichmann cblichmann added the bug Something isn't working label Jun 22, 2022
@Myles1
Copy link

Myles1 commented Jul 12, 2023

Any updates on this? I'm running into the same issue. A function that's correctly disassembled by IDA is showing up as IMPORTED, with no instructions, in the binexport file.

I'm working on an arm64 binary and, as far as I can tell, this happens when there is a JUMPOUT from the current function, or sometimes when the current function is just very large.

@Myles1
Copy link

Myles1 commented Jul 25, 2023

@cblichmann Do you have any suggestions/ideas for fixing this?

@cblichmann
Copy link
Member

Oh wow, I should've noticed this much earlier.
@patacca: You're right, we export this as "Imported", which is a bug. However, after a bit of debugging and not finding anything odd, I simply logged everything to stdout with

./ida64 -OBinExportAlsoLogToStdErr:TRUE -OBinExportAutoAction:BinExportBinary ~/ntoskrnl-5012647.exe.i64

This resulted in (among others):

I0726 14:14:00.924358  256587 flow_graph.cc:498] Discarding excessively large function 000000014032E010: 3202 basic blocks, 5000 edges, 14559 instructions (Limit is 5000, 5000, 20000)

So in a sense this is working as intended. IDA itself also by default refuses to show the graph view for it (> 1000 basic blocks). This is exactly hitting the kMaxFunctionEdges limit of 5000 edges (defined in flow_graph.h).

@Myles1: Can you check with a command-line similar to the above whether you also hit one of these limits?
Also, how often does this occur for typical binaries you analyze?

We should probably increase the limits somewhat and also set the function type to Function::TYPE_INVALID instead.

@patacca
Copy link
Author

patacca commented Jul 26, 2023

Why do we want to enforce the limits by hardcoding the values?
Isn't it better to let the users adjust those values to their specific needs?

@cblichmann
Copy link
Member

Yes, we can definitely do that. It'd be awkward to use from a command-line and we'd also need to make this a BinDiff setting. I'll look into that.

copybara-service bot pushed a commit that referenced this issue Jul 27, 2023
Otherwise these will be interpreted as imported, which is wrong and
confusing (see issue #90).

PiperOrigin-RevId: 551456988
Change-Id: Id645ef630c580b2b4c4db244bc1387598b5b3c07
@cblichmann
Copy link
Member

Note: You can try this out with the artifacts that are now being built for each commit: https://github.com/google/binexport/actions/runs/5678688428

@Myles1
Copy link

Myles1 commented Aug 1, 2023

This happens very frequently in the binaries that I'm analyzing. Is there a workaround for keeping the excessively large cfg? I need to keep the cfg, no matter how large it might be.

@cblichmann
Copy link
Member

As a temporary workaround, you can increase the limits in flow_graph.h and recompile.
This should indeed be configurable, but I'm trying to figure out the best way to do this.
Contenders are:

  • environment variables (BINEXPORT_MAX_FUNCTION_BASIC_BLOCKS)
  • Plugin options that can be set on IDA's command-line (-OBinExportMaxFunctionBasicBlocks:<N>)
  • not making it configurable, increasing the limits drastically instead

As an aside, IIRC, the Ghidra exporter extension has no hard-coded limits.

@patacca
Copy link
Author

patacca commented Aug 2, 2023

I'm voting for the IDA command line option, leaving the current default values when not specified. It keeps it consistent with the other options already present (like OBinExportAlsoLogToStdErr or OBinExportAutoAction) making it easier for people to find them.
Just giving my personal unwanted opinion 😄

@Myles1
Copy link

Myles1 commented Aug 7, 2023

Shouldn't we set the default to a large number and allow configuration for lowering it, as opposed to the other way around? That would make the defaults as sane as possible for new people. This issue took a while for me to track down and I hope others don't have to do the same.

Edit: I'm also unable to recompile because I have IDA Home, not IDA Pro. Would it be possible to make a new release with updated values?

@cblichmann
Copy link
Member

@Myles1 I just submitted such a change for review.
As for new binaries, you can always find them under the Actions tab under cmake-build. E.g. this is a previous one: https://github.com/google/binexport/actions/runs/5689470178.

copybara-service bot pushed a commit that referenced this issue Aug 22, 2023
Machines have gotten faster and have lots more memory since these limits were
introduced originally.
A follow-up will make them configurable. See discussion in #90.

PiperOrigin-RevId: 559098513
Change-Id: Ie19ae7dcc1d81e41f82c286ca5fd3dd791cecb0c
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants