Collect metrics during compilation #6820

Open
aakoshh opened this issue Dec 16, 2024 · 1 comment
Labels
enhancement New feature or request

Comments


aakoshh commented Dec 16, 2024

Problem

While investigating #6786 I added some custom statistics to the code to see the "big picture" of what was going on, using aggregate numbers to guide me towards where the changes in behaviour were coming from. For example, sometimes it was the number of instructions after an SSA pass; other times it was the number of instructions generated due to the different ratio of signed/unsigned values used by the passes.

The example was too big to inspect the full SSA passes or the ACIR directly (e.g. hundreds of thousands of instructions, a million opcodes), but the high-level stats were helpful for comparing what differed between two commits. NB I could not use nargo info because the example was a contract and info looks for binary packages.

I thought we might want to collect statistics using one of the metrics libraries, such as this one, with an option to export them at the end of the compilation, or perhaps between SSA passes as well. Prometheus even allows pushing metrics from batch processes, in case we would want to set up something to monitor the compilation on CI.

Happy Case

We could have something like a --show-metrics CLI argument to install a backend to collect metrics (which would otherwise be ignored), and then collect them anywhere we think is interesting to keep track of, e.g. the number of range constraints added due to this or that reason. Then we could use some diff tool to compare what changed in this summary of the compilation of the same program from one commit to the next.

In the code we could collect metrics like this:

    fn convert_ssa_instruction(
        &mut self,
        instruction_id: InstructionId,
        dfg: &DataFlowGraph,
    ) -> Result<Vec<SsaReport>, RuntimeError> {
        let before = self.acir_context.acir_ir.opcodes().len();
        let mut tag = None;
        match &dfg[instruction_id] {
            Instruction::Binary(binary) => {
                tag = Some("binary");
                let result_acir_var = self.convert_ssa_binary(binary, dfg)?;
                self.define_result_var(dfg, instruction_id, result_acir_var);
            }
            Instruction::Constrain(lhs, rhs, assert_message) => {
                // ...
            }
            // ... remaining instruction variants
        };
        if let Some(tag) = tag {
            let after = self.acir_context.acir_ir.opcodes().len();
            // `opcodes().len()` is a `usize`, while `counter!` increments by `u64`.
            counter!(format!("convert_ssa_instruction.{tag}")).increment((after - before) as u64);
        }
        // ...
    }

There are other ways, such as using the tracing library and subscribing to events to perform aggregations, or deriving metrics from a struct for type safety; here's another derivation example.

Workaround

Yes

Workaround Description

Edit the Rust code to add ad-hoc counters and prints, then execute with commands such as this:

cargo run --release -p nargo_cli -- --program-dir ../../noir-projects/noir-contracts compile --package stateful_test_contract --silence-warnings

Additional Context

No response

Project Impact

None

Blocker Context

No response

Would you like to submit a PR for this Issue?

None

Support Needs

No response

@aakoshh aakoshh added the enhancement New feature or request label Dec 16, 2024
@github-project-automation github-project-automation bot moved this to 📋 Backlog in Noir Dec 16, 2024

aakoshh commented Jan 8, 2025

Thanks to a tip from @TomAFrench about flamegraph.sh I see we already have the tools to report detailed gate counts together with the lines they come from, which is fantastic 🙇 I think that covers a lot of what I was looking for in this proposal. Here's an example of opcode counts produced by this command:

[Image: flamegraph of opcode counts]

It would be nice if the profiler could pull out the definition of a function from a contract artefact on its own, rather than having to rely on node and a TypeScript file to do it.

The format gave me another idea for how to collect metrics, though. In the above example we see there are range constraints, but we don't quite know where they come from. It turns out there was a bug in check_unsigned_overflow which added range constraints to Mul operations, which it shouldn't have in this case because the rhs was a bool cast to a u8.

Say we instrumented the code to emit tracing events, and also added tracing spans along the way; then we could conditionally add a subscriber to these events in the application root and send them to a file. This file could then be used by the profiler to construct a flamegraph that isn't grouped by the location-based call stack of where an operation came from in the Noir code, but rather by the stack of spans that produced the instruction in the compiler code. For instance, we could see that the "range instruction added" events were induced by "binary mul" instructions.
