Partial conversion values #16

martinthomson · 2024-09-23T21:07:51Z

I had been assuming we could report a partial conversion value if there is insufficient but non-zero privacy budget. Since the privacy budget is likely to be chosen by the browser (and may vary between browsers and over time), the conversion site doesn't necessarily know the available budget.

Originally posted by @andyleiserson in #11 (comment)

csharrison · 2025-01-14T21:37:49Z

@martinthomson you mentioned in the PR you "have a preference for avoiding that sort of thing". Can you expand on it?

It's a bit hard to reason about, but overall I don't see a huge problem with this. My intuition is that it is probably a minor utility win at no cost to privacy.

martinthomson · 2025-01-29T23:23:46Z

This isn't a privacy question as much as it is a question about what sort of functionality we present to sites.

If someone wants to submit a value of 10 with epsilon 5 and they only have epsilon 1 of their budget remaining, there would seem to be two options:

Spend the partial budget and cap the contribution (to 2 in this case)
Don't spend the budget and send a zero
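To make the two options concrete, here is a minimal sketch. It assumes the contribution is scaled linearly by the fraction of the requested epsilon that is still available, which is what the "to 2 in this case" figure implies. The function names `partial_spend` and `all_or_nothing` are hypothetical and are not part of any API.

```python
from fractions import Fraction


def partial_spend(value, requested_eps, remaining_eps):
    """Option 1: spend whatever budget is left and cap the contribution to match."""
    spent = min(requested_eps, remaining_eps)
    # Assumption: the cap is proportional to the share of epsilon actually spent.
    capped = Fraction(value) * spent / requested_eps
    return capped, spent


def all_or_nothing(value, requested_eps, remaining_eps):
    """Option 2: if the full request cannot be met, spend nothing and send zero."""
    if remaining_eps >= requested_eps:
        return value, requested_eps
    return 0, 0


# A value of 10 with epsilon 5 when only epsilon 1 remains.
print(partial_spend(10, 5, 1))   # contribution capped to 2, epsilon 1 spent
print(all_or_nothing(10, 5, 1))  # contribution 0, nothing spent
```

Exact fractions are used here only to keep the arithmetic free of rounding; that is a choice made for the sketch.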

The feedback I've gotten from people is that they would like budget exhaustion to have as predictable an effect on the result as possible.

On reflection, I don't know what the best option is. I don't know which option is most predictable, though I will observe that if we reserve budget for reporting on truncation, the amount we need might increase if we want to indicate that partial spend has occurred, as opposed to a simple boolean (i.e., a count) of the number of (wholly) lost conversions.

I do have a preference for fixing this in the specification, rather than making it a choice, but even that is open to debate. What is your preference?

csharrison · 2025-02-04T02:23:09Z

Thinking about this more, I actually think spending a partial budget should not be allowed as it allows violations of the privacy guarantee. Here is an example, where each item is a new epoch:

Impression 1
Impression 2
Impression 3
Conversion spending .5 matching imp 2 and 3 → (0, 0, .5)
Conversion spending 1 matching imp 1 and 3 → (0, 0, .5) (partial budget spend happens here)

Neighbor removes imp 3. Conversions return (0, .5, 0) and (1, 0, 0), so the L1 diff = 1 + 1.5 = 2.5.
Without partial credit the first database returns (0, 0, .5), (0, 0, 0) and the neighbor returns (0, .5, 0) and (1, 0, 0) and the issue is resolved (L1 diff 2).
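Written out, the two L1 figures above are the sum over both conversions of the per-epoch absolute differences between the reports from the database with impression 3 and the reports from the neighbor:

```math
\begin{aligned}
\text{partial spend:}\quad & \lVert (0, 0, .5) - (0, .5, 0) \rVert_1 + \lVert (0, 0, .5) - (1, 0, 0) \rVert_1 = 1 + 1.5 = 2.5 \\
\text{no partial credit, as stated above:}\quad & \lVert (0, 0, .5) - (0, .5, 0) \rVert_1 + \lVert (0, 0, 0) - (1, 0, 0) \rVert_1 = 1 + 1 = 2
\end{aligned}
```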

Silver lining I guess is if this is true, then it's one less thing we need to discuss 😆 ?

martinthomson · 2025-02-04T02:31:07Z

Hmm, I think that we're resolving that a change to one epoch has -- at most -- an L1 difference of 2, so that would seem to blow the limit in an undesirable way.

An alternative interpretation would be to have queries capped by the budget that is available across all epochs that contain impressions. Then the two outcomes would be (0, 0, .5) and (0, 0, .5) for the case with impression 3 and (0, .5, 0) and (.5, 0, 0) for the one without. That is probably not an outcome that the paper contemplates though. (It is consistent with the description in the paper, but it requires a re-assessment.)

csharrison · 2025-02-04T02:50:34Z

> An alternative interpretation would be to have queries capped by the budget that is available across all epochs that contain impressions. Then the two outcomes would be (0, 0, .5) and (0, 0, .5) for the case with impression 3 and (0, .5, 0) and (.5, 0, 0) for the one without. That is probably not an outcome that the paper contemplates though. (It is consistent with the description in the paper, but it requires a re-assessment.)

This doesn't seem like a great idea. I think it's a reasonable assumption that older epochs will have less budget, and limiting the contribution to recent epochs seems quite limiting. I'm not sure this "partial spend" functionality is worth it at that point.

csharrison · 2025-02-04T14:50:38Z

> Thinking about this more, I actually think spending a partial budget should not be allowed as it allows violations of the privacy guarantee. Here is an example, where each item is a new epoch:
>
> Impression 1
>
> Impression 2
>
> Impression 3
>
> Conversion spending .5 matching imp 2 and 3 → (0, 0, .5)
>
> Conversion spending 1 matching imp 1 and 3 → (0, 0, .5) (partial budget spend happens here)
>
> Neighbor removes imp 3. Conversions return (0, .5, 0) and (1, 0, 0), so the L1 diff = 1 + 1.5 = 2.5. Without partial credit the first database returns (0, 0, .5), (0, 0, 0) and the neighbor returns (0, .5, 0) and (1, 0, 0) and the issue is resolved (L1 diff 2).

Hm actually I think I made an error, it should be:

Without partial credit the first database returns (0, 0, .5), (1, 0, 0) and the neighbor returns (0, .5, 0) and (1, 0, 0) and the issue is resolved (L1 diff 1).

Since we treat the budget exhaustion as the same as if there are no matching impressions.
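The corrected example can be replayed with a small simulation. This is only a sketch under stated assumptions, none of which come from a specification: each epoch starts with a budget of 1, a conversion's credit equals its spend, attribution is last-touch, and every matching epoch that holds an impression is charged. The names `run_queries` and `l1` are hypothetical.

```python
def run_queries(impression_epochs, conversions, budget=1.0, allow_partial=False):
    """Return one report per conversion as a tuple of per-epoch credit.

    impression_epochs: set of epochs (1 to 3) that hold an impression.
    conversions: list of (spend, matching_epochs) pairs in arrival order.
    """
    epochs = (1, 2, 3)
    remaining = {e: budget for e in epochs}  # assumed per-epoch budget
    reports = []
    for spend, matching in conversions:
        usable = []  # (epoch, credit that epoch could receive)
        for e in sorted(matching):
            if e not in impression_epochs:
                continue  # no impression in this epoch, so nothing is charged
            if remaining[e] >= spend:
                remaining[e] -= spend
                usable.append((e, spend))
            elif allow_partial and remaining[e] > 0:
                usable.append((e, remaining[e]))  # partial spend, capped credit
                remaining[e] = 0.0
            # Otherwise the epoch is treated as if it had no matching impressions.
        report = [0.0] * len(epochs)
        if usable:
            epoch, credit = max(usable)  # last touch: latest usable epoch
            report[epoch - 1] = credit
        reports.append(tuple(report))
    return reports


def l1(reports_a, reports_b):
    """Total L1 distance between two lists of reports."""
    return sum(abs(x - y) for a, b in zip(reports_a, reports_b) for x, y in zip(a, b))


conversions = [(0.5, {2, 3}), (1.0, {1, 3})]
for allow_partial in (True, False):
    full = run_queries({1, 2, 3}, conversions, allow_partial=allow_partial)
    neighbor = run_queries({1, 2}, conversions, allow_partial=allow_partial)
    print(allow_partial, full, neighbor, l1(full, neighbor))
```

Under these assumptions the partial-spend run gives an L1 difference of 2.5, and the run that treats an exhausted epoch as empty gives 1, matching the figures in this comment.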

csharrison · 2025-02-11T07:46:29Z

We agreed to close this issue as wontfix in the 2/11 call.

bmcase · 2025-02-12T01:31:32Z

I chatted about this further with Roxana and Pierre. The answer right now on the theory side is that, with the current theory, which relies on filters, adjusting sensitivity based on available budget is not justified analytically. Adapting the accounting theory to odometers could potentially permit this, but it requires analytical work.

I thought it would be good to at least record that on this issue in case we pick it up again in relation to multi-touch attribution, where I think there was some interest in the meeting in seeing this reconsidered.

martinthomson · 2025-02-12T01:59:25Z

So for multi-touch, it seems like we'd be forced to spend all the budget in all epochs with impressions, even for a) epochs that add nothing to the final contribution and b) epochs that only contribute partially to the result. That would seem less than ideal, but I understand that we don't have the analytical framework that would allow us to do better. Hopefully we can do better, but that applies even for last-touch, where we spend budget for epochs 0 through 4 if they have impressions, even if the impression we use comes from epoch 5.
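The last-touch case described here can be sketched as follows. This is an illustration only: it assumes every epoch with an impression has enough budget left and that the same amount is charged to each of them. The name `last_touch_charges` is hypothetical.

```python
def last_touch_charges(epochs_with_impressions, spend, remaining):
    """Charge every epoch that has an impression; credit only the latest one."""
    charged = sorted(epochs_with_impressions)
    for e in charged:
        remaining[e] -= spend  # budget is spent even where no credit is given
    credited = charged[-1] if charged else None
    return credited, charged


remaining = {e: 1.0 for e in range(6)}  # assumed starting budget per epoch
credited, charged = last_touch_charges(range(6), 0.25, remaining)
print(credited)  # the epoch whose impression is used
print(charged)   # every epoch that paid for the query
```

With impressions in epochs 0 through 5, only epoch 5 receives credit, yet all six epochs are charged.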

martinthomson added the discuss (Needs working group discussion) label Jan 29, 2025

martinthomson added this to PPA API, Level 1 Feb 3, 2025

martinthomson moved this to Essential in PPA API, Level 1 Feb 3, 2025

csharrison mentioned this issue Feb 4, 2025

Avoid using floating point arithmetic for privacy budget #77

Open

martinthomson mentioned this issue Feb 11, 2025

Develop a format for examples #88

Open

csharrison closed this as not planned Feb 11, 2025
