Probability notes from set theory to more advanced
Set theory
- Universal set (sample space) : all possible outcomes.
- Event: any subset .
- Empty event : never occurs.
Certain event : always occurs. - Set operations:
- Union: — “ or happens.”
- Intersection: — “ and happen.”
- Complement: — “ does not happen.”
- Difference: .
- Disjoint (mutually exclusive): .
These operations satisfy Boolean algebra rules:
Sigma-algebra
collection of measurable events To define probabilities consistently, we use a σ-algebra over :
- .
- If , then .
- If , then .
This ensures we can take complements and countable unions safely.
Basic probability spaces
- Sample space: (all outcomes). An event .
- Probability measure satisfies , , countable additivity.
- Example: fair coin , .
Conditional probability & product rule
- Conditional probability of given (with ):
- Product rule (equivalent):
Independence
- and are independent iff equivalently (when ).
- For random variables : independence means events and independent for all measurable .
Law of total probability
- If partition (mutually disjoint, ) then for any event :
- Useful to expand by conditioning on cases.
Expectation and indicator variables
- Indicator of event : if , else .
- Expectation of indicator: .
- Linearity: (no independence required).
- For a discrete r.v. : .
Useful identities
- .
- .
- If is a Bernoulli() bit independent of other randomness: (special case of total law).
Short worked example from our paper (the paper’s setup)
Arthur chooses uniformly. If he samples ; if he samples . He sends to Merlin, who outputs where . Arthur accepts iff .
Compute :
- Condition on (law of total probability):
- Evaluate each term:
- If : Merlin is correct iff , so .
- If : Merlin is correct iff , so .
- Since ,
- Using (Scheffé set), if then
Absolutely. This is a great way to solidify your understanding. The proof in Claim 4.4 is a classic example of an averaging argument, which is fundamental in complexity theory and cryptography.
Let’s walk through it.
1. Summary of the Soundness Proof (Claim 4.4)
The high-level goal of the soundness proof is to show that if the distributions and are close (i.e., ), then no prover , no matter how cleverly designed, can succeed with a probability much better than random guessing.
The proof proceeds in two logical steps:
- First, we prove this bound for all deterministic provers.
- Second, we show that this implies the bound for all randomized provers.
1.1 Bounding any Deterministic Prover ()
A deterministic prover is just a fixed function . As we discussed, this is mathematically equivalent to the prover picking a fixed subset and using the strategy: “Guess 1 if , and guess 0 otherwise.” We call this strategy .
The proof follows this path:
-
Write the Success Probability: For any such , we calculated its exact success probability: [cite_start] [cite: 219]
-
[cite_start]Apply the Soundness Promise: We are promised that the distributions are close[cite: 216]. This promise is .
-
Use the Definition of TV Distance: The total variation distance is defined as the maximum possible “advantage” over all possible sets:
This means for the specific set that our prover chose, its advantage must be less than or equal to the maximum possible advantage.
Therefore, .
-
Chain the Inequalities: We can now chain steps 2 and 3 together to get our key bound:
-
Conclude for Deterministic Provers: We substitute this bound back into our success probability equation from step 1:
This proves that no deterministic strategy can do better than .
1.2 Extending to any Randomized Prover ()
[cite_start]This is the “convexity” or “averaging” part of the argument[cite: 219].
-
Define a Randomized Prover: A randomized prover is simply an algorithm that randomly chooses which deterministic strategy to run. Formally, it’s a probability distribution over all possible sets .
-
Write the Success Probability: The overall success of is the average (or expected) success of all the deterministic strategies it might pick, weighted by the probability of picking them:
-
Apply the Deterministic Bound: We just proved in part 1.1 that every single term inside that expectation is bounded: for all .
-
Conclude for Randomized Provers: If you take an average of many values, and every one of those values is less than or equal to some number , the average itself must also be less than or equal to .
Therefore, .
This completes the proof. It shows that no strategy, deterministic or randomized, can break the soundness bound.
2. Glossary of Our Formal Concepts
Here are the “other bits” we defined, which form the conceptual toolkit for understanding the proof.
-
Honest Prover () vs. Adversarial Prover ():
- [cite_start] (no star) is the honest prover who follows the protocol as written (e.g., in Algorithm 1, uses the Scheffé set for its strategy)[cite: 210, 211].
- (with star) is an adversarial or arbitrary prover. This represents any possible algorithm (even a malicious, cheating one) that the prover might use. The soundness proof must hold “for all .”
-
Deterministic Strategy ():
- A strategy that involves no randomness. Given the same input sample , it will always produce the same guess .
- It is just a fixed function .
-
Randomized Strategy ():
- A strategy that uses internal randomness. It can be seen as a probability distribution over all possible deterministic strategies.
- It first “flips its coins” (picks a random seed ) to select a deterministic strategy , and then executes .
-
Set-Based Strategy ():
- This is just our more descriptive notation for .
- It makes explicit that every deterministic strategy is uniquely defined by a single subset , where .
- Saying “Merlin chooses ” is identical to saying “Merlin chooses .”