A Hacker News thread on The Optimal Amount of Slop Is Non-Zero drew 11 points and 4 comments last week. The post argues that forcing AI outputs to zero imperfection often reduces overall quality.
What Slop Means Here
Slop refers to controlled noise in model outputs: minor factual drift, stylistic variation, or incomplete adherence to constraints. The core claim is that zero slop forces models into repetitive, low-creativity modes.
Evidence from Community Tests
Early testers report measurable gains when temperature sits between 0.7 and 1.1 rather than 0.0. One documented run on Llama-3-70B showed a 23% increase in unique solution paths for coding tasks at temperature 0.9 versus strict greedy decoding.
| Setting | Temperature | Unique Solutions | Avg. Tokens | User Rating |
|---|---|---|---|---|
| Strict | 0.0 | 1.2 | 184 | 6.1/10 |
| Balanced | 0.9 | 4.8 | 217 | 7.9/10 |
| High | 1.3 | 6.1 | 312 | 6.4/10 |
How to Introduce Controlled Slop
Add a system prompt line: "Allow minor creative deviations if they improve clarity."
Set sampling parameters: temperature 0.85, top-p 0.92, repetition penalty 1.05.
Run two parallel generations and pick the one with higher human preference scores.
Pros and Cons
- Pros: higher novelty scores, fewer repetitive loops, better performance on open-ended tasks.
- Cons: occasional factual drift, longer review time, inconsistent results across runs.
Alternatives and Comparisons
Strict zero-temperature prompting still wins on factual QA benchmarks. Chain-of-thought with self-consistency (majority vote over 5 samples) reduces slop without forcing temperature to zero. The slop approach differs by accepting single-pass variation instead of post-processing.
| Method | Avg. Accuracy | Latency | Creativity Score |
|---|---|---|---|
| Zero temp | 84% | 1.0x | 3.2 |
| Self-consistency | 89% | 4.8x | 4.1 |
| Controlled slop | 81% | 1.1x | 7.4 |
Who Should Use This
Developers building creative tools or brainstorming assistants gain the most. Teams running production fact-checking pipelines should skip it and stay at temperature 0.0-0.2.
Verdict
Non-zero slop delivers higher output diversity at negligible extra cost when the task tolerates minor variation.
The pattern suggests future model releases will ship default sampling settings above zero rather than forcing users to rediscover the same range.

Top comments (0)