Strategies for Number Puzzles of all kinds
  Print Version  
Solvers
Puzzles
Str8ts
Other
  The Relative Incidence of Sudoku Strategies

A recent question from a reader prompted me to run off some statistics which I think are interesting and worth exploring.

Comment:There is something I am curious about that I really hope you can answer, although it's quite subjective and I suppose the answer will be a ballpark figure but I was hoping a sudoku expert such as yourself could take your best educated guess at.

If I know all of your basic, tough, and diabolical strategies, but don't go as far as any of your evil strategies that you list, what percentage of sudoku puzzles (in your opinion) do you think I could solve-80% of all puzzles that I would try? 85%? 90%? 95%? 99%?

What would you guess if you had to estimate? I know it's hard since there are literally trillions of puzzles, but easy, medium, tough, and many diabolical puzzles I can already solve with these current strategies, excluding your evil ones. Do you think the percentage of puzzles where you HAVE to use one or more evil strategies in order to solve the puzzle is a small percentage, perhaps 1%? 2%? 5%? 10%?

Just curious what your opinion is.


There is a lot to grading and scoring a Sudoku puzzle. I've put some thoughts about this into
http://www.scanraid.com/Sudoku_Creation_and_Grading.pdf. There is not a one to one correspondence between the published grade (or the grade on my solver) and the list of strategies and many factors contribute to the grade. My strategy list is partially subjective in that I choose to label certain strategies as 'tough' for ease of explanation and to show what I consider the best order in which to attack a puzzle. It is an attempt at a 'minimum path'.

It should also be noted that because I don't use strategy X to solve a puzzle in the solver, it does not follow that strategy X could not be used. There are often many ways to solve the same puzzle.

However it is still an interesting question what proportion of all puzzles require at least one strategy in each grade group. I've run a count on my 2010 stock library which contains 25,972 Sudoku puzzles. These were produced randomly and I did not know the grade until after I created them. The sample is therefore fair. The results are:

  • 70% required only trivial strategies, that is only naked and hidden singles.
  • 7.0% required the use of Naked Pairs and Hidden Pairs.
  • 2.3% required the above and moderate strategies (Triples, Quads, Intersection removal)
  • 5.3% required the above and tough strategies like X-Wings, Simple Colouring, Y wings etc)
  • 10.0% required the above and diabolical strategies
  • 5.6% required the above and extreme strategies
  • 0.4% (91 in total) could not be solved using my list of logical strategies, although 87 did solve using the 'trail and error' Bowman's Bingo.

This confirms my view that the vast majority of puzzles are uninteresting. In order to produce a 100 puzzles of all grades I need to over produce many puzzles since the incidence of higher grade puzzles is low. Note that the 2.3% of 'moderate' only puzzles does not mean they are rare. Any hard puzzle will require many more incidences of moderate strategies to complete in addition to the hard ones.
It follows that I can produce a list of all the Sudoku strategies and a count of their incidence in solving the stock.
STRATEGYCount%
Naked Singles 2345590.3%
Hidden Singles 1346451.8%
Naked Pairs 608323.4%
Hidden Pairs 320912.4%
Naked Triples 1186 4.6%
Hidden Triples 206 0.8%
Naked Quads 30 0.1%
Hidden Quads 0 0.0%
Tough Strategies
Intersection Removal 488918.8%
X-Wing 1189 4.6%
Simple Colouring 2295 8.8%
Y-Wings 2134 8.2%
Sword-Fish 237 0.9%
Diabolical Strategies
Multivalue X-Wing 83 0.3%
Jelly-Fish 8 0.0%
X-Cycles 1760 6.8%
Unique Rectangles 782 3.0%
Hidden Unique Rectangles 1218 4.7%
Avoidable Rectangle 25 0.1%
XYZ Wing 652 2.5%
XY-Chain 270210.4%
Aligned Pair Exclusion 537 2.1%
BUG 0 0.0%
Evil Strategies
Sue-de-Coq 85 0.3%
Multivalue X-Wing 0 0.0%
Grouped X-Cycles 306 1.2%
Forcing Chains 220 0.8%
Empty Rectangles 24 0.1%
Finned X-Wing 179 0.7%
Finned Sword-Fish 129 0.5%
Almost Locked Sets 1215 4.7%
Death Blossom 0 0.0%
Altern. Inference Chains 541 2.1%
AIC with ALSs 10 0.0%
Pattern Overlay Method 79 0.3%
Bowman Bingo 87 0.3%
So if you were wondering, as I was, how useful certain strategies are, this data is interesting. The only other caveat I'd add is that some strategies are sub-sets of others, or can be expressed in terms of another strategy. For example, Remote Pairs are a special case of XY-Chains. It is useful for the solver to split these out but when making and grading I don't do so. Multi-Value X Wing can be expressed as a chain or cycle. So there is some overlap.

The answer to the reader's original question - the incidence of 'evil' strategies, is I'd say, about 5%.

Andrew Stuart

breakline

Add your comments

A confirmation email will be sent to you after submitting.

Your Name

Email Address - required for confirmation (it will not be displayed here)

Your comment or question

Remember me


Please keep your comments relevant to this article. Email addresses are never displayed, but they are required to confirm your comments.

When you enter your name and email address, you'll be sent a link to confirm your comment. If you want to be remembered, the check box above will store a cookie with your name and email address on your computer.

Line breaks and paragraphs are automatically converted — no need to use <p> or <br> tags.



Article created on 31-December-2009. Views: 1590
This page was last modified on 31-December-2009, at 12:12.
All text is copyright and for personal use only but may be reproduced with the permission of the author.
Copyright Andrew Stuart @ Scanraid Ltd, 2009