Data
3-million-Sudoku-puzzles-with-ratings

3-million-Sudoku-puzzles-with-ratings

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Overview This dataset contains 3 million Sudoku puzzles and their solutions. The level of difficulty varies -- some can be solved easily by a beginner, while others will challenge experienced solvers. Most puzzles have between 23 and 26 clues. The minimum number of clues in the dataset is 19, and the maximum is 31. It has been shown that 17 is the minimum number of clues for a valid, uniquely solvable Sudoku puzzle. However, these puzzles are difficult to find, so they are not included in our dataset. Each row of the dataset includes the number of clues and an estimated difficulty rating. The difficulty rating is computed by an automated solver and it is based on the average search tree depth over 10 attempts. 43 of the puzzles have a difficulty of zero, meaning that it can be solved using a simple scanning technique. The highest difficulty rating is 8.5. The puzzles were generated using Blagovest Dachev's Sudoku generator and solver, at https://github.com/dachev/sudoku.

4 features

difficulty (target)numeric76 unique values
0 missing
id (ignore)numeric3000000 unique values
0 missing
puzzlestring3000000 unique values
0 missing
solutionstring3000000 unique values
0 missing
cluesnumeric13 unique values
0 missing

19 properties

3000000
Number of instances (rows) of the dataset.
4
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
2
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
-0.39
Average class difference between consecutive instances.
50
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
0
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task