Data
Quality-Prediction-in-a-Mining-Process

Quality-Prediction-in-a-Mining-Process

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context The dataset comes from one of the most important parts of a mining process: a flotation plant The main goal is to use this data to predict how much impurity is in the ore concentrate. As this impurity is measured every hour, if we can predict how much silica (impurity) is in the ore concentrate, we can help the engineers, giving them early information to take actions (empowering). Hence, they will be able to take corrective actions in advance (reduce impurity, if it is the case) and also help the environment (reducing the amount of ore that goes to tailings as you reduce silica in the ore concentrate). Content The first column shows time and date range (from march of 2017 until september of 2017). Some columns were sampled every 20 second. Others were sampled on a hourly base. The second and third columns are quality measures of the iron ore pulp right before it is fed into the flotation plant. Column 4 until column 8 are the most important variables that impact in the ore quality in the end of the process. From column 9 until column 22, we can see process data (level and air flow inside the flotation columns, which also impact in ore quality. The last two columns are the final iron ore pulp quality measurement from the lab. Target is to predict the last column, which is the % of silica in the iron ore concentrate. Inspiration I have been working in this dataset for at least six months and would like to see if the community can help to answer the following questions: Is it possible to predict % Silica Concentrate every minute How many steps (hours) ahead can we predict % Silica in Concentrate This would help engineers to act in predictive and optimized way, mitigatin the % of iron that could have gone to tailings. Is it possible to predict % Silica in Concentrate whitout using % Iron Concentrate column (as they are highly correlated)

24 features

datestring4097 unique values
0 missing
%_Iron_Feedstring278 unique values
0 missing
%_Silica_Feedstring293 unique values
0 missing
Starch_Flowstring409317 unique values
0 missing
Amina_Flowstring319416 unique values
0 missing
Ore_Pulp_Flowstring180189 unique values
0 missing
Ore_Pulp_pHstring131143 unique values
0 missing
Ore_Pulp_Densitystring105805 unique values
0 missing
Flotation_Column_01_Air_Flowstring43675 unique values
0 missing
Flotation_Column_02_Air_Flowstring80442 unique values
0 missing
Flotation_Column_03_Air_Flowstring40630 unique values
0 missing
Flotation_Column_04_Air_Flowstring196006 unique values
0 missing
Flotation_Column_05_Air_Flowstring194711 unique values
0 missing
Flotation_Column_06_Air_Flowstring90548 unique values
0 missing
Flotation_Column_07_Air_Flowstring86819 unique values
0 missing
Flotation_Column_01_Levelstring299573 unique values
0 missing
Flotation_Column_02_Levelstring331189 unique values
0 missing
Flotation_Column_03_Levelstring322315 unique values
0 missing
Flotation_Column_04_Levelstring309264 unique values
0 missing
Flotation_Column_05_Levelstring276051 unique values
0 missing
Flotation_Column_06_Levelstring301502 unique values
0 missing
Flotation_Column_07_Levelstring295667 unique values
0 missing
%_Iron_Concentratestring38696 unique values
0 missing
%_Silica_Concentratestring55569 unique values
0 missing

19 properties

737453
Number of instances (rows) of the dataset.
24
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
0
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
Average class difference between consecutive instances.
0
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
0
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task