Indian-News-Articles

Status: active | Format: ARFF | License: Database: Open Database, Contents: Original Authors | Visibility: public | Uploaded 24-03-2022 by Dustin Carrion
  • Computer Systems Machine Learning
Context
It's always interesting to analyse what's going on in the news, but there is no good data available on Kaggle for the Indian context, so I created a dataset.

Content
Apart from the main content of the articles, each row contains the author, link, publish date, etc.

Acknowledgements
Grateful to the creators of Python and the BeautifulSoup library, and to firstpost.com.

Inspiration
Data is not always available to us; sometimes we need to create it ourselves. This is my attempt to do that.

8 features

Feature        Type     Unique values   Missing
source         string   2               0
category       string   4               0
link           string   1563            0
author         string   129             6
published_at   string   1518            6
header         string   1557            0
subheader      string   1538            12
content        string   1543            9
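
As a rough illustration of how per-feature counts like those above could be computed, here is a minimal sketch in plain Python. The `summarize` helper and the sample rows are hypothetical placeholders, not taken from the dataset, and the sketch assumes that `None` and empty strings count as missing and are excluded from the unique-value count; the page does not say how its own counts were derived.

```python
from collections import defaultdict


def summarize(rows, columns):
    """Return {column: (unique_count, missing_count)} for a list of dict rows.

    Assumption: None and "" are treated as missing and are not counted
    among the unique values.
    """
    unique = defaultdict(set)
    missing = defaultdict(int)
    for row in rows:
        for col in columns:
            value = row.get(col)
            if value is None or value == "":
                missing[col] += 1
            else:
                unique[col].add(value)
    return {col: (len(unique[col]), missing[col]) for col in columns}


# Hypothetical placeholder rows, NOT real records from the dataset.
rows = [
    {"source": "a", "author": "x"},
    {"source": "a", "author": None},
    {"source": "b", "author": "x"},
]

summary = summarize(rows, ["source", "author"])
for col, (n_unique, n_missing) in summary.items():
    print(f"{col}: {n_unique} unique values, {n_missing} missing")
```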

19 properties

Number of instances (rows) of the dataset: 1568
Number of attributes (columns) of the dataset: 8
Number of distinct values of the target attribute (if it is nominal): no value listed
Number of missing values in the dataset: 33
Number of instances with at least one value missing: 15
Number of numeric attributes: 0
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0.01
Percentage of numeric attributes: 0
Percentage of instances belonging to the most frequent class: no value listed
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: no value listed
Percentage of instances belonging to the least frequent class: no value listed
Number of instances belonging to the least frequent class: no value listed
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0.96
Average class difference between consecutive instances: no value listed
Percentage of missing values: 0.26
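
The derived properties can be cross-checked against the raw counts on this page. The sketch below recomputes them from the per-feature missing counts, the instance count, and the number of instances with a missing value. The variable names are illustrative, and rounding to two decimals is an assumption about how the page displays values.

```python
# Per-feature missing counts as listed in the feature summary.
missing_per_feature = {
    "source": 0,
    "category": 0,
    "link": 0,
    "author": 6,
    "published_at": 6,
    "header": 0,
    "subheader": 12,
    "content": 9,
}

n_instances = 1568       # rows in the dataset
rows_with_missing = 15   # instances with at least one missing value
n_features = len(missing_per_feature)

# Total missing cells across all features.
total_missing = sum(missing_per_feature.values())

# Attributes divided by instances.
dimensionality = n_features / n_instances

# Share of rows that contain at least one missing value, in percent.
pct_rows_missing = 100 * rows_with_missing / n_instances

# Share of all cells (rows x columns) that are missing, in percent.
pct_values_missing = 100 * total_missing / (n_instances * n_features)

print(total_missing)                 # 33
print(round(dimensionality, 2))      # 0.01
print(round(pct_rows_missing, 2))    # 0.96
print(round(pct_values_missing, 2))  # 0.26
```

The recomputed figures agree with the listed properties, which suggests the percentages are taken over rows (0.96) and over all cells (0.26) respectively.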

0 tasks
