OpenML
Medium-Articles

Status: active. Format: ARFF. License: CC0: Public Domain. Visibility: public. Uploaded 24-03-2022 by Dustin Carrion.
Context: Medium is one of the most popular platforms for sharing knowledge about almost any field. It is widely used to publish articles on ML, AI, and data science. This dataset is a collection of about 350 articles in these fields.

Content: The dataset contains the articles along with their titles, the number of claps each has received, their links, and their reading times.

Acknowledgements: This dataset was scraped from Medium. The uploader created a Python script that scrapes all the required articles from Medium using only their tags.

Inspiration: How do you write a good article? How do you inform the reader in an interesting way? What sort of title attracts more readers? How long should an article be?

5 features

Feature        Type     Unique values  Missing
author         string   182            0
claps          string   163            0
reading_time   numeric  25             0
link (ignore)  string   337            0
title          string   230            0
text           string   230            0
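
The claps feature is stored as a string rather than a number, so it usually needs converting before it can be compared with reading_time or used in a model. The sketch below is one possible conversion. It assumes the strings use Medium's abbreviated display style (for example "1.2K"), which this page does not confirm, and the function name parse_claps is hypothetical.

```python
def parse_claps(raw: str) -> int:
    """Convert a clap count string such as '305' or '1.2K' to an integer.

    Assumption: counts are either plain integers or use a 'K'/'M' suffix.
    The actual format in the dataset is not documented on this page.
    """
    text = raw.strip().replace(",", "").upper()
    multipliers = {"K": 1_000, "M": 1_000_000}
    if text and text[-1] in multipliers:
        # Scale the numeric part by the suffix, then round to a whole count.
        return round(float(text[:-1]) * multipliers[text[-1]])
    return int(text)


if __name__ == "__main__":
    for sample in ["305", "1.2K", "2M"]:
        print(sample, parse_claps(sample))
```

If the real values follow a different pattern, only the suffix table and the fallback branch should need to change.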

19 properties

Number of instances (rows) of the dataset: 337
Number of attributes (columns) of the dataset: 5
Number of distinct values of the target attribute (if it is nominal): (no value listed)
Number of missing values in the dataset: 0
Number of instances with at least one value missing: 0
Number of numeric attributes: 1
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0.01
Percentage of numeric attributes: 20
Percentage of instances belonging to the most frequent class: (no value listed)
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: (no value listed)
Percentage of instances belonging to the least frequent class: (no value listed)
Number of instances belonging to the least frequent class: (no value listed)
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0
Average class difference between consecutive instances: (no value listed)
Percentage of missing values: 0

0 tasks
