OpenML
Coursera-Course-Dataset

Coursera-Course-Dataset

active ARFF GPL 2 Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context This is a dataset i generated during a hackathon for project purpose. Here i have scrapped data from Coursera official web site. Our project aims to help any new learner get the right course to learn by just answering a few questions. It is an intelligent course recommendation system. Hence we had to scrap data from few educational websites. This is data scrapped from Coursera website. For the project visit: https://github.com/Siddharth1698/Coursu . Please do show your support by following us. I have just started to learn on data science and hope this dataset will be helpful to someone for his/her personal purposes. The scrapping code is here : https://github.com/Siddharth1698/Coursera-Course-Dataset Article about the dataset generation : https://medium.com/analytics-vidhya/web-scraping-and-coursera-8db6af45d83f Content This dataset contains mainly 6 columns and 890 course data. The detailed description: course_title : Contains the course title. course_organization : It tells which organization is conducting the courses. courseCertificatetype : It has details about what are the different certifications available in courses. course_rating : It has the ratings associated with each course. course_difficulty : It tells about how difficult or what is the level of the course. coursestudentsenrolled : It has the number of students that are enrolled in the course. Inspiration This is just one of my first scraped dataset. Follow my GitHub for more: https://github.com/Siddharth1698

7 features

Unnamed:_0numeric891 unique values
0 missing
course_titlestring884 unique values
0 missing
course_organizationstring154 unique values
0 missing
course_Certificate_typestring3 unique values
0 missing
course_ratingnumeric14 unique values
0 missing
course_difficultystring4 unique values
0 missing
course_students_enrolledstring205 unique values
0 missing

19 properties

891
Number of instances (rows) of the dataset.
7
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
2
Number of numeric attributes.
0
Number of nominal attributes.
0.01
Number of attributes divided by the number of instances.
28.57
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.

0 tasks

Define a new task