Data
Linkedin_Job_Postings

Linkedin_Job_Postings

active ARFF Attribution-ShareAlike (CC BY-SA) Visibility: public Uploaded 06-05-2024 by Iwo Godzwon
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Description: The "postings.csv" dataset comprises various job postings across different companies and locations. It includes detailed information on job titles, job descriptions, salaries, and application details. With columns specifying job IDs, company names, job titles, job descriptions, max salary, pay period, location, company IDs, views, median salary, minimum salary, formatted work type, number of applications, posting and listing times, remote work allowance, job posting URLs, application URLs, application types, expiry dates, closed times, experience levels, required skills, and work types. The dataset provides a rich source of information for analyzing job market trends, company hiring practices, and job seeker behaviors. Attribute Description: - job_id: Unique identifier for each job posting. - company_name: Name of the company offering the job. - title: Job title. - description: Detailed job description. - max_salary, med_salary, min_salary: Salary information (maximum, median, minimum). - pay_period: Basis of salary compensation (e.g., hourly, yearly). - location: Geographic location of the job. - company_id: Unique identifier for each company. - views: Number of views each posting has received. - formatted_work_type: Employment type (e.g., full-time, part-time). - applies: Number of applications submitted. - original_listed_time, closed_time: Timestamps for when job postings were listed and closed. - remote_allowed: Indicates if remote work is permitted. - job_posting_url, application_url: URLs for the job posting and application submission. - application_type: Method of application submission. - expiry: Expiry date of the job posting. - formatted_experience_level: Required experience level. - skills_desc: Description of required skills. - posting_domain: Domain of the job posting. - sponsored: Indicates if posting is sponsored. - work_type: Nature of the work (e.g., full-time, part-time). - currency, compensation_type: Currency and type of compensation. Use Case: This dataset is invaluable for researchers and analysts focusing on labor market trends, HR professionals seeking comparative analysis for salary benchmarking or recruiting strategies, and job seekers looking to understand the dynamics of the job market. It can also be used to develop machine learning models to predict industry trends, salary ranges, and successful recruitment campaigns based on textual data from job descriptions and titles.

28 features

job_idstring123849 unique values
0 missing
company_namestring24428 unique values
1719 missing
titlestring72521 unique values
0 missing
descriptionstring107827 unique values
7 missing
max_salarynumeric5321 unique values
94056 missing
pay_periodnominal5 unique values
87776 missing
locationstring8526 unique values
0 missing
company_idnumeric24474 unique values
1717 missing
viewsnumeric684 unique values
1689 missing
med_salarynumeric1417 unique values
117569 missing
min_salarynumeric4612 unique values
94056 missing
formatted_work_typenominal7 unique values
0 missing
appliesnumeric274 unique values
100529 missing
original_listed_timestring65036 unique values
0 missing
remote_allowedstring1 unique values
108603 missing
job_posting_urlstring123849 unique values
0 missing
application_urlstring84800 unique values
36665 missing
application_typenominal4 unique values
0 missing
expirynumeric54851 unique values
0 missing
closed_timenominal698 unique values
122776 missing
formatted_experience_levelnominal6 unique values
29409 missing
skills_descstring2212 unique values
121410 missing
listed_timestring53231 unique values
0 missing
posting_domainnominal4443 unique values
39968 missing
sponsoredstring1 unique values
0 missing
work_typenominal7 unique values
0 missing
currencynominal6 unique values
87776 missing
compensation_typenominal1 unique values
87776 missing

19 properties

123849
Number of instances (rows) of the dataset.
28
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
1133501
Number of missing values in the dataset.
123849
Number of instances with at least one value missing.
7
Number of numeric attributes.
9
Number of nominal attributes.
Average class difference between consecutive instances.
32.69
Percentage of missing values.
0
Number of attributes divided by the number of instances.
25
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
32.14
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
1
Number of binary attributes.
3.57
Percentage of binary attributes.
100
Percentage of instances having missing values.

0 tasks

Define a new task