OpenML


Accidents-on-the-Rio-Niteri-bridge

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Onur Yildirim

Context

About the Rio-Niterói Bridge, according to Wikipedia: The Presidente Costa e Silva Bridge, popularly known as the Rio-Niterói Bridge, crosses Guanabara Bay in the state of Rio de Janeiro, Brazil. It connects the municipalities of Rio de Janeiro and Niterói. It is currently the largest prestressed concrete bridge in the southern hemisphere and the largest bridge in Latin America. According to the Ecoponte concessionaire, the structure carries more than 150 thousand passengers a day on days with normal traffic flow. [1] It is also known as the largest straight span in the world and the largest set of prestressed structures in America. [2]

Content

The data for this research were obtained from the website of the Federal Highway Police, which provides an Open Data section that, according to the website, "has no restrictions on licenses, patents or control mechanisms, so that they are freely available for use and redistributed at will". The Accidents section provides accident records in CSV format, organized by year.

Acknowledgements

It is great that this data is public and can provide valuable insights to improve people's quality of life.

Inspiration

Some guiding questions about this data:
- Do accidents happen on the bridge every day?
- Has the number of accidents on the bridge decreased?
- Has the installation of security cameras reduced the number of accidents?
How to import from source

The code below was used to import and organize the open data from the site.

Load tidyverse:

    library(tidyverse)

Import the available data (one CSV file per year, 2007 to 2020):

    dataset <- glue::glue("datatran{2007:2020}.csv") %>%
      map(~ .x %>%
            data.table::fread(sep = ";", dec = ",", encoding = "Latin-1") %>%
            as_tibble())

Perform data pre-processing in parallel using the foreach package:

    library(foreach)
    cl <- parallel::makeCluster(4)
    doParallel::registerDoParallel(cl)

    dataset <- foreach(x = dataset, info = 2007:2020,
                       .packages = c("dplyr", "stringr", "lubridate")) %dopar% {
      x %>%
        mutate(km = as.character(km), br = as.character(br)) %>%
        # normalize text: strip accents, lower-case, unify separators
        mutate_if(is.character,
                  ~ .x %>%
                    enc2native() %>%
                    stringi::stri_trans_general(id = "Latin-ASCII") %>%
                    tolower() %>%
                    str_replace_all("/", "-") %>%
                    str_replace_all(",", ".")) %>%
        # dates appear as day-month-year in some years and year-month-day in others
        mutate(data_inversa = if_else(is.na(dmy(data_inversa)),
                                      ymd(data_inversa),
                                      dmy(data_inversa)),
               km = as.numeric(km),
               br = as.numeric(br),
               id = as.character(id),
               dia_semana = str_remove_all(dia_semana, "-feira")) %>%
        # keep only the bridge stretch of BR-101
        filter(br == 101 & km >= 321 & km < 334) %>%
        slice(str_which(municipio, "(niteroi|rio de janeiro)"))
    }

    parallel::stopCluster(cl)

Save the tidy dataset:

    dataset %>%
      map2_df(2007:2020, ~ mutate(.x, ano = .y)) %>%
      write_csv('accidents-rio-niteroi-bridge.csv', na = "")
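The two central pre-processing steps, the date fallback on data_inversa and the filter for the bridge stretch of BR-101, can be tried on a few rows without downloading the source files. The following is a minimal base R sketch, not the author's code: parse_mixed_date is a hypothetical helper, the toy records are made up, and the kilometre bounds (from 321 up to but not including 334) are an assumption about the original comparison operators.

```r
# Hypothetical helper (not from the source): parse dates written either as
# day-month-year or as year-month-day, trying day-month-year first.
parse_mixed_date <- function(x) {
  parsed <- as.Date(x, format = "%d-%m-%Y")
  unparsed <- is.na(parsed)
  # Fall back to year-month-day only where the first attempt failed.
  parsed[unparsed] <- as.Date(x[unparsed], format = "%Y-%m-%d")
  parsed
}

# Toy records, made up for illustration only.
toy <- data.frame(
  data_inversa = c("25-12-2010", "2018-03-07", "01-02-2015"),
  br = c(101, 101, 116),
  km = c(325.5, 330, 325),
  stringsAsFactors = FALSE
)
toy$data_inversa <- parse_mixed_date(toy$data_inversa)

# Keep only BR-101 records inside the assumed kilometre range of the bridge.
bridge <- toy[toy$br == 101 & toy$km >= 321 & toy$km < 334, ]
print(bridge)
```

Trying day-month-year first matters here: a string such as "25-12-2010" could otherwise be misread by a year-first format, so the fallback is applied only to values the first format rejects.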

31 features

id	numeric	9366 unique values 0 missing
data_inversa	string	3402 unique values 0 missing
dia_semana	string	7 unique values 0 missing
horario	string	971 unique values 0 missing
uf	string	1 unique values 0 missing
br	numeric	1 unique values 0 missing
km	numeric	101 unique values 0 missing
municipio	string	2 unique values 0 missing
causa_acidente	string	25 unique values 0 missing
tipo_acidente	string	22 unique values 0 missing
classificacao_acidente	string	4 unique values 0 missing
fase_dia	string	4 unique values 0 missing
sentido_via	string	2 unique values 0 missing
condicao_metereologica	string	10 unique values 0 missing
tipo_pista	string	3 unique values 0 missing
tracado_via	string	8 unique values 0 missing
uso_solo	string	4 unique values 0 missing
ano	numeric	14 unique values 0 missing
pessoas	numeric	19 unique values 0 missing
mortos	numeric	4 unique values 0 missing
feridos_leves	numeric	16 unique values 0 missing
feridos_graves	numeric	6 unique values 0 missing
ilesos	numeric	12 unique values 0 missing
ignorados	numeric	4 unique values 0 missing
feridos	numeric	15 unique values 0 missing
veiculos	numeric	9 unique values 0 missing
latitude	numeric	213 unique values 8876 missing
longitude	numeric	223 unique values 8876 missing
regional	string	1 unique values 8876 missing
delegacia	string	2 unique values 8876 missing
uop	string	2 unique values 8876 missing
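With the features listed above, a first look at whether accidents have decreased could start from yearly counts. The sketch below uses base R on a small made-up data frame that only mimics the ano, mortos and latitude columns; it assumes one row per accident, and the real file would be read with read.csv("accidents-rio-niteroi-bridge.csv") instead.

```r
# Toy stand-in for the real dataset; all values are made up for illustration.
accidents <- data.frame(
  ano = c(2007, 2007, 2008, 2008, 2008, 2009),
  mortos = c(0, 1, 0, 0, 0, 0),
  latitude = c(NA, NA, NA, NA, -22.87, -22.88)
)

# Accidents per year: count the rows in each year (assumes one row per accident).
per_year <- aggregate(
  list(accidents = accidents$ano),
  by = list(ano = accidents$ano),
  FUN = length
)

# Deaths per year: sum the mortos column within each year.
deaths <- aggregate(mortos ~ ano, data = accidents, FUN = sum)

# Share of records without coordinates; in the real data latitude and
# longitude are missing for 8876 records, so map-based analysis covers few rows.
missing_share <- mean(is.na(accidents$latitude))

print(per_year)
print(deaths)
print(missing_share)
```

Yearly counts alone do not show whether a change is due to cameras or to traffic volume, so any comparison would need extra information that is not part of this dataset.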