Chapter 2 eq_location_clean

2.1 Introduction

This function creates a new column with the earthquake LOCATION. The function eq_clean_data uses it behind the scenes, so it is not necessary to call this function after call eq_clean_data.

2.2 Example

Piping a raw data to creates a LOCATION column.

# Path to the raw data.
raw_data_path <- system.file("extdata", "signif.txt", package = "msdr")

# Loading the dataset of Earthquake.
df <- readr::read_delim(file = raw_data_path,      
                        delim = '\t',              
                        col_names = TRUE,          
                        progress = FALSE,           
                        col_types = readr::cols())

# Printing some columns.
df %>%
       eq_location_clean() %>%
              # Selecting some features.
              select(YEAR,
                     COUNTRY,
                     LOCATION,
                     EQ_PRIMARY,
                     TOTAL_DEATHS) %>% 
                     # Filtering.
                     filter(YEAR > 1990 &
                            YEAR < 2019) %>%
                            # Show the first 10 rows.
                            head(10) %>%
                                   # Enhance table visualization.
                                   kable()

YEAR	COUNTRY	LOCATION	EQ_PRIMARY	TOTAL_DEATHS
1991	MYANMAR (BURMA)	Thabeikkyin, Mandalay	7.1	NA
1991	AFGHANISTAN	Badakhstan, Baghlan, Laghman, Nagarhar	6.4	848
1991	SOLOMON ISLANDS	Solomon Islands	6.9	NA
1991	FRANCE	France	3.8	9
1991	RUSSIA	Kuril Islands	5.7	NA
1991	BERING SEA	Bering Sea	6.7	NA
1991	CHINA	Kalpin	6.1	NA
1991	CHINA	Ne, Datong	5.5	NA
1991	PERU	Rioja, Neuva Cajamarca	6.4	NA
1991	PERU	Rioja, Moyobamba, Nueva Cajamarca	6.7	53

As you can see, the LOCATION column has only cities in Title Case mode.