100 Quick R Commands for Every Data Science Task

btd
3 min read · Nov 23, 2023

Here are 100 one-liners in R grouped by various concepts in data science:

Data Manipulation and Cleaning

  1. Subset a dataframe: subset(df, column == value)
  2. Remove missing values: na.omit(df)
  3. Rename a column: names(df)[names(df) == "old_name"] <- "new_name"
  4. Convert factor to numeric: as.numeric(as.character(factor_column))
  5. Create a new variable: df$new_var <- df$var1 + df$var2
  6. Filter rows based on condition: df[df$column > 10,]
  7. Remove duplicates: unique(df)
  8. Convert character to date: as.Date(character_column, format="%Y-%m-%d")
  9. Pivot data from wide to long (requires the reshape2 package): reshape2::melt(df, id.vars=c("id"), measure.vars=c("var1", "var2"))
  10. Aggregate data: aggregate(value ~ group, data=df, FUN=mean)
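
To see how several of these one-liners fit together, here is a minimal sketch that chains them on a small made-up data frame. The data and the column names (id, group, var1, var2, day) are assumptions for illustration only. It uses base R throughout, so the melt() step, which needs an add-on package, is left out.

```r
# Hypothetical toy data frame; every name and value here is illustrative.
df <- data.frame(
  id    = c(1, 2, 2, 3, 4),
  group = c("a", "a", "a", "b", "b"),
  var1  = c(5, 12, 12, NA, 20),
  var2  = c(1, 2, 2, 3, 4),
  day   = c("2023-01-01", "2023-01-02", "2023-01-02",
            "2023-01-03", "2023-01-04"),
  stringsAsFactors = FALSE
)

# Remove duplicates: the second row for id 2 is an exact copy and is dropped.
df_unique <- unique(df)

# Remove missing values: drops the row where var1 is NA.
df_clean <- na.omit(df_unique)

# Rename a column, then convert it from character to Date.
names(df_clean)[names(df_clean) == "day"] <- "date"
df_clean$date <- as.Date(df_clean$date, format = "%Y-%m-%d")

# Create a new variable from two existing ones.
df_clean$total <- df_clean$var1 + df_clean$var2

# Filter rows based on a condition.
big <- df_clean[df_clean$var1 > 10, ]

# Aggregate: mean of total within each group.
group_means <- aggregate(total ~ group, data = df_clean, FUN = mean)

# Convert factor to numeric: go through character first, because
# as.numeric() on a factor returns the internal level codes instead.
f     <- factor(c("10", "20", "10"))
nums  <- as.numeric(as.character(f))
codes <- as.numeric(f)
```

Running unique() before na.omit() is a choice made for this sketch, not a requirement; here the order does not change the result. Note also that filtering with df[df$column > 10, ] returns NA-filled rows when the column contains missing values, which is one reason to clean first.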

Data Visualization

  1. Plot a histogram: hist(df$column)
  2. Scatter plot: plot(df$var1, df$var2)
  3. Boxplot: boxplot(df$column)
  4. Line chart: plot(df$time, df$value, type="l")
  5. Bar chart: barplot(df$counts, names.arg=df$categories)
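
The plotting one-liners above can be tried together with a short sketch like the following. The data is randomly generated and the column names are assumptions chosen to mirror the one-liners; the bar chart uses a separate, hypothetical counts_df because counts per category usually have a different number of rows than the raw data.

```r
# Hypothetical toy data; names mirror the one-liners above.
set.seed(42)  # make the random values reproducible
df <- data.frame(
  time  = 1:20,
  value = cumsum(rnorm(20)),
  var1  = runif(20),
  var2  = runif(20)
)
counts_df <- data.frame(
  categories = c("a", "b", "c"),
  counts     = c(3, 7, 5)
)

# Draw all five plots in a 2 x 3 grid; par() returns the old settings.
op <- par(mfrow = c(2, 3))

h <- hist(df$value, main = "Histogram", xlab = "value")
plot(df$var1, df$var2, main = "Scatter plot", xlab = "var1", ylab = "var2")
b <- boxplot(df$value, main = "Boxplot")
plot(df$time, df$value, type = "l", main = "Line chart",
     xlab = "time", ylab = "value")
mids <- barplot(counts_df$counts, names.arg = counts_df$categories,
                main = "Bar chart")

par(op)  # restore the previous graphics settings
```

hist(), boxplot(), and barplot() also return values (bin counts, summary statistics, and bar midpoints respectively), which can be handy for adding annotations afterwards.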
