tidytuesday | Official repo for the #tidytuesday project | Data Visualization library

by rfordatascience | HTML | Version: tidytuesday_tweets-dda1055 | License: CC0-1.0

kandi X-RAY | tidytuesday Summary

tidytuesday is an HTML library typically used in Analytics and Data Visualization applications. It has no reported bugs or vulnerabilities, a permissive license, and medium support. You can download it from GitHub.

A weekly data project aimed at the R ecosystem. As this project was born out of the R4DS Online Learning Community and the R for Data Science textbook, an emphasis was placed on understanding how to summarize and arrange data to make meaningful charts with ggplot2, tidyr, dplyr, and other tools in the tidyverse ecosystem. However, any code-based methodology is welcome - just please remember to share the code used to generate the results.

Support

tidytuesday has a medium active ecosystem.

It has 5635 stars and 2272 forks. There are 541 watchers for this library.

It had no major release in the last 12 months.

There are 187 open issues and 195 have been closed. On average issues are closed in 181 days. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of tidytuesday is tidytuesday_tweets-dda1055.

Quality

tidytuesday has no bugs reported.

Security

tidytuesday has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

tidytuesday is licensed under the CC0-1.0 License. This license is Permissive.

Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

tidytuesday releases are available to install and integrate.

Installation instructions are not available. Examples and code snippets are available.

tidytuesday Key Features

No Key Features are available at this moment for tidytuesday.

tidytuesday Examples and Code Snippets

No Code Snippets are available at this moment for tidytuesday.

Community Discussions

Trending Discussions on tidytuesday

Tidymodels / XGBoost error in last_fit with rsplit value

Tidymodels - Help evaluating regression models made via recipes

R Shiny app loads, but radio buttons do not select values properly

scale_x_date limits problem or filtering problem?

How to increase height of Plot to occupy full height space in flexdashboard shiny in r?

Using a newly assigned column in a `groupby` statement? (method chaining with Pandas)

How do I remove labels based on certain conditions in ggplot2?

How to color geom_line() & geom_point() properly based on condition if less than 0 or greater than 0 in r?

How do I repeat a symbol in the legend of a ggplot to gain a better view of the symbol?

How to sort/order values within a factor in ggplot r?

QUESTION

Tidymodels / XGBoost error in last_fit with rsplit value

Asked 2021-Jun-15 at 04:08

I am trying to follow this tutorial here - https://juliasilge.com/blog/xgboost-tune-volleyball/

I am using it on the most recent Tidy Tuesday dataset about great lakes fishing - trying to predict agency based on many other values.

ALL of the code below works except the final row where I get the following error:

...

ANSWER

Answered 2021-Jun-15 at 04:08

If we look at the documentation of last_fit(), we see that split must be

An rsplit object created from `rsample::initial_split()`.

You accidentally passed the cross-validation folds object stock_folds into split, but you should have passed the rsplit object stock_split instead.

Source https://stackoverflow.com/questions/67978723

QUESTION

Tidymodels - Help evaluating regression models made via recipes

Asked 2021-May-24 at 23:31

I am working with the current tidytuesday data about salaries and trying to create a model with tidymodels and recipes. I want to predict salary with many of the other factors present using the recipes code, but I run into an issue.

Issue 1 - My recipe says there are empty rows, but I do not know how to figure out why. This does not give an error, so maybe it is not a problem.

Issue 2 - Understanding what my models actually did and how to visualize the performance. I want to plot the model's performance on the initial data. Here is an example of my goal: https://indescribled.files.wordpress.com/2021/05/image-17.png?w=782

I do not understand exactly how to use the predict function with my recipe. juice(rec) is less than 1000 rows while the testing data is about 6000. Perhaps I am reading it backwards, but can someone try to point me in the right direction?

The code below should be an exact reproduction of mine.

...

ANSWER

Answered 2021-May-24 at 23:31

Looks like you have things pretty well along!

Source https://stackoverflow.com/questions/67665563

QUESTION

R Shiny app loads, but radio buttons do not select values properly

Asked 2021-May-06 at 07:47

This is my first time using Stack Overflow, so apologies if I do this wrong.

I'm fairly new to coding in R and I'm trying to make a simple Shiny app using a TidyTuesday dataset. I wanted to make a map with points showing the different types of water systems ("water_tech") and radio buttons to choose which type of water system is plotted on the map. I got the app to load without an error message, however no matter which button is selected, all of the different types of water systems are plotted on the map, not just the one I selected (essentially, the buttons don't work). If anyone has any ideas about what could be causing this to happen I would greatly appreciate it!

Reproducible code:

...

ANSWER

Answered 2021-May-06 at 07:47

rwater() has no effect in this code:

Source https://stackoverflow.com/questions/67412341

QUESTION

scale_x_date limits problem or filtering problem?

Asked 2021-May-04 at 15:16

I'm trying to build a plot with geom_line based on the data from 1990 to 2020, and I'd like my x-axis breaks to be every 5 years. I've filtered the original data to the period 1990-2020, set the limits of scale_x_date and found other answers here on using expand = c(0,0), but there are still some extra years in the beginning that are messing with the breaks so the five years periods are not 1990-1995-2000 etc, but 1993-1998-2003 etc. The limits themselves work properly though, if you set any other dates. What might be the problem here? Thanks!!

and here's my code:

...

ANSWER

Answered 2021-May-04 at 15:16

expand= is an argument to scale_x_date, but you have it within limits=as.Date(.), and it is being ignored:

Source https://stackoverflow.com/questions/67387354

QUESTION

How to increase height of Plot to occupy full height space in flexdashboard shiny in r?

Asked 2021-Apr-02 at 18:33

I am using the tidytuesday UN votes dataset and trying to adjust a facet plot (by nrows) so that it occupies the full height of its flexdashboard pane, but it uses barely half of the space, which makes the plot hard to read.

An alternative is to make 5 separate plots, but that runs the code 5 times when it could be done once with a facet.

I have also tried facet_grid and par(mfrow = c(1,1)), but neither helped.

Code: ...

ANSWER

Answered 2021-Apr-02 at 18:14

Use renderPlot to wrap your plot so it will be responsive:

Source https://stackoverflow.com/questions/66919296

QUESTION

Using a newly assigned column in a `groupby` statement? (method chaining with Pandas)

Asked 2021-Mar-22 at 00:36

I'm an R (dplyr) user who's learning how to clean data using pandas. I am practicing using the wind turbines dataset, and I would like to be able to return a data frame with the count of manufacturers per year in British Columbia since the year 2000.

The chunk below returns an error NameError: name 'year' is not defined. Is there a way to pipe a newly generated column, year in this case, into a groupby statement within one chain?

...

ANSWER

Answered 2021-Mar-21 at 23:13

You almost got it, you only have to change the groupby parameters:

Source https://stackoverflow.com/questions/66738389
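
The code from this question and answer is not reproduced on this page. As a minimal sketch of the general pattern, assuming a small made-up data frame (the column names `province`, `commissioning_date`, and `manufacturer` and all values are hypothetical, not taken from the real wind turbines dataset): a column created with `.assign()` inside a chain can be passed to `.groupby()` by its string name, whereas a bare `year` is looked up as a Python variable and raises the `NameError` described in the question.

```python
import pandas as pd

# Hypothetical stand-in for the wind turbines data; names and values are assumptions.
turbines = pd.DataFrame({
    "province": ["British Columbia", "British Columbia", "British Columbia", "Ontario"],
    "commissioning_date": ["2001", "2001", "2009", "2005"],
    "manufacturer": ["Vestas", "Vestas", "GE", "Siemens"],
})

counts = (
    turbines
    .query("province == 'British Columbia'")
    # Create the new column inside the chain.
    .assign(year=lambda d: d["commissioning_date"].astype(int))
    .query("year >= 2000")
    # Refer to the new column by its string name, not as a bare variable.
    .groupby(["year", "manufacturer"])
    .size()
    .reset_index(name="n")
)

print(counts)
```

This counts rows per year and manufacturer; whether that matches the exact summary the asker wanted is an assumption.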

QUESTION

How do I remove labels based on certain conditions in ggplot2?

Asked 2021-Feb-25 at 17:17

I am doing the #duboischallenge and am on challenge 5.

I would like to remove the labels "0%" on the plot.

code (still learning so might not be most efficient code!):

...

ANSWER

Answered 2021-Feb-25 at 17:17

You can simply take the same approach as with colouring the labels:

Source https://stackoverflow.com/questions/66372501

QUESTION

How to color geom_line() & geom_point() properly based on condition if less than 0 or greater than 0 in r?

Asked 2021-Feb-25 at 15:44

Using a tidytuesday dataset, I am trying to create a line plot that is coloured blue where the value is greater than 0 and red where it is less than 0.

...

ANSWER

Answered 2021-Feb-25 at 15:44

As for giving lines different colours based on whether they are above/below some point, you'd need to interpolate the lines at the crossover points to assign different colours, as line segments themselves cannot have multiple colours. Here is a self-plagiarised solution for interpolating such lines.

First, we'll write two functions. One for finding crossovers and shaping data, and the other one for interpolating at crossover sites.

Source https://stackoverflow.com/questions/66370513
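
The answer's two R functions are not reproduced on this page. The underlying idea, splitting each line segment at the point where it crosses zero so that every piece has a single sign, can be sketched in plain Python. This is an illustration under stated assumptions, not the answer's implementation: the function names `sign_label` and `split_at_zero` are hypothetical, and the threshold is fixed at 0.

```python
def sign_label(value):
    """Classify a value as lying on or above zero, or below it."""
    return "positive" if value >= 0 else "negative"


def split_at_zero(xs, ys):
    """Split a polyline into pieces that never cross y = 0.

    Returns a list of (x0, y0, x1, y1, label) tuples, one per piece.
    """
    points = list(zip(xs, ys))
    pieces = []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 * y1 < 0:
            # Strict sign change: linearly interpolate the x where y hits 0.
            t = y0 / (y0 - y1)
            xc = x0 + t * (x1 - x0)
            pieces.append((x0, y0, xc, 0.0, sign_label(y0)))
            pieces.append((xc, 0.0, x1, y1, sign_label(y1)))
        else:
            # No crossing: label by whichever endpoint is not exactly zero.
            ref = y0 if y0 != 0 else y1
            pieces.append((x0, y0, x1, y1, sign_label(ref)))
    return pieces


pieces = split_at_zero([0, 1, 2], [1, -1, -3])
for piece in pieces:
    print(piece)
```

Each returned piece can then be drawn in one colour according to its label, which is what a plotting layer needs because a single segment cannot carry two colours.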

QUESTION

How do I repeat a symbol in the legend of a ggplot to gain a better view of the symbol?

Asked 2021-Feb-17 at 07:11

I have just started the du Bois challenge as part of #tidytuesday, and am on challenge 1: Comparative Increase of White and Colored Population in Georgia

The original graph has the "WHITE" symbol with 4 dashes, but when I replicate the plot, the legend only has 1 and a bit of the second dash.

How do I repeat the symbol in the legend to get 4 dashes? I don't need to increase the size, just the repetition.

...

ANSWER

Answered 2021-Feb-16 at 20:15

Try setting legend.key.width in theme:

Source https://stackoverflow.com/questions/66231244

QUESTION

How to sort/order values within a factor in ggplot r?

Asked 2021-Jan-15 at 11:35

I am using a dataset from tidytuesday and trying to order values within each factor.

For example, in the plot below I want the values of Years to be in ascending order for each city (Years to start from 2012 and run to 2021).

How can I order them? Is there an fct_() function that orders within a factor?

df summary:

...

ANSWER

Answered 2021-Jan-15 at 11:35

Try this:

Source https://stackoverflow.com/questions/65734845

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install tidytuesday

You can download it from GitHub.

Support

Submit the dataset as an Issue:
a. Find an interesting dataset.
b. Find a report, blog post, article, etc. relevant to the data.
c. Submit the dataset as an Issue along with a link to the article.