data-wrangling
Here are 506 public repositories matching this topic...
-
Updated
Jan 6, 2021 - Python
-
Updated
Mar 16, 2021
-
Updated
May 7, 2021 - Jupyter Notebook
-
Updated
May 7, 2021 - TypeScript
-
Updated
Jan 5, 2021 - Jupyter Notebook
-
Updated
Feb 11, 2021 - HTML
-
Updated
May 4, 2021 - C#
-
Updated
Mar 29, 2021 - Tcl
-
Updated
Mar 20, 2019 - Jupyter Notebook
-
Updated
Apr 26, 2021 - Python
I suggest either adding a short code piece to use the rename() function to change the column "genus" to "genera" (thus alerting the learners to their relationship here, while adding a new function) or changing the column name in the original dataset. Otherwise, I've found that using the correct plural for genus confuses learners who are not biologists. Although it's the R ecology lesson and one
-
Updated
Feb 9, 2021 - R
-
Updated
Jun 7, 2020 - Jupyter Notebook
-
Updated
May 6, 2021 - R
-
Updated
May 11, 2018 - JavaScript
This challenge asks student to print an informative message if there are any records in gapminder for the year 2002. Two solutions are provided, one using any(gapminder$year == 2002) (note any() isn't introduced until later in that episode) and one much more complicated one involving counting the number of rows for the year 2002. It seems to me the only reasonable way to do this is with %in%
-
Updated
Apr 29, 2021 - Jupyter Notebook
Dear Community,
There is a typo in the section titled "The StringsAsFactors argument" after the second block of code that demonstrates the use of the str() function. Right after the code boxes is written "We can see that the $Color and $State columns are factors and $Speed is a numeric column", but the box shows that the $Color column is a vector of strings.
Regards,
Rodolfo
Teaching feedback
- I felt like
nuniquewas arbitrarily (re)introduced when it was necessary. It wouldn't be top-of-mind for students solving problems. - The lesson answers need to be adjacent to the exercises.
- I like the pre-introduction of masks and then circling back around to explain them.
- I feel like Part 4 needs to be broken up and integrated across other lessons: it felt thin on its own.
- Horizo
In recent (non-Carpentry) Python courses, we have come across learners that have experience with Python and using JupyterLab or Jupyter Notebooks, however are unaware that you can just run a Python script from the command line. We have observed that this has led to some confusion when they've been working with others who use script files.
I'm not for a second suggesting changing the way the les
-
Updated
Jan 2, 2019 - Jupyter Notebook
-
Updated
Feb 10, 2021 - R
In episode _episodes_rmd/12-time-series-raster.Rmd
There is a big chunk of code that can probably be made to look nicer via dplyr:
# Plot RGB data for Julian day 133
RGB_133 <- stack("data/NEON-DS-Landsat-NDVI/HARV/2011/RGB/133_HARV_landRGB.tif")
RGB_133_df <- raster::as.data.frame(RGB_133, xy = TRUE)
quantiles = c(0.02, 0.98)
r <- quantile(RGB_133_df$X133_HARV_landRGB.1, q
-
Updated
Mar 4, 2020 - HTML
-
Updated
Jun 2, 2020 - R
The discussion of data types and data structures in "Vectors and data types" could be clarified. Perhaps even defining these terms before using them would help. Also note that the first sentence of the section reads "A vector is the most common and basic data type in R, and is pretty much the workhorse of R." perhaps this should be changed to "basic data structure"
The Survey table has a field called quant that holds what type of reading was taken. The values in this column are rad, sal, and temp. There is no legend that explains what these mean on the page where the data is introduced (the selecting data chapter). Much later in the course it's mentioned that these mean 'radiation', 'salinity' and 'temperature', but I think it would also be helpful
-
Updated
May 7, 2021 - Jupyter Notebook
-
Updated
May 3, 2021 - R
Improve this page
Add a description, image, and links to the data-wrangling topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-wrangling topic, visit your repo's landing page and select "manage topics."
To Reproduce
Configure a reconciliation service that returns URIs as IDs.
Hovering over reconciliation suggestions, automatically matched values and manually mapped values will fail to retrieve the preview information as ID contained in the callback is truncated.