Hello! I'm currently going through the R Programming>Looking at Data lesson. I'm in R 4.3.1/RStudio 2023.06.1+524 on macOS 13.5.
After being instructed to try summary(plants), I get the following output:
Scientific_Name    Duration           Active_Growth_Period  Foliage_Color      pH_Min
Length:5166        Length:5166        Length:5166           Length:5166        Min.   :3.000
Class :character   Class :character   Class :character      Class :character   1st Qu.:4.500
Mode  :character   Mode  :character   Mode  :character      Mode  :character   Median :5.000
                                                                               Mean   :4.997
                                                                               3rd Qu.:5.500
                                                                               Max.   :7.000
                                                                               NA's   :4327

pH_Max            Precip_Min       Precip_Max        Shade_Tolerance    Temp_Min_F
Min.   : 5.100    Min.   : 4.00    Min.   : 16.00    Length:5166        Min.   :-79.00
1st Qu.: 7.000    1st Qu.:16.75    1st Qu.: 55.00    Class :character   1st Qu.:-38.00
Median : 7.300    Median :28.00    Median : 60.00    Mode  :character   Median :-33.00
Mean   : 7.344    Mean   :25.57    Mean   : 58.73                       Mean   :-22.53
3rd Qu.: 7.800    3rd Qu.:32.00    3rd Qu.: 60.00                       3rd Qu.:-18.00
Max.   :10.000    Max.   :60.00    Max.   :200.00                       Max.   : 52.00
NA's   :4327      NA's   :4338     NA's   :4338                         NA's   :4328
However, the output is described by the lesson as follows:
Duration (also a factor variable) tells us that our dataset contains 3031 Perennial plants, 682 Annual plants, etc.
This does not match the output, which shows Duration as a character variable, not a factor. The same happens with Active_Growth_Period, which the lesson describes as:
| You can see that R truncated the summary for Active_Growth_Period by including a catch-all
| category called 'Other'. Since it is a categorical/factor variable, we can see how many times
| each value actually occurs in the data with table(plants$Active_Growth_Period).
Perhaps something changed in the dataset or default output of summary, but this is confusing and I'm not sure how to get output that matches the description. Many thanks for maintaining this valuable project!
In case this is useful for any other beginners finding this issue, I worked around it by coercing the appropriate columns from character vectors into factors:
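A minimal sketch of that kind of coercion, not necessarily the exact code used here. It assumes Duration and Active_Growth_Period are the columns to convert (the two the lesson text calls factors); the small `plants` data frame built at the top is a hypothetical stand-in with made-up values so the snippet runs on its own. Inside swirl, `plants` is already loaded and that first step can be skipped.

```r
# Hypothetical stand-in for the lesson's `plants` data frame (made-up values).
# In swirl, `plants` already exists, so skip this step there.
plants <- data.frame(
  Scientific_Name      = c("Species A", "Species B", "Species C"),
  Duration             = c("Perennial", "Perennial", "Annual"),
  Active_Growth_Period = c("Spring and Summer", NA, "Summer"),
  pH_Min               = c(5.0, 4.0, NA),
  stringsAsFactors     = FALSE
)

# Columns the lesson treats as categorical (assumed list; extend as needed).
factor_cols <- c("Duration", "Active_Growth_Period")

# Coerce each listed character column to a factor, leaving the others untouched.
plants[factor_cols] <- lapply(plants[factor_cols], factor)

# summary() now reports counts per level instead of Length/Class/Mode.
summary(plants$Duration)

# table() counts each non-missing value, as the lesson suggests.
table(plants$Active_Growth_Period)
```

One possible explanation for the mismatch, offered as an assumption rather than a confirmed diagnosis: since R 4.0.0, `data.frame()` and `read.csv()` default to `stringsAsFactors = FALSE`, so a lesson written against an older R version would have seen these columns as factors.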