Data Frames
The data.frame
is, like we saw in 9 Data Structures, one of the core R data structures used in Statistics, Machine Learning, Data Science. Data frames store tabular data, where rows usually represent cases and columns usually represent variables. Columns can be of different classes/types, e.g. numeric (float), integer, character, factor.
When working with tabular data structures, core data science techniques to learn include:
- Reading in data (12 Base Data I/O)
- Filtering and selecting rows and columns (25 Working with Data frames)
- Reshaping (26 Reshaping)
- Joins (27 Joins)
See also
data.table
, a modern, highly optimized and powerful alternative todata.frame
.