R Basic

How to Create a Data Frame from Vectors in R

In R, you can think of a vector as a series of values in a single column. It contains the same data type. Each column of a data frame is a vector. So, if I have several vectors, I can combine them and create a data frame out of them.

To create a data frame from vectors in R, use the data.frame() function and pass the vectors. Ensure you have the same length of all the vectors; otherwise, R throws an error.

After the data frame has been created, each column retains its data type.

Basic Data Frame Creation

The above figure shows five vectors. Each vector has its own type and the same length, which is five. After converting vectors to a data frame, each vector becomes a column of the data frame.

title <- c("Inception", "The Dark Knight", "Interstellar", "The Matrix", "Gladiator")
genre <- c("Sci-Fi", "Action", "Sci-Fi", "Sci-Fi", "Drama")
year <- c(2010, 2008, 2014, 1999, 2000)
rating <- c(8.8, 9.0, 8.6, 8.7, 8.5)
runtime <- c(148, 152, 169, 136, 155)


# Create a data frame from vectors
df <- data.frame(title, genre, year, rating, runtime)

print(df)

Output

Displaying the internal structure of the data frame

If you want to implement the internal structure of the data frame, you can use the “str()” function and pass the data frame to it.

title <- c("Inception", "The Dark Knight", "Interstellar", "The Matrix", "Gladiator")
genre <- c("Sci-Fi", "Action", "Sci-Fi", "Sci-Fi", "Drama")
year <- c(2010, 2008, 2014, 1999, 2000)
rating <- c(8.8, 9.0, 8.6, 8.7, 8.5)
runtime <- c(148, 152, 169, 136, 155)


# Create a data frame from vectors
df <- data.frame(title, genre, year, rating, runtime)

# Printing data frame structure
str(df)

Output

The above screenshot shows that title, genre have character columns, and year, rating, runtime have numeric columns.

Adding a new column to an existing data frame

To add a column to an existing data frame, you can use the “$” operator and assign values to the new column.

Let’s add the “director” column to our data frame.

title <- c("Inception", "The Dark Knight", "Interstellar", "The Matrix", "Gladiator")
genre <- c("Sci-Fi", "Action", "Sci-Fi", "Sci-Fi", "Drama")
year <- c(2010, 2008, 2014, 1999, 2000)
rating <- c(8.8, 9.0, 8.6, 8.7, 8.5)
runtime <- c(148, 152, 169, 136, 155)


# Create a data frame from vectors
df <- data.frame(title, genre, year, rating, runtime)

# Adding a new column to an existing data Frame
df$director <- c("C Nolan", "C Nolan", "C Nolan", "L Wachowski", "R Scott")
print(df)

Output

Adding a new row to an existing data frame

You can add a new row to an existing data frame using the rbind() function. It accepts the first argument as an existing data frame and a vector that will become a new row in the data frame.

title <- c("Inception", "The Dark Knight", "Interstellar", "The Matrix", "Gladiator")
genre <- c("Sci-Fi", "Action", "Sci-Fi", "Sci-Fi", "Drama")
year <- c(2010, 2008, 2014, 1999, 2000)
rating <- c(8.8, 9.0, 8.6, 8.7, 8.5)
runtime <- c(148, 152, 169, 136, 155)

# Create a data frame from vectors
df <- data.frame(title, genre, year, rating, runtime)

# Adding a row to an existing data frame
df <- rbind(df, c("Kaun", "Horror", 1999, 7.8, 134))
print(df)

Output

That’s it!

Recent Posts

cbind() Function: Binding R Objects by Columns

R cbind (column bind) is a function that combines specified vectors, matrices, or data frames…

1 week ago

rbind() Function: Binding Rows in R

The rbind() function combines R objects, such as vectors, matrices, or data frames, by rows.…

1 week ago

as.numeric(): Converting to Numeric Values in R

The as.numeric() function in R converts valid non-numeric data into numeric data. What do I…

2 weeks ago

Calculating Natural Log using log() Function in R

The log() function calculates the natural logarithm (base e) of a numeric vector. By default,…

3 weeks ago

Dollar Sign ($ Operator) in R

In R, you can use the dollar sign ($ operator)  to access elements (columns) of…

1 month ago

Calculating Absolute Value using abs() Function in R

The abs() function calculates the absolute value of a numeric input, returning a non-negative (only…

1 month ago