Pandas dataframe.replace()



Python | Pandas dataframe.replace()

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier.

Pandas dataframe.replace() function is used to replace a string, regex, listdictionary, series, number etc. from a dataframe. This is a very rich function as it has many variations.
The most powerful thing about this function is that it can work with Python regex (regular expressions).

Syntax: DataFrame.replace(to_replace=None, value=None, inplace=False, limit=None, regex=False, method=’pad’, axis=None)

Parameters:
to_replace : [str, regex, list, dict, Series, numeric, or None] pattern that we are trying to replace in dataframe.
value : Value to use to fill holes (e.g. 0), alternately a dict of values specifying which value to use for each column (columns not in the dict will not be filled). Regular expressions, strings and lists or dicts of such objects are also allowed.
inplace : If True, in place. Note: this will modify any other views on this object (e.g. a column from a DataFrame). Returns the caller if this is True.
limit : Maximum size gap to forward or backward fill
regex : Whether to interpret to_replace and/or value as regular expressions. If this is True then to_replace must be a string. Otherwise, to_replace must be None because this parameter will be interpreted as a regular expression or a list, dict, or array of regular expressions.
method : Method to use when for replacement, when to_replace is a list.

Returns: filled : NDFrame

For link to CSV file Used in Code, click here

Example #1: Replace team “Boston Celtics” with “Omega Warrior” in the nba.csv file

# importing pandas as pd
import pandas as pd
 
# Making data frame from the csv file
df = pd.read_csv("nba.csv")
 
# Printing the first 10 rows of the data frame for visualization
df[:10]

Output:

We are going to replace team “Boston Celtics” with “Omega Warrior” in the ‘df’ data frame

# this will replace "Boston Celtics" with "Omega Warrior"
df.replace(to_replace ="Boston Celtics",
                 value ="Omega Warrior")

Output:

 

Example #2: Replacing more than one value at a time. Using python list as an argument

We are going to replace team “Boston Celtics” and “Texas” with “Omega Warrior” in the ‘df’ dataframe.

# importing pandas as pd
import pandas as pd
 
# Making data frame from the csv file
df = pd.read_csv("nba.csv")
 
# this will replace "Boston Celtics" and "Texas" with "Omega Warrior"
df.replace(to_replace =["Boston Celtics", "Texas"], 
                            value ="Omega Warrior")

Output:

Notice the College column in the first row, “Texas” has been replaced with “Omega Warriors”

Example #3: Replace the Nan value in the data frame with -99999 value.

# importing pandas as pd
import pandas as pd
 
# Making data frame from the csv file
df = pd.read_csv("nba.csv")
 
# will replace  Nan value in dataframe with value -99999 
df.replace(to_replace = np.nan, value =-99999)

Output:

Notice all the Nan value in the data frame has been replaced by -99999. Though for practical purposes we should be careful with what value we are replacing nan value.

 

Last Updated on October 26, 2021 by admin

Leave a Reply

Your email address will not be published. Required fields are marked *

Recommended Blogs