Sorting datasets based on multiple columns using sort_values
April 08, 2019
You can sort a dataset by a single column or by several columns. First load your data into a pandas DataFrame, then call its sort_values method.
Consider a Loan Prediction dataset. We will sort it by applicant income and coapplicant income, in both ascending and descending order.
Step 1: Import the required libraries
import pandas as pd
import numpy as np
Step 2: Load the dataset
dataset = pd.read_csv("C:/train_loan_prediction.csv")
Step 3: Sort the dataset in ascending and descending order
Let's sort our dataset by the ApplicantIncome and CoapplicantIncome columns.
dataset_sorted = dataset.sort_values(['ApplicantIncome','CoapplicantIncome'])
dataset_sorted[['ApplicantIncome','CoapplicantIncome']].head(50)
The code above sorts the dataset by multiple columns (ApplicantIncome first, then CoapplicantIncome) in ascending order, which is the default. It returns a new sorted DataFrame and leaves the original dataset unchanged.
To sort the dataset in descending order, pass ascending=False to sort_values like this:
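To see how the second column only breaks ties in the first, here is a minimal sketch on a small hand-made DataFrame. The variable names toy and toy_sorted and all the values are invented for illustration; they are not taken from the loan dataset.

```python
import pandas as pd

# Toy data, invented for illustration; not from the loan dataset.
toy = pd.DataFrame({
    "ApplicantIncome": [3000, 1500, 3000, 1500],
    "CoapplicantIncome": [0.0, 2000.0, 1200.0, 500.0],
})

# Rows are ordered by ApplicantIncome first. CoapplicantIncome only
# decides the order among rows that share the same ApplicantIncome.
toy_sorted = toy.sort_values(["ApplicantIncome", "CoapplicantIncome"])
print(toy_sorted)
```

The original row labels are kept after sorting, so the index of toy_sorted is no longer in numeric order. Calling reset_index(drop=True) on the result renumbers the rows if you need that.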
dataset_sorted = dataset.sort_values(['ApplicantIncome','CoapplicantIncome'], ascending=False)
dataset_sorted[['ApplicantIncome','CoapplicantIncome']].head(50)
The data is now shown in descending order, so applicants with the highest income appear first.
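The ascending argument also accepts a list with one flag per column, which is useful when the columns should be sorted in different directions. The sketch below uses the same kind of invented toy data (the names toy and mixed are hypothetical) and sorts ApplicantIncome from high to low while keeping CoapplicantIncome from low to high within each group.

```python
import pandas as pd

# Toy data, invented for illustration; not from the loan dataset.
toy = pd.DataFrame({
    "ApplicantIncome": [3000, 1500, 3000, 1500],
    "CoapplicantIncome": [0.0, 2000.0, 1200.0, 500.0],
})

# One flag per column: ApplicantIncome descending,
# CoapplicantIncome ascending among rows with equal ApplicantIncome.
mixed = toy.sort_values(
    ["ApplicantIncome", "CoapplicantIncome"],
    ascending=[False, True],
)
print(mixed)
```

The list passed to ascending must have the same length as the list of column names.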