

Data & code used in this Tutorial:
Python Pandas Documentation:
Let me know if you have any questions!
In this video we walk through many of the fundamental concepts to use the Python Pandas Data Science Library. We start off by installing pandas and loading in an example csv. We then look at different ways to read the data. Read a column, rows, specific cell, etc. Also ways to read data based on conditioning. We then move into some more advanced ways to sort & filter data. We look at making conditional changes to our data. We also start doing aggregate stats using the groupby function. We finished the video talking about how you would work with a very large dataset (many gigabytes)
I realized as I upload this video there are some additional things I want to talk about in a later video. The first thing that comes to mind immediately is using the apply() function on a dataframe to alter the data using a custom or lambda function. If you have questions on this or anything else before I get around to making a part 2, feel free to write me a note in the comments.
If you enjoyed this video, be sure to throw it a like and make sure to subscribe to not miss any future videos!
Thanks for watching friends! Happy coding! π
———————————————
Follow me on social media!
Instagram |
Twitter |
β Kite is a free AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while youβre typing. I’ve been using Kite for 6 months and I love it!
———————————————
Link to original source of data from Kaggle:
———————————————
Video Outline!
0:00 – Why Pandas?
1:46 – Installing Pandas
2:03 – Getting the data used in this video
3:50 – Loading the data into Pandas (CSVs, Excel, TXTs, etc.)
8:49 – Reading Data (Getting Rows, Columns, Cells, Headers, etc.)
13:10 – Iterate through each Row
14:11 – Getting rows based on a specific condition
15:47 – High Level description of your data (min, max, mean, std dev, etc.)
16:24 – Sorting Values (Alphabetically, Numerically)
18:19 – Making Changes to the DataFrame
18:56 – Adding a column
21:22 – Deleting a column
22:14 – Summing Multiple Columns to Create new Column.
24:14 – Rearranging columns
28:06 – Saving our Data (CSV, Excel, TXT, etc.)
31:47 – Filtering Data (based on multiple conditions)
35:40 – Reset Index
37:41 – Regex Filtering (filter based on textual patterns)
43:08 – Conditional Changes
47:57 – Aggregate Statistics using Groupby (Sum, Mean, Counting)
54:53 – Working with large amounts of data (setting chunksize)
————————-
If you are curious to learn how I make my tutorials, check out this video:
*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
source
Video Outline!
0:45 – Why Pandas?
1:46 – Installing Pandas
2:03 – Getting the data used in this video
3:50 – Loading the data into Pandas (CSVs, Excel, TXTs, etc.)
8:49 – Reading Data (Getting Rows, Columns, Cells, Headers, etc.)
13:10 – Iterate through each Row
14:11 – Getting rows based on a specific condition
15:47 – High Level description of your data (min, max, mean, std dev, etc.)
16:24 – Sorting Values (Alphabetically, Numerically)
18:19 – Making Changes to the DataFrame
18:56 – Adding a column
21:22 – Deleting a column
22:14 – Summing Multiple Columns to Create new Column.
24:14 – Rearranging columns
28:06 – Saving our Data (CSV, Excel, TXT, etc.)
31:47 – Filtering Data (based on multiple conditions)
35:40 – Reset Index
37:41 – Regex Filtering (filter based on textual patterns)
43:08 – Conditional Changes
47:57 – Aggregate Statistics using Groupby (Sum, Mean, Counting)
54:53 – Working with large amounts of data (setting chunksize)
Thanks for watching friends! π
Let me know if you have any questions
2 years after this video was posted, I'm here watching and learning Tons of stuff. Thanks man!!!!
For those have issues reading excel file :
In cmd window :
pip install openpyxl
pip install xldr
df_xlsx=pd.read_excel('pokemon_data.xlsx' , engine="openpyxl")
Amazing!
good stuff π
i hate SMS-es too lol lol
Thank you brother, it really helped. Keep on making.
Thanks for the videos. Very much appreciated
Thanx for such a great video. helped me a lot
THIS CHANNEL IS FUCKIN LIT
This is the best tutorial I have found ever, thank you so much for sharing these skills.
i watched more than 10 different videos about pandas, this is the most easy and understandable one. Worth your time!
This video is just glorious
In 15:26, what if i want to select both 'Grass' and 'Fire', could you show me how can I do that?
57:16
53:37
42:34
38:59
Day 1 on my journey to learn data analysis with python, this vid and kaggle's free pandas course is just what i needed to give me more motivation to keep learning.
What the hell, I imagined this topic in afternoon and video recommended after only few hours. And the shocking fact I didn't even searched about this topic from many days.
Thank you so much.
Only 1% worst thing of this video – god damn Ads!
awesome videos bro …thanks
Why don't this work on an excel file. I almost did the same thing!
Very Useful. Tks alot
Impressive. I'd pay to watch you crush kaggle things.
14:10 use
for index, row in df.iterrows():
print(index, row['Name'])