Data Science: Tools & Skills
Preface
Data Science: Tools & Skills
Notice of Rights
Notice of Liability
Trademark Notice
About SitePoint
How to Transition from Software Development to Data Science
What Is Data Science?
The Role of a Data Scientist
Switching to Data Science
Pandas for Excel Users
Big Data in Python: An Introduction to PySpark
How to Dramatically Speed Up Pandas
Vectorization
Top 10 Tools for a Data Scientist
Python for General-purpose Programming
Jupyter for Notebooks
Tableau for Business Intelligence
Pandas for Data Manipulation and Analysis
TensorFlow for Machine Learning
Matplotlib for Visualizations
Google Cloud for Infrastructure Provisioning and Cloud Computing
Apache Spark and Apache Hadoop for Large-scale Data Processing
RapidMiner for Enterprise Data Science Platform
Microsoft Excel for a Back to Basics
Wrap Up
Getting Started with Natural Language Processing in Python
Prerequisites
Step 1: Convert into Tokens
Step 2: Convert Words to their Base Forms
Step 3: Data Cleaning
Word Frequency Distribution
Conclusion
Ensemble Learning: Theory
Why Use Ensemble Methods?
When Should You Avoid Ensemble Methods?
How Do We Build Different Models?
How Do We Combine Models?
Ensemble Learning in Practice
Bagging and Random Forests
Boosting and AdaBoost
Conclusion