SitePoint
Data Science: Tools & Skills
Notice of Rights
Notice of Liability
Trademark Notice
About SitePoint

1

How to Transition from Software Development to Data Science

What Is Data Science?
The Role of a Data Scientist
Switching to Data Science
Vectorization
Python for General-purpose Programming
Jupyter for Notebooks
Tableau for Business Intelligence
Pandas for Data Manipulation and Analysis
TensorFlow for Machine Learning
Matplotlib for Visualizations
Google Cloud for Infrastructure Provisioning and Cloud Computing
Apache Spark and Apache Hadoop for Large-scale Data Processing
RapidMiner for Enterprise Data Science Platform
Microsoft Excel for a Back to Basics
Wrap Up
Prerequisites
Step 1: Convert into Tokens
Step 2: Convert Words to their Base Forms
Step 3: Data Cleaning
Word Frequency Distribution
Conclusion
Why Use Ensemble Methods?
When Should You Avoid Ensemble Methods?
How Do We Build Different Models?
How Do We Combine Models?
Ensemble Learning in Practice
Bagging and Random Forests
Boosting and AdaBoost
Conclusion

How to Dramatically Speed Up Pandas
