zhaopinboai.com

Essential Skills to Secure Your First Data Science Role in 2024

Written on

The Landscape

Securing an entry-level position in Data Science in 2024 is comparable to threading a needle. Out of 4,462 full-time Data Science job listings on LinkedIn in the last month, only 581 were categorized as “Entry-Level,” which is merely 13%.

Delving into these “Entry-Level” job postings, I randomly examined ten and found that six of them required two years of prior experience.

More than half of these so-called entry-level roles expect previous experience!

The competition is tough, but you possess the capability to stand out.

This guide will assist you in honing your focus, acquiring knowledge, and excelling in the field.

Before we explore each skill, I want to share an exciting resource.

datanerd.tech analyzes job postings to reveal the top skills in various data professions.

For instance, here are the top five skills sought for Data Scientist positions in the US:

This platform also presents trending skills, salary data by role, and more:

Pretty cool, right? Give it a try!

Skill 1: SQL

In 2024, SQL remains the essential skill for anyone serious about a career in data.

Disregard the skeptics; SQL is your key to understanding and manipulating data.

As long as relational databases are around, SQL will reign supreme!

Example question

How many times did individuals aged 20–30 in Berlin use Tinder yesterday?

Key concepts

  • Grouping and aggregate functions.
  • Joins (left, inner, outer, etc.).
  • Appending data sources (union).
  • Filtering data with complex conditions; understanding the difference between where and having.
  • Window functions (lag and lead).
  • De-duplicating, sorting, and managing missing data.

How to learn

  • sqlzoo: Hands-on practice questions.
  • sqlcourse: Breaks down concepts with examples and exercises.
  • w3schools: Covers more topics but with simpler challenges.
  • Mode: Visual code illustrations with examples and practice problems.

Practice consistently. Engage daily for at least two weeks leading up to your interviews.

Your journey in the data realm begins with SQL.

Skill 2: Python or R

Acquiring proficiency in another programming language will enhance your data science capabilities.

Python and R both pair well with SQL. While companies appreciate both, if you’re just starting, opt for Python. Its wide array of libraries, compatibility with software programming, and deep learning applications make it vital. Once you master one, learning the other becomes straightforward.

For instance, Meta’s GeoLift package was initially developed in R. As a Python user, I adapted the R code for my project in a few hours.

Example question

How many Uber rides took place in Los Angeles last week? What was the average revenue per ride?

Key concepts (Python)

Libraries - Data manipulation and analysis: Pandas, Numpy.

Dedicate 80% of your Python study to these areas. Master SQL concepts within Python, like groupby and join.

You’ll want to engrain this code into your memory:

import pandas as pd
  • Visualization: Matplotlib, Seaborn.
  • Machine Learning: scikit-learn.
  • Statistics: statsmodels.

Variable types - list, tuple, dictionary, set, int, float, boolean, etc.

Features - Conditional statements: if, elif, else. - Loops. - Functions: def. - List manipulations: append, remove.

How to learn

  • 10 minutes to pandas: Pandas official user guide.
  • codecademy Learn Python 3: Covers important features with quizzes and projects.
  • data analysis with python: Covers key libraries for data analysis.
  • learnpython.org: Practice questions.

Skill 3: Product Sense

Cultivating product sense involves evaluating products like an expert. This skill is especially vital for aspiring data scientists at B2C giants like Facebook and Spotify.

How can you develop product sense?

Think like a Product Manager.

What do you appreciate about certain applications but dislike in others? What makes your favorite product so compelling? How do you assess a product's success or failure?

Start by asking these questions, then merge intuition with technical skills and enhance with analytics.

Example question

How would you assess the health of Facebook Groups?

Key concepts

Aligning Goals Understand how company objectives align with product aims. For instance, Meta’s mission — “Giving people the power to build community and bring the world closer together” — informs Instagram Stories, promoting frequent content creation and app usage.

Growth funnel: AARRR

Picture my Medium:

  1. Acquisition: You discovered this article via Google.
  2. Activation: You found it valuable and subscribed.
  3. Retention: You continue to read my articles.
  4. Revenue: You support me by buying me a coffee or hiring me for coaching or consulting.
  5. Referral: You share it with friends.

Funnel Analysis Where do users drop off? What hurdles cause churn? For example, a lengthy sign-up process might deter users. What tactics could bring users back? Perhaps Thursday evening email notifications double the click rate compared to Monday mornings.

Evaluation Metrics 1. North star: The main indicator of product health. For Facebook, it’s Monthly Active Users (MAU). 2. Secondary: Supporting indicators like user retention rates and new user growth. 3. Guardrails: Business metrics to watch for negative impacts, such as the percentage of fake accounts on Facebook.

Troubleshooting Suppose Airbnb searches decline by 35% today. How would you identify the problem?

How to learn

  • Cracking the PM interview: Learn to approach problems as a PM.
  • Exponent PM mock interviews: Observe PMs from leading tech firms tackle product challenges in real-time — a priceless learning tool.
  • Sequoia’s Product blog: Stay informed with insights from industry leaders.

Skill 4: A/B Testing

A/B tests, or controlled experiments, are essential for making informed product launch decisions. By comparing different versions of a webpage, product design, or layout, you can assess performance against a predetermined metric.

Example question

Your PM wishes to launch YouTube Shorts. How would you assess the success of this feature?

Key concepts

  • Understanding causation vs correlation.
  • Evaluation metrics.
  • Statistical concepts for A/B testing: T test, P value, Z score, Significance level, Confidence interval, Minimum detectable effect, Sample size estimation.
  • Recognizing effects:
    1. Network effect (e.g., Bumble, Uber).
    2. Novelty effect when users are more responsive to newly launched features.

[Bonus] When A/B testing is not suitable 1. Small user base. 2. Brand-new app status. 3. Resource constraints.

Steps for Running an A/B Test

  1. Clarify the objective: Gauge user reactions to YouTube Shorts.
  2. Set metrics
    • Goal metric: YouTube daily active users.
    • Secondary metrics: Percentage of users engaging only with Shorts, overall time spent on the platform.
    • Guardrail: Time spent on traditional videos (where ad revenue is generated).
  3. Form hypothesis: YouTube Shorts will boost daily active users.
  4. Create experiment groups
    • Control: Users do not see YouTube Shorts (status quo).
    • Experiment: Users can see and interact with Shorts.
  5. Calculate sample size
    • Choose significance level (alpha).
    • Determine the minimum detectable effect.
  6. Consider potential complications
    • Should all creators have access to Shorts or only a select few?
    • Conduct a randomized user test or a location-based test to minimize network effects.
  7. Decide on experiment length
    • It should run for at least a week to accommodate daily patterns.
    • It may need to extend longer with a smaller user group.
  8. Perform significance test on goal metric & evaluate other metrics.
  9. Make a launch decision.

How to learn

Udacity’s A/B Testing course by Google is my top recommendation. It covers everything mentioned here. Watch it twice: grasp the basics on the first viewing and delve deeper on the second. If you're short on time, read a course summary, but I highly recommend completing the full course and all exercises.

Skill 5: Statistics

Example question

Can you illustrate the distribution of Tinder matches on a given day?

Key concepts

  • Common distributions: Comprehend binomial and normal distributions.
  • Central limit theorem: Understand how sample means form a normal distribution, regardless of the population distribution.
  • Regression to the mean: Recognize that extreme values tend to revert towards the average.
  • Probability and Bayesian probability: Differentiate between classical and Bayesian probability approaches.
  • Variance, standard deviation: Measure data variability.
  • Mean, median, percentiles: Calculate central tendencies and data positioning.

How to learn

  • Probability & statistics from brilliant.org: Offers clear illustrations and practical problems to solidify your understanding.

Skill 6: Machine Learning

Example question

How would you design a news feed ranking algorithm?

Key concepts

  • Types of ML models:
    1. Regression: linear regression.
    2. Classification: logistic regression, decision trees.
    3. Clustering.
  • Supervised vs. unsupervised learning.
  • Evaluation metrics: accuracy, precision, recall, F score, AUC.
  • Cross-validation.
  • Bias-variance tradeoff: Find the right model complexity to avoid overfitting or underfitting.

How to learn

  • Machine Learning with Python: This introductory course covers common models and their applications.
  • geeksforgeeks: Covers a broader range of topics and advanced concepts.
  • ML cheatsheet: A quick reference for each model and key statistical concepts.
  • An Introduction to Statistical Learning: A textbook used in Columbia University’s Stats master’s program. If you have over six months to job hunt or aim for a career in Machine Learning, I highly recommend this book for its systematic and technical approach.

Skill 7: Interviewing and Communication

Excelling in interviews and mastering communication is crucial for securing a job.

To emphasize this vital skill, I’ll provide a more detailed discussion in a separate blog. Stay tuned for the next post!

[Bonus]: What NOT to Focus On

Specific visualization tools like Power BI or Tableau

Don’t worry about mastering tools like Power BI or Tableau unless your desired role specifically requires it. Tech companies often prefer Looker or Databricks for dashboarding, which you can quickly learn on the job. Tableau and Power BI are generally more relevant for data analysts.

Building Deep Learning models

Avoid diving deep into constructing complex models. While it’s tempting to engage in developing advanced technology, entry-level data science roles require a broad understanding of machine learning rather than deep specialization. Positions that require extensive deep learning skills typically have titles like “Machine Learning Engineer” or “Applied Scientist.”

The Most Important Thing...

Select 1-2 resources for each skill and practice diligently.

Before my interviews, I practiced every day for a month.

Conduct mock interviews. Have friends act as interviewers. Before the actual interviews, I went through nine mock interviews — six focused on product and three on coding.

Thinking you can rehearse silently in your mind? Think again.

Speak out loud. Let your mock interviewers challenge you before the real interviews.

Ask your friends for their time. Hire a coach if necessary. There are no shortcuts when your dream job and a $100K salary are on the line.

You are remarkable for making it this far. Best of luck!

This article was originally published on my Substack. Socials: LinkedIn | Twitter/X

Share the page:

Twitter Facebook Reddit LinkIn

-----------------------

Recent Post:

Title: Embracing Honest Ignorance Over Deceptive Expertise

Exploring the value of honest ignorance over deceptive expertise in our learning journeys.

Integrating Nivo Sankey Diagrams into Your React Application

Discover how to implement Nivo Sankey diagrams into your React application with detailed steps and code snippets.

Understanding the Critical Importance of Sleep for Brain Health

Explore the vital role of sleep in brain health and the dangers of sleep deprivation, including insights from recent scientific findings.

# Advancements in Green Hydrogen Production Using Seawater

Researchers have pioneered a cost-effective method for producing green hydrogen from seawater, avoiding the need for desalination and enhancing sustainability.

Understanding Why Stressful Experiences Stick in Our Memories

Explore how stress influences memory retention and the neuroscience behind it.

Understanding Dreams and Fear: A Journey to Self-Discovery

Explore the interplay between dreams and fears, and how they shape our lives and decisions.

Embracing the 2024 Tennis Grass Season: A New Perspective

Discover how my views on the grass season have transformed in 2024 and why I'm now excited about this unique time in tennis.

Navigating Rising Expenses: Practical Solutions and Insights

Explore effective strategies to manage unexpected expenses and financial challenges.