RateOrchard

Career guide

How to become a data scientist in 2026

What the role pays now that the 2020-21 hype has cooled, and the four credible paths into it.

Adrian Serafin, founder and editor of RateOrchardBy Adrian SerafinFounderUpdated April 28, 2026

What you will learn

Whether data science still pays at the level the 2020 cohort thinks it does, where in the country it pays best after cost of living, and which of the four entry paths fits if you are starting from a math, engineering, analyst, or career-change baseline.

National median wage (2024)
$117,330
10-year job growth (BLS, 2024-34)
+35.6%
Annual openings (BLS)
~17,300/yr
Time to first job (bootcamp + portfolio)
9-15 months
See data scientist salary by state

What the job actually involves

A data scientist develops analytical models, derives insights from large datasets, and communicates findings to people who will act on them. The O*NET task list for 15-2051 starts with these activities: apply mathematical or statistical analysis methods, identify business needs that can be addressed through data, design and develop predictive models, and present findings to non-technical audiences. The role sits between research and product, and the balance varies more by company than by title.

The week breaks down differently than most outside the role expect. Roughly forty percent of the time goes to data wrangling: pulling data, cleaning it, joining it, and building the pipeline that keeps it fresh. Twenty-five percent goes to modeling. Twenty percent goes to communicating findings, writing memos, and presenting in stakeholder reviews. Fifteen percent goes to experimentation and AB testing. Junior data scientists spend more time wrangling. Senior data scientists spend more time on the framing of which questions to answer and on the politics of getting the answer adopted.

Where data scientists work matters as much as the title. The BLS QCEW data shows the role concentrated in tech (about thirty-eight percent of employment), finance and insurance (about eighteen percent), healthcare (twelve percent), retail (eight percent), and consulting (six percent). The day-to-day looks completely different in a tech company doing recommendation systems, a bank doing credit risk, a pharma company doing clinical-trial analysis, or a retailer doing pricing optimization, even though the BLS title is the same.

  • Data wrangling, cleaning, pipelines (~40%)
  • Modeling, training, evaluation (~25%)
  • Communicating findings, memos, presentations (~20%)
  • Experimentation, AB testing, causal inference (~15%)

How much data scientists earn

The BLS Occupational Employment and Wage Statistics release for May 2024 shows a national median annual wage of $117,330 for data scientists. The full distribution runs from $66,910 at the 10th percentile to $179,960 at the 90th. Total compensation at large tech companies (FAANG and the next tier of well-funded private companies) frequently exceeds the BLS 90th percentile because BLS captures wage and not equity grants. This is one of the few occupations where public BLS numbers materially understate top-of-market reality.

State differences are large and concentrated. California reports the highest median, around $148,300 in 2024, but California also carries a Regional Price Parity of 114.8 (BEA, 2023). A California data scientist at the median earns roughly $129,180 in national-baseline dollars after cost-of-living adjustment. Washington, New York, Massachusetts, and Connecticut publish similar nominal premiums, partially eaten by housing. The interesting middle is Texas, North Carolina, and Colorado, where pay has caught up faster than housing, so cost-adjusted compensation looks better than the headline.

Two practical comparisons matter more than a leaderboard of states. First, the gap between a senior data scientist at a public tech company and a mid-level data scientist at the same company is often larger than the gap between two metros at the same career stage. Second, the bonus and equity component of compensation is the single largest source of variance. A six-figure base in a non-tech industry does not necessarily beat a five-figure-lower base in tech once equity vests. The cleanest way to compare offers is total comp at the four-year mark, not first-year base.

  • Top 5 paying states (2024 BLS): California, Washington, New York, Massachusetts, Connecticut
  • Bottom 5 (cost-of-living adjusted): Mississippi, Arkansas, West Virginia, Alabama, Kentucky
  • Median by experience (rough): entry $67k, mid-career $117k, senior $180k+
  • FAANG-tier total comp (base + bonus + equity) often exceeds BLS 90th percentile

Four credible paths into the role

Most working data scientists in the US arrived through one of four routes. Each has a different cost, timeline, and filter on who succeeds.

The traditional path is a master's or PhD in statistics, mathematics, computer science, economics, or a quantitative field. Hiring at research-leaning data scientist roles still favors the master's-or-higher signal, particularly at the FAANG and quant-finance tier. PhD programs are five to seven years of below-market stipend, but they fully fund tuition and produce candidates who clear the technical bar at any employer. A master's program runs one to two years and costs $30,000 to $80,000 at most reputable programs.

The bootcamp path takes nine to fifteen months end to end. Reputable data science bootcamps (Springboard, Metis legacy, Galvanize, BrainStation) run twelve to twenty-four weeks of full-time instruction at $10,000 to $20,000. Add three to nine months of job search after graduation. Bootcamps work well for career-changers with a strong quantitative background already, who want a structured way to build a portfolio and a peer cohort. They work badly for people without exposure to linear algebra and probability.

The MOOC plus portfolio path is the cheapest and takes the longest. Andrew Ng's Coursera Machine Learning Specialization, the IBM Data Science Professional Certificate, fast.ai, and DataCamp can cover the same fundamentals as a master's program over twelve to twenty-four months. The trade-off is no credential and no peer network. Most self-taught data scientists we know took eighteen to thirty months to land a first paid role, after publishing two to four real projects on GitHub or Kaggle.

The pivot path is what most working data scientists actually did. People move into the role from software engineering, data analyst, business intelligence, actuarial, quantitative research, or scientific research backgrounds. The pivot typically takes six to eighteen months of focused upskilling on machine learning theory and practical modeling, on top of an existing technical job. Internal moves at a current employer are the most common form of this pivot.

What skills the role rewards

O*NET publishes importance and level scores for each skill in each occupation. For data scientists (15-2051), the top scores point at a different mix than what most "data science roadmap" guides suggest.

Mathematics sits at importance 4.50 out of 5 with a level score of 6.25 out of 7. Complex problem solving scores 4.62. Critical thinking and active learning both score 4.50. Programming sits at 4.25. Reading comprehension scores 4.38. Writing scores higher than most candidates expect, at 4.25. The pattern says that the harder you can frame a fuzzy business question as a tractable statistical problem, communicate the framing, and write up the answer in a way that survives a stakeholder review, the further you go.

Knowledge areas reinforce the same point. Mathematics scores 4.88. Computers and Electronics scores 4.62. English Language scores 4.12. The English score surprises people. It should not. The cost of a data scientist who builds a beautiful model that no one understands is paid by the team and, eventually, by the analyst who has to explain why a number in last month's deck no longer holds.

  • Mathematics (importance 4.50, level 6.25)
  • Complex problem solving (4.62)
  • Critical thinking (4.50)
  • Active learning (4.50)
  • Programming (4.25)
  • Writing (4.25)

Where the role is going

BLS Employment Projections for the 2024 to 2034 cycle show data scientist employment growing by 35.6 percent. That is the "much faster than average" category and one of the top ten growth rates across all 800-plus tracked occupations. Mean annual openings are projected at 17,300 per year, with most of the growth coming from net additions rather than replacement.

The headline number deserves a footnote. The 2020 to 2021 hiring boom pulled forward demand and produced a wave of hires that did not all survive the 2022 to 2023 round of tech layoffs. The role split internally during that window: companies kept the data scientists who shipped products and let go many of the data scientists who only published memos. The current hiring market favors candidates who can demonstrate end-to-end ownership, including the engineering side of model deployment.

For someone making a career decision today, the practical takeaway is: data science is one of the highest-growth occupations in the BLS catalog, but the job spec has tightened. The valuable skills are different from what they were in 2018. Modeling alone is not the moat. The moat is the ability to take a fuzzy business question, frame it as a statistical problem, build a model that solves it, deploy that model into something a product or operation actually uses, and communicate the result. The closer the role is to that full stack, the more resilient it is to whatever the next labor reshuffle brings.

  • Adjacent roles: ML Engineer (15-1252.04 emerging), Data Engineer (15-1242), Quantitative Analyst (13-2099)
  • Common pivots later: research scientist, product analytics lead, applied scientist, founding ML engineer

Geography and remote work

Five metros account for an outsized share of high-paying data science roles: San Francisco Bay Area, New York, Boston, Seattle, and Austin. The pay premium in those metros is real, but the cost of living is too. The interesting middle includes Charlotte (banking analytics), Denver (consulting and biotech), Atlanta (Coca-Cola, Home Depot, Delta data orgs), and Raleigh-Durham (research triangle).

Remote work changed the calculation but did not collapse it. Roughly seventy-five percent of senior individual-contributor data science postings on the major job boards advertise remote-eligible. Junior remote-only roles remain rare; most organizations want junior data scientists in office at least part of the week for mentorship and stakeholder relationships. If you are early in your career, optimizing for a city with a strong scene and good mentor pool matters more than chasing a remote role. After three to five years, remote becomes a normal option.

For people considering relocation, the math we publish on each /salary/[role]/[state] page shows the cost-of-living adjusted figure next to the headline. Seattle and Boston, for example, look very different once cost-adjusted, and the comparison is more useful than "best states for data scientists" listicles built on nominal wages alone.

What it costs

The total cost-and-time picture varies enormously by path.

A two-year master's in data science, statistics, or a related quantitative field costs roughly $30,000 to $80,000 in tuition at most reputable programs, plus two years of opportunity cost. Top programs cost more. Many candidates take loans and pay them back inside three to five years on a senior data scientist salary. The master's gets you into research-leaning roles cleanly and bypasses the credential filter at the small set of employers that screen on degree.

A PhD is fully funded at most US universities through stipend and tuition waiver, but it takes five to seven years of below-market income. The opportunity cost is large. PhDs land cleanly into research scientist, applied scientist, and senior data science roles at top tech and quantitative firms. The path suits people who want to do research and tolerate the variance of the academic job market.

A reputable bootcamp runs $10,000 to $20,000 plus three to nine months of unpaid job search. Total all-in: $15,000 to $30,000 plus six to twelve months not earning. Bootcamps suit career-switchers with a previous quantitative background, runway, and a high tolerance for an intense schedule. Without prior calculus, linear algebra, and basic probability, the modeling sections are punishing.

The MOOC plus portfolio path costs almost nothing in tuition. Coursera Plus annual subscription, DataCamp annual subscription, and a few books run $400 to $1,500. The cost is time and discipline. We have seen people land paid junior data science roles in twelve months. We have also seen people give up after thirty. The single variable that predicts success is whether you can finish two to four real projects (data acquisition, exploratory analysis, modeling, write-up, deployment, public repo) without anyone telling you to.

How to start this week

If you are still deciding which path fits, do three small things this week.

First, work through the first three weeks of Andrew Ng's Machine Learning Specialization on Coursera. The course is free to audit and the early problem sets will tell you within a few hours whether you enjoy thinking in math and code. People who finish week three with energy left have a real chance. People who hate the math should at least know that early.

Second, pick a current data scientist in your network and ask for thirty minutes of their time. Ask what they actually do, not what they would tell a recruiter they do. The day-to-day reality of the role filters out a lot of people who liked the idea of it more than the substance, particularly the wrangling and stakeholder communication portions.

Third, scan our /salary/data-scientists/[your-state] page for the realistic salary range in your state. Compare entry-level to median in your metro, then cross-check against the cost of living. The math behind two years of master's, twelve weeks of bootcamp, or eighteen months of self-study works only if the post-tax pay in your region clears the cost.

If those three steps give you a green light, the actual decision is mostly logistical: pick a path, pick a starting curriculum, set a six-month milestone of two completed portfolio projects, and start. The first ten weeks teach you whether you can hold the schedule. After that, momentum carries.

Frequently asked questions

Do I need a PhD to be a data scientist?
No, and most working data scientists do not have one. PhDs cluster in research scientist roles at top tech labs (Meta AI, Google DeepMind, OpenAI, Anthropic) and at quantitative finance firms. Outside that bracket, master's-prepared candidates and capable bachelor's-with-portfolio candidates fill most data scientist roles. A PhD opens doors at the very top tier of research-leaning roles. It is not a prerequisite for the role broadly.
Python or R for data science in 2026?
Python first, by a clear margin. Python dominates production data science, machine learning, and the engineering side of model deployment. R remains stronger in statistical research, biostatistics, and some social-science analytics. Most working data scientists need Python plus SQL and pick up R only if a specific employer or research domain calls for it.
Is data science still hot in 2026?
BLS projects 35.6 percent growth from 2024 to 2034. The 2022-2023 layoff cycle made the role more selective at the entry level than it was during the 2020-2021 hiring boom, but the long-run trend remains one of the strongest among all tracked US occupations. The job spec tightened. Candidates who can deploy models and own their work end-to-end fare materially better than candidates who only build notebooks.
Bootcamp vs master's degree?
Master's opens more doors at research-leaning employers, signals more strongly on a resume, and costs more time and money. Bootcamp gets you to a portfolio and a peer cohort faster, suits career-changers with a quantitative background, and costs less. The honest answer is: if you can afford the time and the tuition and you want long-term optionality, master's is safer. If you need to start earning sooner and you already know calculus, linear algebra, and probability, bootcamp can work.
What is the difference between data scientist and ML engineer?
Data scientist roles weight the modeling, experimentation, and analysis side. ML engineer roles weight the production, infrastructure, and deployment side. The line between them is not crisp and varies a lot by company. At smaller companies one person often does both. At large tech companies the split is sharper, with ML engineers owning the serving infrastructure and data scientists owning the model development. Pay tends to be similar at the same level.
How long does it take to become a data scientist?
Master's route: roughly two to three years total including admissions and program time. Bootcamp route: 9 to 15 months end to end. Self-taught route with portfolio: 18 to 30 months. Pivot from a related role (analyst, software engineer, scientist): 6 to 18 months. The variance inside each route is wider than the gap between them. The single largest predictor is whether you finish portfolio projects without external structure.
Can I become a data scientist without a math background?
Possible but harder. The role is built on linear algebra, probability, and basic calculus, and the modeling work assumes comfort with those. People without that background can pivot in, but the realistic timeline adds six to twelve months of foundational coursework on top of any program or bootcamp timeline. Free options like Khan Academy and 3Blue1Brown's Essence of Linear Algebra video series cover the prerequisites well at zero cost.
What does a junior data scientist earn?
BLS reports the 10th percentile annual wage for data scientists at $66,910. That figure includes the bottom decile, which skews lower. A typical first job at a non-FAANG company in a non-coastal metro pays $80,000 to $110,000 base. FAANG-tier and well-funded private companies pay $130,000 to $180,000 base for new grads with strong interview performance, plus equity that often doubles total compensation by year four. Variance by metro and company type is high.

Resources

Methodology

This guide was drafted with AI assistance using Anthropic Claude and then reviewed and edited by Adrian Serafin against BLS Occupational Employment Statistics, BLS Employment Projections, O*NET Online, and BEA Regional Price Parities source data. No fact appears in the prose that does not exist in the cited public datasets. If you find an error, write to [email protected].

Disclaimer

Information on this page is for general educational purposes only. It is not career, financial, or tax advice. Wage data reflects BLS estimates and may not match individual offers, employer-specific ranges, or current market conditions. Confirm with a licensed professional before making career or compensation decisions.