Data Scientists, Data Engineers
by David Venturi
More than 15,000 people responded to Free Code Camp’s 2016 New Coder Survey, granting researchers (like me!) an unprecedented glimpse into how people are learning to code. They released the entire dataset.
Here are a few high-level statistics from this data-focused subset, which complements Free Code Camp’s .
I’ve borrowed the structure of Free Code Camp’s announcement article for ease of comparison. I’ve also included my comments where findings differ notably. And a few bonus plots, too!
Of the 646 developing data scientists and data engineers who responded to the survey:
25% are women (4% more)
their median age is 26 years old (one year younger)
they started programming an average of 16 months ago (5 months earlier)
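Summary figures like the three above can be computed along these lines. This is only a minimal Python sketch (the original analysis was written in R); the records are made up for illustration and the field names are assumptions, not the survey's actual column names.

```python
from statistics import mean, median

# Made-up records for illustration only. The field names are hypothetical
# and do not correspond to the real survey columns.
respondents = [
    {"gender": "female", "age": 24, "months_programming": 6},
    {"gender": "male", "age": 26, "months_programming": 12},
    {"gender": "male", "age": 31, "months_programming": 30},
    {"gender": "female", "age": 28, "months_programming": None},
]


def summarize(rows):
    """Return the share of women (%), median age, and mean months programming.

    Missing answers (None) are dropped per field, so each statistic is
    computed only over respondents who answered that question.
    """
    genders = [r["gender"] for r in rows if r["gender"] is not None]
    ages = [r["age"] for r in rows if r["age"] is not None]
    months = [
        r["months_programming"]
        for r in rows
        if r["months_programming"] is not None
    ]
    return {
        "pct_women": 100 * sum(g == "female" for g in genders) / len(genders),
        "median_age": median(ages),
        "mean_months_programming": mean(months),
    }


summary = summarize(respondents)
print(summary)
```

Running the same function on the full survey and on the data-focused subset would give the pairs of numbers being compared in the list.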
This is one hour less than new coders in general.
Compared to 40% for the full new coder survey, this is a bit shocking. I have a hunch these zero counts are caused by the . Every respondent that answered the job role of interest question has zero counts for “start your own business” and “freelance.”
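A hunch like this can be checked by tabulating one answer against whether another question was answered at all. The sketch below is in Python rather than the R used for the analysis, and it relies on made-up rows and hypothetical field names (job_role_interest, job_pref); it illustrates the check and does not reproduce the survey data.

```python
from collections import Counter

# Made-up rows for illustration only; both field names are hypothetical.
rows = [
    {"job_role_interest": "Data Scientist", "job_pref": "work for a medium-sized company"},
    {"job_role_interest": "Data Engineer", "job_pref": "work for a startup"},
    {"job_role_interest": None, "job_pref": "freelance"},
    {"job_role_interest": None, "job_pref": "start your own business"},
]


def job_pref_counts(rows, answered_role):
    """Count job preferences among respondents who did (or did not)
    answer the job-role-of-interest question."""
    return Counter(
        r["job_pref"]
        for r in rows
        if (r["job_role_interest"] is not None) == answered_role
    )


with_role = job_pref_counts(rows, answered_role=True)
without_role = job_pref_counts(rows, answered_role=False)

# A Counter returns 0 for keys it has never seen, so absent categories
# show up as zero counts rather than raising an error.
for pref in ("freelance", "start your own business"):
    print(pref, with_role[pref], without_role[pref])
```

If the two categories are zero for every respondent who answered the job role question but not for the rest, the zero counts come from how the questions relate rather than from respondents' real preferences.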
This is a longer time horizon than new coders in general, where 65% are applying within the next year.
Only 46% of new coders in general have used at least one of these resources. These companies cover a wider range of subject areas than some of the coding-specific resources listed.
Of them, only three of the podcasts noted are data-specific.
6% of new coders have attended a bootcamp.
The predominance of North Americans is to be expected because Free Code Camp is based in the United States.
Compared to 58% for new coders in general, the data-focused subset is more skewed towards post-secondary studies.
Majors are more diverse than in the full survey, where Computer Science and Information Technology checked in at #1 and #2 with 17% and 5%, respectively.
Two-thirds of the new coder population are currently working.
Employment fields are more varied than in the full dataset, where 50% of respondents work in software development and IT.
The median current salary for the full dataset is $37k.
The median for the full survey dataset is $50k. With data science/engineering being in 2016, some respondents might be seeking higher wages.
This is 5% higher than new coders in general.
This average is $3k more than the full survey dataset.
You can find a version of this analysis on Kaggle, where I outline my process.
Be sure to check out my initial exploration of , where I dive deeper into the characteristics of new coders:
If you have questions or concerns about this series or the R code that generated it, don’t hesitate to get in touch.
Reposted from: http://jhewd.baihongyu.com/