RECRUITING: DATA SCIENCE AND ENGINEERING SKILLS
Required:
· Mathematics
1. Linear algebra
2. Statistics
3. Probability
· Software
1. R or SAS or Mathematica or MATLAB or Sagemath
2. Python or Ruby or Perl or Java
3. Linux
4. Bash scripting including sed, awk, cut, uniq, sort, tr
5. SQL
· Plotting/graphics (describe how to do these using any tool)
1. Scatterplots/matrix plots
2. Line graphs/bar charts
· Previous experience
1. Science or Math research
2. Data analysis
Desired:
· Mathematics
1. Graph theory
2. Network analysis
3. Algorithms
4. Bayesian probability
5. Markov chains/hidden Markov models
6. Principal component analysis
7. Matrix factorization/singular value decomposition
· Software
1. Mahout
2. Graphlab
3. Other machine learning libraries
4. Pig and UDFs
5. Hive and UDFs
6. Source control systems: git, mercurial/hg, subversion
7. Build tools: ant, maven
· Plotting/graphics
1. Advanced/specialized plotting techniques
2. Cross-platform skills: R, JavaScript, Mathematica, Python, etc.
· Previous experience
1. Built recommenders or large-scale computation of metrics(similarity, cohort containment, etc.)
2. Designed and measured performance of predictive models
3. Automation of work
4. Web services
No comments:
Post a Comment