bomboradata/datascience-exam

Name: datascience-exam

Owner: Bombora

Description: Repo supporting data science team interview exams.

Created: 2016-09-28 06:58:55.0

Updated: 2017-08-21 03:44:43.0

Pushed: 2018-04-27 17:01:59.0

Homepage: null

Size: 328

Language: Jupyter Notebook

README

Bombora Data Science: Interview Exam


Welcome!

From all of us at Bombora Data Science, we're excited to get to know you! To do so, we'd like to begin by engaging in a few exercises exploring concepts we're interested in (and hope you are as well).

NOTE: We'll be leveraging open tools and public repositories. Ethics and professional aspirations should drive you to work creatively, enthusiastically, and independently. Clearly, you should leverage all resources you can, but please, do not share this exam or your solutions with any other entity, man or machine. :)

Thank you so much! Now, get comfortable and let's go!

Instructions

Overview

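We're going to leverage typical analysis and development workflows used by our team to conduct an interview exam. In short, you'll configure a few parameters, fork and clone the repo, set up your working directory and Jupyter environment, solve two problems, commit and push your results, and open a pull request to submit.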

Details
0. Configure exam params:

Define a few parameters used throughout the exam: GH_USERNAME, DS_POSITION, and DS_EXAM_NAME.

If you're a POSIX shell user (e.g., on Linux or OS X), you can set these params via environment variables so future commands won't have to be manually modified!

export GH_USERNAME='carlsagan'
export DS_POSITION='data-science-director'
export DS_EXAM_NAME='sagan-carl'
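Because every later command relies on these three variables, it can help to confirm they are set before continuing. The helper below is a minimal sketch, not part of the repo: the function name `check_exam_params` and its messages are hypothetical, and it assumes a POSIX-compatible shell.

```shell
# Hypothetical helper (not part of the repo): fail early if a parameter is missing.
check_exam_params() {
    for name in GH_USERNAME DS_POSITION DS_EXAM_NAME; do
        # Indirect lookup via eval keeps this portable to plain POSIX sh.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "missing exam parameter: $name" >&2
            return 1
        fi
    done
    echo "exam parameters OK: $GH_USERNAME / $DS_POSITION / $DS_EXAM_NAME"
}
```

Running `check_exam_params` after the exports should print a one-line summary; a non-zero exit status means one of the variables is unset or empty.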
1. Make your own copy of repo
Fork repo to your GitHub account

It's the button above that looks like the one below (you can click either):

Fork

Local Clone

On your local machine, with git installed:

git clone git@github.com:$GH_USERNAME/datascience-exam.git
cd datascience-exam
Checkout $DS_POSITION branch

Develop against a branch titled for the position you're applying to:

git checkout $DS_POSITION
2. Setup your exam working directory
  1. Create your exam directory:

    mkdir -p ./submittals/$DS_POSITION/$DS_EXAM_NAME
    
  2. Copy exam notebook (ds-interview-exam.ipynb) to your exam directory:

    cp ds-interview-exam.ipynb ./submittals/$DS_POSITION/$DS_EXAM_NAME/
    
3. Setup Jupyter Notebook Environment
3.1 Launch Jupyter

You have two options:

  1. Upload the included ds-interview-exam notebook to try.jupyter.org (easy and recommended):

    • open try.jupyter.org
    • upload this notebook to the Jupyter session
      • click upload in the upper right corner
      • select the local notebook file (e.g., your copy of ds-interview-exam.ipynb)
    • click on the notebook to open it.
  2. Or, if you prefer (much) more control over packages, create a local conda env (more difficult):

    • install conda / miniconda.
    • create bombora-datascience-exam conda environment, via (from repo root): conda env create -f bombora-datascience-exam.yml
    • activate conda environment, via: source activate bombora-datascience-exam (on conda 4.4 and later, conda activate bombora-datascience-exam also works)
    • launch Jupyter, via: jupyter notebook
    • open your copy of ds-interview-exam.ipynb notebook
3.2 Setup Jupyter

Note: for those not familiar with Jupyter notebook, please view the Welcome to … intro notebooks included in file listing. Also, you can review the Notebook Basics tutorial.

After opening your notebook, select a language kernel of your choice (Scala, R, Haskell, Python 2 or 3, Julia, or even Bash).

4. Solve TWO problems, one from each section.
  1. Note that there are two sections:

    • 1. Algo + Data Structures
    • 2. Prob + Stats
  2. Take 10 minutes to review questions across each section.

  3. Select and answer ONE question from EACH section, or to be totally explicit:

    • select and answer a question from section 1 (e.g., either Q1.1 or Q1.2).
    • select and answer a question from section 2 (e.g., either Q2.1, Q2.2, Q2.3, Q2.4, or Q2.5).

This is you, shining :sparkles:. Think, Hack, Solve, Repeat. Do enjoy this part. :smile:

5. Commit results and push to your remote repo.
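
The README does not spell out the commands for this step. One way to do it, assuming the exam directory from step 2 and the branch checked out in step 1, is sketched below. The function name `commit_and_push_exam` and the commit message are hypothetical; `origin` is assumed to be your fork, which is the default after cloning it.

```shell
# Hypothetical helper (not part of the repo): commit the exam directory and push it.
commit_and_push_exam() {
    exam_dir="./submittals/$DS_POSITION/$DS_EXAM_NAME"
    if [ ! -d "$exam_dir" ]; then
        echo "exam directory not found: $exam_dir" >&2
        return 1
    fi
    # Stage only the exam directory, commit, then push the position branch.
    git add "$exam_dir" &&
    git commit -m "Add exam submission for $DS_EXAM_NAME" &&
    git push origin "$DS_POSITION"
}
```

Staging only the exam directory keeps unrelated local changes out of the submission.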
6. Open Pull Request to submit exam

To submit, please open a PR to the $DS_POSITION-submittals branch (note: this is a different branch name) of the repo.
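
Since the target branch name differs from the one you developed on, printing both before opening the PR can help avoid a mix-up. This is a small sketch: the function name `pr_branches` is hypothetical, and the `user:branch` form for the head assumes you pushed the $DS_POSITION branch to your own fork.

```shell
# Hypothetical helper (not part of the repo): show which branches the PR connects.
pr_branches() {
    # Braces keep the variable name separate from the "-submittals" suffix.
    echo "base: ${DS_POSITION}-submittals"
    echo "head: ${GH_USERNAME}:${DS_POSITION}"
}
```

With the example parameters from step 0, the base would be data-science-director-submittals and the head carlsagan:data-science-director.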

Questions?

Open an issue in the GitHub repo. For your convenience, we provide a button right here:

Issue

Finished, what's next?

Go enjoy your day. You deserve it!


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.