3pillarlabs/text-clustering-classification

Name: text-clustering-classification

Owner: 3Pillar Global Open Source

Description: A collection of projects to study text clustering and classification.

Created: 2018-05-11 10:03:32.0

Updated: 2018-05-11 10:16:47.0

Pushed: 2018-05-11 10:06:12.0

Homepage: null

Size: 1320

Language: Scala

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

About

This is a collection of Scala projects to study text clustering and classification. The projects are:

Setup

Prerequisites
  1. VirtualBox
  2. Vagrant
  3. Chef Development Kit (DK)
Steps
VirtualBox
ant up dev

This will install all the dependencies and data files. This might take a couple of hours, so get some coffee and something to read on the side.

Once the VM is up and running, restart the VM.

ant reload dev
AWS

You will need a private key for this. Place your key in the project root and rename the key file to infop.pem or replace these lines in Vagrantfile:

keypair_name = "your_key_name"
ride.ssh.private_key_path = "path/to/your/key.pem"

Next, create the AWS instance with:

ant up awsdemo --provider aws

Once it is running, restart it:

ant reload awsdemo

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.