uci-cbcl/polyAcode

Name: polyAcode

Owner: Computational Biology and Computational Learning @ UCI

Created: 2015-08-16 20:50:04.0

Updated: 2015-11-25 01:59:35.0

Pushed: 2015-11-25 03:08:08.0

Size: 6959

Language: Python

README

README for polyAcode

This code base provides all the necessary pieces to reproduce the main results of the polyAcode manuscript. If you have any questions, please email yil8@uci.edu

PREREQUISITES

Mandatory
Optional

DATA

features_annotation.xls contains detailed annotations for the 658 features.

MayrLab
JohnLab

TRAINING

AdaBoost

The AdaBoost model is trained with adaboost.py. For example:

adaboost.py ./MayrLab/data_features_train.npy ./MayrLab/data_features_test.npy 0 10 5

In this example, 0 is the feature index, 10 specifies 10-fold cross-validation, and 5 is the number of processes used for training. The feature indices are defined as follows:

| Index | Definition |
| :---- | :--------- |
| 0 | All |
| 1 | Conservation level |
| 2 | Nucleosome positioning |
| 3 | Secondary structure |
| 4 | Transcript structure |
| 5 | Short 3mer motif |
| 6 | PAS signal & variants |
| 7 | Known regulators |
| 8 | Potential unknown motifs |
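To make the argument order concrete, here is a minimal sketch of how the five positional arguments could be parsed and the feature index mapped to its group name. The `FEATURE_GROUPS` dictionary and `parse_args` helper are hypothetical names used for illustration; they are not taken from adaboost.py, whose actual argument handling may differ.

```python
import argparse

# Feature groups copied from the index table above.
FEATURE_GROUPS = {
    0: "All",
    1: "Conservation level",
    2: "Nucleosome positioning",
    3: "Secondary structure",
    4: "Transcript structure",
    5: "Short 3mer motif",
    6: "PAS signal & variants",
    7: "Known regulators",
    8: "Potential unknown motifs",
}


def parse_args(argv):
    """Parse the five positional arguments in the order the README shows.

    Hypothetical helper: the real scripts may read their arguments differently.
    """
    parser = argparse.ArgumentParser(description="Illustrative argument parser")
    parser.add_argument("train_path", help="training features (.npy)")
    parser.add_argument("test_path", help="test features (.npy)")
    parser.add_argument("feature_index", type=int, choices=sorted(FEATURE_GROUPS))
    parser.add_argument("n_folds", type=int, help="number of cross-validation folds")
    parser.add_argument("n_procs", type=int, help="number of worker processes")
    return parser.parse_args(argv)


args = parse_args([
    "./MayrLab/data_features_train.npy",
    "./MayrLab/data_features_test.npy",
    "0", "10", "5",
])
print(FEATURE_GROUPS[args.feature_index], args.n_folds, args.n_procs)
```

With this sketch, an index outside 0 to 8 is rejected at parse time rather than failing later during training.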

Logistic Regression

The logistic regression model is trained with lr.py. For example:

lr.py ./MayrLab/data_features_train.npy ./MayrLab/data_features_test.npy 0 10 5

The arguments have the same meaning as for adaboost.py.

Linear SVM

The linear SVM model is trained with lsvm.py. For example:

lsvm.py ./MayrLab/data_features_train.npy ./MayrLab/data_features_test.npy 0 10 5

The arguments have the same meaning as for adaboost.py.
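Because adaboost.py, lr.py, and lsvm.py share the same argument order, comparing feature groups across models amounts to varying the script name and the feature index. The sketch below only builds the command lines and does not run anything. `build_commands` is a hypothetical helper, and the file names are the MayrLab ones from the examples above.

```python
# Script names and argument order are taken from the README examples;
# the helper itself is hypothetical and only builds the commands.
SCRIPTS = ["adaboost.py", "lr.py", "lsvm.py"]


def build_commands(data_dir, feature_indices, n_folds=10, n_procs=5):
    """Return one argv list per (script, feature index) pair."""
    train = f"{data_dir}/data_features_train.npy"
    test = f"{data_dir}/data_features_test.npy"
    return [
        [script, train, test, str(idx), str(n_folds), str(n_procs)]
        for script in SCRIPTS
        for idx in feature_indices
    ]


# Feature indices 0 through 8, as listed in the feature index table.
commands = build_commands("./MayrLab", range(0, 9))
for cmd in commands[:2]:
    print(" ".join(cmd))
```

Each list can be passed to `subprocess.run` to launch the corresponding training job. Whether the JohnLab directory uses the same file names is not stated here, so check it before reusing the helper with that data set.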

WD-SVM

The WD-SVM model is trained with wdsvm.py. For example:

wdsvm.py ./MayrLab/data_seq_train.txt ./MayrLab/data_seq_test.txt 10

In this example, the inputs are the training and test sequence files, and 10 specifies 10-fold cross-validation.
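For readers unfamiliar with the model, "WD-SVM" conventionally refers to an SVM that uses the weighted degree string kernel, which compares two equal-length sequences by counting matching substrings at the same positions. The sketch below assumes that reading. It is a plain-Python illustration of the kernel, not the implementation used by wdsvm.py, and the `degree` value is an arbitrary choice because the degree used in the manuscript is not stated here.

```python
def wd_kernel(x, y, degree=3):
    """Weighted degree kernel between two equal-length sequences.

    k(x, y) = sum over d = 1..D of beta_d * (number of positions i
    where x[i:i+d] == y[i:i+d]), with beta_d = 2 * (D - d + 1) / (D * (D + 1)).
    """
    if len(x) != len(y):
        raise ValueError("sequences must have the same length")
    total = 0.0
    for d in range(1, degree + 1):
        # Shorter substrings get larger weights; the weights sum to 1.
        beta = 2.0 * (degree - d + 1) / (degree * (degree + 1))
        matches = sum(
            1 for i in range(len(x) - d + 1) if x[i:i + d] == y[i:i + d]
        )
        total += beta * matches
    return total


# Toy sequences for illustration only (not taken from the data files).
print(wd_kernel("ACGT", "ACGT", degree=2))
print(wd_kernel("ACGT", "ACGA", degree=2))
```

A sequence compared with itself gets the highest score, and each positional mismatch lowers the score by removing every substring match that overlaps it.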


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.