Wes McKinney

Login: wesm

Company: Two Sigma

Location: New York, NY

Bio: Creator and BDFL of Python pandas. Maker of tools to make data analysis faster and easier. Apache Arrow and Apache Parquet PMC

Blog: http://wesmckinney.com

Member of

  1. conda-forge
  2. DataPad Inc.
  3. Open Source Maintainers on GitHub
  4. pandas
  5. Python for Data
  6. statsmodels
  7. The Apache Software Foundation
  8. Two Sigma

Repositories

arrow
Mirror of Apache Arrow
arrow-1
Graphistry's TypeScript implementation of the Apache Arrow columnar data format
arrow-cpp-feedstock
A conda-smithy repository for arrow-cpp.
arrow-dist
Apache Arrow
arrow-io-test
Continuous integration for the trickier bits in Apache Arrow
arrow-plasma-object-store
Plasma Object Store code for proposed import to Apache Arrow
arrow-site
Mirror of Apache Arrow site
avro
Mirror of Apache Avro
boost-feedstock
A conda-smithy repository for boost.
bootswatch
Themes for Bootstrap
bottleneck-feedstock
A conda-smithy repository for bottleneck.
brotli-feedstock
A conda-smithy repository for brotli.
charlton
Describing statistical models in Python
conda
OS-agnostic, system-level binary package manager and ecosystem
core
there are failing tests. please find any bugs you may have introduced, fix and submit.
cyavro
Cython based wrapper for libavro
cyhello
Minimal Cython project
cysqlite3
null
dask
Versatile parallel programming with task scheduling
datarray
Prototyping numpy arrays with named axes for data management. Docs are available at URL below
dedupe
A free Python library for accurate and scalable deduplication and entity resolution.
drawarray
null
DVL
Dynamic Visualization LEGO
feather
Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow
feather-format-feedstock
A conda-smithy repository for feather-format.
fye_2010
null
gflags-feedstock
A conda-smithy repository for gflags.
gmail-backup
A Python script to download all your mail from Gmail to your local hard drive.
grin
A grep program configured the way I like it.
hdfs
API and command line interface for HDFS
hello
null
hpat
null
ib-flex-analyzer
Analyze your Interactive Brokers Flex XML reports with pandas
ibis
Productivity-centric Python big data framework for high performance at Hadoop-scale, with first-class integration with Impala. Co-founded by the creator of pandas
ibis-framework-feedstock
A conda-smithy repository for ibis-framework.
impyla
Pure Python client for Impala & Hive using HiveServer2
impyla-feedstock
A conda-smithy repository for impyla.
incubator-kudu
Mirror of Apache Kudu (Incubating)
ipython
Official IPython repository
jemalloc-feedstock
A conda-smithy repository for jemalloc.
jira-wrangle
Convert JIRA dump into something more analyzable
kudu
Kudu, a native column store for the Hadoop ecosystem. Fast analytics on fast data.
libgdf
C GPU Dataframe Library
libhdfs3-downstream
Downstream copy of libhdfs3 for simpler packaging in conda-forge. Please submit changes to https://github.com/apache/incubator-hawq
libndtypes2
Datashape C library
lz4-c-feedstock
A conda-smithy repository for lz4-c.
mapd-core
The MapD Core database
native-toolchain
null
nose
nose is nicer testing for python
nose-ipdb
A nose plugin to use iPDB instead of PDB
numpy
Numpy main repository
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
pandas2
null
pandas-governance
Project governance documents for the pandas Project
parquet-cpp
Mirror of Apache Parquet
parquet-cpp-feedstock
A conda-smithy repository for parquet-cpp.
parquet-format
Mirror of Apache Parquet
pelican-bootstrap3
Bootstrap 3 theme for Pelican
pelican-octopress-theme
Octopress default theme copied for pelican
protobuf-feedstock
A conda-smithy repository for protobuf.
pyarrow-feedstock
A conda-smithy repository for pyarrow.
pyarrow-windows-wheels
null
pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
pymaging
Pure Python imaging library with Python 2.6, 2.7, 3.1 and 3.2 support
pymapd
null
pymc
Bayesian inference in Python
pyodbc
Python ODBC bridge
PyTables
PyTables is a package for managing hierarchical datasets, designed to cope efficiently and easily with extremely large amounts of data. This is a git-svn clone of the Pro version recently released under a BSD-flavored license by Francesc Alted!
pytest-ipdb
Provides ipdb on failures for py.test.
rawkit
ctypes based libraw bindings
read-table
Working on IO utilities for loading structured data into Python
r_vs_py
Simple comparison of Python and R for a basic OLS analysis
scikit-learn
scikit-learn main repo
scipy
Scipy main repository
scipy_proceedings
Tools used to generate the SciPy conference proceedings
setuptools_scm
the blessed package to manage your versions by scm tags
snappy-feedstock
A conda-smithy repository for snappy.
spark
Mirror of Apache Spark
staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks
statlib
Bayesian State Space and Dynamic Models
statsmodels
main repo of statsmodels
strata-sj-2015
Materials for PyData at Strata/Hadoop World San Jose 2015
textreader
Yet another text file reader for numpy.
tidy-data
A paper on data tidying
tokyo
A Cython wrapper to BLAS and LAPACK
toolchain-build
null
turbodbc-feedstock
A conda-smithy repository for turbodbc.
vbench
vbench: A tool for benchmarking your code through time, for showing performance improvement or regressions
yasnippets-latex
LaTeX snippets for use with the yasnippet Emacs plugin
zipline
Zipline, a Pythonic Algorithmic Trading Library
zlib-feedstock
A conda-smithy repository for zlib.

Commits To

Repository    Most Recent Commit    # Commits


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.