Samsung/meminsight

Name: meminsight

Owner: Samsung

Description: Platform-Independent Memory Profiling Tool for Web Applications

Created: 2014-12-08 05:03:40.0

Updated: 2017-10-31 00:08:40.0

Pushed: 2016-06-02 23:42:58.0

Homepage: null

Size: 4534

Language: JavaScript

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

MemInsight

MemInsight is a memory profiler for web and node.js applications. Through code instrumentation, MemInsight collects a detailed trace of all the memory operations performed by an application during an execution. By processing this trace, MemInsight can automatically find interesting memory-related issues like leaks or excessive, unnecessary allocation. For further details, see our technical report.

Prerequisites

We require that node, java, and gradle be on your PATH.

Installation

First, install dependencies:

npm install

Then, compile the TypeScript and Java code:

./node_modules/.bin/grunt

To just compile the TypeScript code:

./node_modules/.bin/grunt typescript
Collecting A Trace

The first step in using MemInsight is to collect a trace for the application of interest. Trace collection requires instrumenting the application code (via Jalangi 2), and then running and exercising the instrumented app, which generates the trace. MemInsight supports two instrumentation modes:

In general, online instrumentation is preferable, unless you plan to do multiple runs of an instrumented app, in which case the offline approach will do the instrumentation once-and-for-all. We describe how to use both of these schemes below (online and offline).

Online Instrumentation
node.js apps

A node.js application can be instrumented and exercised using the meminsight nodeinstrun command. E.g., to instrumented and exercise script main.js in app path/to/app, run:

./bin/meminsight nodeinstrun path/to/app/main.js args

args are the usual command-line arguments for main.js. At this point, we only support node programs that can be exited without killing them at the command line. Once the instrumented program exits, the lifetime analysis will run, after which trace collection and post-processing is complete. Note that the trace files will be stored in path/to/app, so it is this directory that should be passed to the meminsight inspect command (detailed below).

Web apps

We have preliminary support for collecting a trace for a web application running on a remote server. This support relies on the support for mitmproxy added in Jalangi 2. This library allows for proxying both HTTP and HTTPS connections. To get started, you'll need to have Python 2.7 on your machine. Then, you'll need to install mitmproxy. Our support assumes that the mitmdump script from that library is in your PATH. If you are interested in instrumenting an app that is served over HTTPS, please see these instructions on enabling the mitmproxy certificate authority.

Once you have the pre-requisites installed, you can start up the proxy server process by running the following command from the meminsight directory:

./bin/meminsight proxy path/to/output

Here, path/to/output should be the path where you want the instrumented scripts and MemInsight trace files to be stored. Running this command causes two servers to be launched:

  1. A web proxy server, running on port 8501
  2. The MemInsight websocket server (for recording the trace), running on port 8080

(We plan on supporting custom port numbers soon.) Shortly after running the command, you should see Server is listening on port 8080 printed to the terminal, indicating that the servers are ready to go.

The next step is to set your system/browser to use the proxy server at IP address 127.0.0.1 (localhost), port 8501. Updating the proxy settings varies by operating system and possibly by browser. On Mac OS X, the following commands will enable the proxy server for your WiFi adapter:

sudo networksetup -setwebproxy Wi-Fi 127.0.0.1 8501 off
# for HTTPS
sudo networksetup -setsecurewebproxy Wi-Fi 127.0.0.1 8501 off

Once the proxy settings are correct, open the desired URL in your browser, and then exercise the app. To detect issues like leaks, it is best to repeatedly perform a set of interactions, such that all temporary state should be cleaned up after the interactions. Some caveats about instrumenting a live site:

When you are done exercising the application, press Alt+Shift+T in the browser window. You will see an alert in the browser window when trace generation is complete. At this point, MemInsight's lifetime analysis may still be running. You'll know the analysis is complete when you see the following output in the terminal:

leting lifetime analysis...done writing log

of script complete

At this point, you can disable the proxy server in your settings. The following commands will disable the proxy server on Mac OS X:

sudo networksetup -setwebproxystate Wi-Fi off
sudo networksetup -setsecurewebproxystate Wi-Fi off

Now you can move on to inspecting the results.

Offline Instrumentation

MemInsight can only do offline instrumentation of web or node.js applications residing on the local filesystem. To instrument an app in local directory path/to/app, run the following command from the meminsight directory:

./bin/meminsight instrument path/to/app

This will write the instrumented application to folder /tmp/app. To change the output folder, use the --outputDir option. To only instrument some of the code in the application, pass the appropriate sub-directories to the --only_include option, separated by the path delimiter. E.g., to only instrument the src directory and main.js file, run:

./bin/meminsight instrument --only_include src:main.js path/to/app

Further help can be obtained by running ./bin/meminsight instrument --help.

After instrumentation, the instrumented app must be run to collect a trace. We assume the instrumented app is in /tmp/app.

Web apps

To exercise an instrumented web application, run the following command:

./bin/meminsight run /tmp/app

This will start a webserver running at http://localhost:8888. Open this URL in your browser to load the application, and then exercise it. To detect issues like leaks, it is best to repeatedly perform a set of interactions, such that all temporary state should be cleaned up after the interactions. When you are done exercising the application, press Alt+Shift+T in the browser window. You will see an alert in the browser window when trace generation is complete. In the terminal, you will see output like the following:

bin/meminsight run /tmp/htmlTest1
ing app /tmp/htmlTest1
ing /tmp/htmlTest1 on http://localhost:8888
Jun 02 2015 16:51:51 GMT-0700 (PDT) Server is listening on port 8082
Jun 02 2015 16:52:01 GMT-0700 (PDT) Connection accepted.  Type 'end' and press enter to stop trace generation in the app.

ping tracingTue Jun 02 2015 16:52:04 GMT-0700 (PDT) Peer 127.0.0.1 disconnected. 1006 Connection dropped by remote peer.
leting lifetime analysis...lifetime finish: 57ms

of app  complete

When the program exits, the trace collection and post-processing is complete.

node.js apps

A node.js application can be exercised using the meminsight noderun command. E.g., to exercise script main.js in instrumented app /tmp/app, run:

./bin/meminsight noderun /tmp/app/main.js args

args are the usual command-line arguments for main.js. At this point, we only support node programs that can be exited without killing them at the command line. Once the instrumented program exits, the lifetime analysis will run, after which trace collection and post-processing is complete.

Inspecting Results

To inspect the MemInsight results for an instrumented app in /tmp/app that has been exercised as described above, run:

./bin/meminsight inspect /tmp/app

(For online instrumentation, pass the path to the original node.js app or the output path passed to the meminsight proxy command.) Then, open http://localhost:9000 in your browser to view the GUI.

Issue Table View

Issue Table View

When opened, the GUI first shows the issue table view, as seen above. Note that it may take some time for this page to load, as the server is doing further processing of the trace data. The page includes explanations for the different columns. You can sort by any of the columns, or filter on any table text using the search box. The absolute values in the “Leak,” “Staleness,” “Inline-able,” and “Non-escaping” columns are not meaningful at this time; they are simply intended to rank the issues by interestingness or severity. Clicking on any site will open the Allocation Site View for the site (described further below) in a separate tab.

All-Sites Timeline

All-Sites Timeline

An alternate high-level view is the all-sites timeline, obtained by clicking the “All-Sites Timeline” link at the top of the page. As seen above, this view shows a timeline of the number of live objects during the execution. The darker plot in the timeline shows the number of stale objects across time (an object is stale if it is still reachable but will not be used again during the execution). Hovering over the timeline shows the number of allocated and stale objects at that time. (Note that time is a synthetic measure, unrelated to the wall-clock running time of the app.) The left and right pie charts at the bottom respectively break down the number of allocated and stale objects by allocation site. Clicking on a new point on the timeline will update the pie charts. (Initially, the selected time corresponds to the maximum number of allocated objects.) Clicking the pie slice for any allocation site will open up the Allocation Site View for that site in a separate tab.

Allocation Site View

Allocation Site View

The allocation site view gives detailed information on the behavior of objects allocated from a particular site (an object literal, constructor call, function literal, etc.). The upper left is a source view that initially highlights the allocation site. The upper right is a call tree, showing the call stacks under which the site executed. Clicking any node in the call tree will update the source view accordingly. The lower left shows a timeline of allocated and stale objects for the site, analogous to the all-sites timeline. Clicking on a particular time point populates the lower-right access paths view with the paths through the heap that lead to objects from the site at that point in time. Like the call tree, clicking a node in the access path tree will update the source view.

License

MemInsight is distributed under the Apache License.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.