ctrl+shift+p filters: :st2 :st3 :win :osx :linux
Browse

Data Wrangler

by AlexLamson ALL

Make quick and dirty data mining easier in Sublime Text

Details

Installs

  • Total 137
  • Win 87
  • Mac 33
  • Linux 17
Feb 28 Feb 27 Feb 26 Feb 25 Feb 24 Feb 23 Feb 22 Feb 21 Feb 20 Feb 19 Feb 18 Feb 17 Feb 16 Feb 15 Feb 14 Feb 13 Feb 12 Feb 11 Feb 10 Feb 9 Feb 8 Feb 7 Feb 6 Feb 5 Feb 4 Feb 3 Feb 2 Feb 1 Jan 31 Jan 30 Jan 29 Jan 28 Jan 27 Jan 26 Jan 25 Jan 24 Jan 23 Jan 22 Jan 21 Jan 20 Jan 19 Jan 18 Jan 17 Jan 16 Jan 15
Windows 0 0 1 1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1
Mac 0 2 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0
Linux 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Readme

Source
raw.​githubusercontent.​com

DataWrangler - Sublime Plugin

Clean and analyze text data more easily

"Screenshot" I used this to capture the screen

Motivation

While cleaning data from a large survey, I wanted to ask questions about the data so I could clean it better. For example, many people would type what city they were from, but some would misspell it. By looking at the most common responses, I could quickly find the mispelled words and fix them.

Installation

Install via Package Control by searching for DataWrangler.

If you don't have Package Control, you can install it here.

Usage

To run a command, open the command palette (ctrl+shift+P on Windows) and type the name of the function you want.

If a command expects data to be formatted in rows and columns of data, tabs are the preferred separator (it also tries to be clever if you are using another separator, ex. commas). This was chosen because it makes it very compatible with copy-pasting from Google Sheets and Excel spreadsheets.

Commands

All commands are non-destructive, and the results will appear in a new tab.

  • Line/word frequency
  • Flatten a list of lists
  • Vertically align all columns
  • Delete each column that contains a cursor