Data Wrangler

by AlexLamson ST2/ST3

Make quick and dirty data mining easier in Sublime Text

Details

  • 1.1.0
  • github.com
  • github.com
  • 9 months ago
  • 18 minutes ago
  • 10 months ago

Installs

  • Total 70
  • Win 45
  • OS X 16
  • Linux 9
[Chart: daily installs by platform (Windows, OS X, Linux), Feb 7 to Mar 24]

Readme

DataWrangler - Sublime Plugin

Clean and analyze text data more easily

[Screenshot]

I used this to capture the screen.

Motivation

While cleaning data from a large survey, I wanted to ask questions about the data so I could clean it better. For example, many respondents typed the city they were from, but some misspelled it. By looking at the most common responses, I could quickly find the misspelled words and fix them.

Installation

Install via Package Control by searching for DataWrangler.

If you don't have Package Control, you can install it here.

Usage

To run a command, open the command palette (Ctrl+Shift+P on Windows) and type the name of the command you want.

If a command expects data arranged in rows and columns, tabs are the preferred separator (it also tries to be clever if you are using another separator, e.g. commas). Tabs were chosen because they make it easy to copy and paste from Google Sheets and Excel spreadsheets.
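
To make the separator behavior concrete, here is a minimal Python sketch of one way a tab-first guess could work. It is an illustration only, not the plugin's actual code: the names `guess_separator` and `split_rows`, the candidate list, and the "same count on every line" rule are all assumptions made for this example.

```python
# Hypothetical sketch: not taken from the DataWrangler source.
# Candidates are tried in order, so tabs win whenever they fit.
CANDIDATES = ["\t", ",", ";", "|"]


def guess_separator(text, candidates=CANDIDATES):
    """Return the first candidate that occurs the same nonzero number
    of times on every non-empty line, or None if none qualifies."""
    lines = [line for line in text.splitlines() if line.strip()]
    for sep in candidates:
        counts = {line.count(sep) for line in lines}
        # Exactly one distinct count, and that count is not zero.
        if len(counts) == 1 and 0 not in counts:
            return sep
    return None


def split_rows(text):
    """Split text into rows of cells using the guessed separator.
    If no separator is found, each line becomes a one-cell row."""
    lines = [line for line in text.splitlines() if line.strip()]
    sep = guess_separator(text)
    if sep is None:
        return [[line] for line in lines]
    return [line.split(sep) for line in lines]
```

Text pasted from a spreadsheet would be split on tabs, while a comma-separated paste would fall through to the comma candidate. A real implementation would also need to handle quoted fields, which this sketch ignores.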

Commands

All commands are non-destructive, and the results will appear in a new tab.

  • Line/word frequency
  • Flatten a list of lists
  • Vertically align all columns
  • Delete each column that contains a cursor
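
As a rough idea of what the first three commands compute, here is a short Python sketch. It is not the plugin's implementation: the function names, the `gap` parameter, and the exact handling of blank lines are assumptions for illustration. The fourth command depends on cursor positions in the editor, so it is left out.

```python
# Hypothetical sketch: not taken from the DataWrangler source.
from collections import Counter
from itertools import chain


def line_frequency(text):
    """Count identical non-empty lines, most common first."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    return Counter(lines).most_common()


def flatten(rows):
    """Turn a list of lists into one flat list, preserving order."""
    return list(chain.from_iterable(rows))


def align_columns(rows, gap=2):
    """Pad cells with spaces so each column starts at the same offset."""
    n_cols = max((len(row) for row in rows), default=0)
    # Width of each column is the length of its longest cell.
    widths = [
        max((len(row[i]) for row in rows if i < len(row)), default=0)
        for i in range(n_cols)
    ]
    out = []
    for row in rows:
        cells = [cell.ljust(widths[i] + gap) for i, cell in enumerate(row)]
        out.append("".join(cells).rstrip())  # drop padding after last cell
    return "\n".join(out)
```

For the survey example from the Motivation section, running `line_frequency` over a column of city names puts the common spellings at the top and the rare ones, which are often typos, at the bottom.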