Skip to content
forked from saulpw/visidata

A console spreadsheet tool for discovering and arranging data

License

Notifications You must be signed in to change notification settings

cwbooth5/visidata

 
 

Repository files navigation

VisiData v0.41

A curses interface for exploring and arranging tabular data

Usable via any remote shell which has Python3 installed.

VisiData silent demo

Features

  • browse first rows of huge csv/tsv/xlsx files immediately
  • F1 for command help sheet
  • open .csv, .tsv, .json, .hdf5, .xlsx
  • Ctrl-Save .csv, .tsv
  • hjkl cursor movement, top/middle/bottom scroll to position screen cursor
  • [/] sort asc/desc by one column
  • edit cell contents
  • search/select/unselect by regex in column
  • Frequency table for current column with histogram
  • inner/outer/full/diff joins on any number of sheets, matching designated key columns
  • add new column by Python expression
  • Ctrl-O to eval an expression and browse the result as a python object
  • watch long-running sheets load asynchronously
  • : split column (on any sheet)
  • + join selected columns on Columns sheet

Metasheets

  • Sheets metasheet to manage/navigate multiple sheets
  • Columns metasheet
  • Options sheet to change the style or behavior
  • ^Error metasheet
  • ^Threads metasheet

Columns

On the Columns sheet, these commands apply to rows (the columns of the source sheet), instead of the columns on the Columns sheet

  • - hides column (sets width to 0)
  • _ maximizes column width to fit longest value
  • ! marks column as a key column (pins to the left and matches on sheet joins)

Column typing

  • columns start out untyped (unless the source data is typed)
    • # sets column type to int
    • $ sets column type to str
    • % sets column type to float
    • @ sets column type to date
    • ~ autodetects column type
  • all values are stored in their original format, and only converted on demand and as needed.
  • values that can't be properly converted are flagged with ~ on the display
  • for commands like sort which require a correctly typed value, the default (0) value for that type is used
  • cell edits are rejected if they don't convert to the column type

Installation

    $ pip3 install visidata

Dependencies

  • Python 3.3
  • openpyxl (if opening .xlsx files)
  • h5py and numpy (if opening .hdf5 files)
  • google-api-python-client (if opening Google Sheets; must also set up OAuth credentials
  • dateutil.parser (if converting string column to datetime)

Remember to install the Python3 versions of these modules with e.g. pip3

Usage

    $ vd [-r/--readonly] [<input> ...]

Inputs may be paths or URLs. If no inputs are given, starts exploring the current directory. Unknown filetypes are by default viewed with a text browser.

Commands

Definitions of terms used in the help and documentation:

  • 'go': move cursor

  • 'move': change layout of visible data

  • 'show': put on status line

  • 'scroll': change set of visible rows

  • 'push': move a sheet to the top of the sheets list (thus making it immediately visible)

  • 'open': create a new sheet from a file or url

  • 'load': reload an existing sheet from in-memory contents

  • 'jump': change to existing sheet

  • 'drop': drop top (current) sheet

  • 'this': current [row/column/cell] ('current' is also used)

  • 'abort': exit program immediately

F1 opens the Help Sheet, which shows the available commands along with a brief description. This sheet can be searched, sorted, and filtered just like any other sheet.

Here are slightly better descriptions of some non-obvious commands:

  • the "global prefix": always applies to the next command only, but could mean "apply to all columns" (as with the regex search commands) or "apply to selected rows" (as with delete) or "apply to all sheets" (as with q). The global_action column on the Help Sheet shows the specific way the global prefix changes each command.

  • = "add column expression" takes a Python expression as input and appends a new column, which evaluates the expression over the row.

  • Ctrl-Save sheet: the output type is determined by the file extension (currently .tsv and .csv)

  • R sets the source type of the current sheet. The current sheet remains until a reload (Ctrl-R).

  • When sheets are joined, the rows are matched by the display values in the key columns. Different numbers of key columns cannot match (no partial keys and rollup yet). The join types are:

    • &: Join all selected sheets, keeping only rows which match keys on all sheets (inner join)
    • +: Join all selected sheets, keeping all rows from first sheet (outer join, with the first selected sheet being the "left")
    • *: Join all selected sheets, keeping all rows from all sheets (full join)
    • ~: Join all selected sheets, keeping only rows NOT in all sheets (diff join)
  • Edits made to a joined sheet are by design automatically reflected back to the source sheets.

Credits/Contributions

VisiData was created by Saul Pwanson <vd@saul.pw>.

VisiData is currently under active development (as of January 2017).

VisiData needs lots of usage and testing to help it become useful and dependable. If you are actively using VisiData, please let me know! Maybe there is an easy way to improve the tool for both of us.

Also please create a GitHub issue if anything doesn't appear to be working right. If you get an unexpected error (on the status line), please include the full stack trace that you get with ^E.

Please contact me at the email address above if you would like to contribute in some other way.

Inspirations and Related Work

License

VisiData is licensed under GPLv3.

About

A console spreadsheet tool for discovering and arranging data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%