-
Notifications
You must be signed in to change notification settings - Fork 3
/
README.txt
79 lines (54 loc) · 2.03 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
LogScraper
==========
A generic library for gathering stats from log files by running regexes
on them. Things you can do: \* Create and run any number of regexes on
any number of files in parallel. \* Aggregate stats by creating named
regex groups in your regexes \* Grab archived logs (so long as you tell
it where your archives live) \* Grab files from remote boxes \* Print
stats to console \* Print regex matches to console \* Search on gzipped
files
Installation
------------
The easiest manner of installation is to grab the package from the PyPI
repository.
::
pip install log_scraper
Usage
-----
Base Usage
^^^^^^^^^^
For off the cuff usage, you can just create a LogScraper object and tell
it what regexes to run and where to look for files. Eg.
::
from log_scraper.base import LogScraper
import log_scraper.consts as LSC
filepath = '/path/to/file'
filename = 'filename.ext'
scraper = LogScraper(default_filepath={LSC.DEFAULT_PATH : filepath, LSC.DEFAULT_FILENAME : filename})
scraper.add_regex(name='regex1', pattern=r'your_regex_here')
# To get aggregated stats
data = scraper.get_log_data()
# To print all the stats
scraper.print_total_stats(data)
# To print each file's individual stats
scraper.print_stats_per_file(data)
# To view log lines matching the regex
scraper.view_regex_matches(scraper.get_regex_matches())
The real power, though, is in creating your own class deriving from
LogScraper that presets the paths and the regexes to run so that anyone
can then use that anywhere to mine data from a process' logs.
Development
-----------
Dependencies
~~~~~~~~~~~~
- Python 2.7
- `paramiko <http://paramiko-www.readthedocs.org/en/latest/index.html>`_
Testing
~~~~~~~
To test successfully, you must set up a virtual environment On Unix, in
the root folder for the package, do the following:
``python -m virtualenv . source ./bin/activate ./bin/python setup.py develop``
Now you can make any changes you want and then run the unit-tests by
doing:
::
./bin/python setup.py test