honduras_tse_pdf

Creator of PDF files from "TSE" in the Amazon S3 bucked, storing all data in your local filesystem.

The app techtes the API.TSE.HN to retrieve the number of votes for Presidential race, only

Description

This application runs on Mac/Linux may need to be updated for Windows. Uses PDF Creator, Image processing, and JSON request.

Using python, to connect to the list of S3 buckets related to the Honduran TSE, retrieving all instances of each MER MER or Mesa Electoral Receptora is a unique ID for each table.

The tables are then fetched from Amzzon S3's bucket, and most likely pushed after the ballots are counted..

TSE uses the format : [ER_ID]106.JPG for president, [MER_ID]606.JPG for alcalde, [MER_ID]405.JPG for diputado.

Setup and initial configuration

Run the following programs in multiple nodes in your cloud

Install PILLOW and FPDF in your machine with Python 2.7

git clone https://github.com/python-pillow/Pillow.git
git clone https://github.com/reingart/pyfpdf

Configuration after git clone is perfomed

cd Pillow 
python setup.py install
cd ..
cd pfpdf
python setup.py install
cd ..

Install all TSE Images and Resources:

chnod +x  downloads.sh 
./downloads.sh

Make sure all JPEGs install with metadata from S3 bucket, the files are in JPEG format from API.TSE.HN The created file structure would be:

alcaldes/*
diputados/*
presidente/*
escrutinioesp/*
pdfs/*

All the "ACTAS" will be pulled from alcaldes/* diputados/* and presidente/* and merged into pdfs, the metadata from the PDF created comes from the timestamp is arriving from AWS from the header data.

And run the follwing in several machines.

python processtoPDF.py 10000 11000 pdfs/data_1000to11000.csv 
python processtoPDF.py 11001 12000 pdfs/data_1100to12000.csv

Finding Image Resoultions and Votind Data

You can create a table with the JPEG image sizes, and all the votes for each prescint in CSV format. This is a quick and dirty way of doing that by redirecting output to the CSV file, but we can use CSV module on python. additionally, the app extracts the resolution of all images and inersts them as P_W and P_H: President Width and HEIGHT and so on.

python findresolutions.py 1 18180  > data_mining.csv

find_by_data.py

This program finds all "ACTAS" with a timestamp by date and sorts them out in a searchable file. Also counts all votes added per date. And uses the data_ming.csv talbe created by findresolutions.py

python find_By_date.py directory data_mining.csv

Dropbox with all PDF files

[Dropbox] ("https://www.dropbox.com/sh/cbqdmiu72er8c6a/AAAqmEAL4AZUrXcQziM7GLjja?dl=0") -- Repository with all PDF Files

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.gitignore		.gitignore
README.md		README.md
data_mining.csv		data_mining.csv
downloads.sh		downloads.sh
fetchbydate.py		fetchbydate.py
findresolutions.py		findresolutions.py
imagesearch.py		imagesearch.py
parselista.py		parselista.py
processtoPDF.py		processtoPDF.py
votecount.py		votecount.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

honduras_tse_pdf

Description

Setup and initial configuration

Configuration after git clone is perfomed

Install all TSE Images and Resources:

Finding Image Resoultions and Votind Data

find_by_data.py

Dropbox with all PDF files

About

Releases

Packages

Languages

win2013/honduras_tse_pdf

Folders and files

Latest commit

History

Repository files navigation

honduras_tse_pdf

Description

Setup and initial configuration

Configuration after git clone is perfomed

Install all TSE Images and Resources:

Finding Image Resoultions and Votind Data

find_by_data.py

Dropbox with all PDF files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages