Skip to content

production-quality template convention to scrap web sites based on cheerio

License

Notifications You must be signed in to change notification settings

molekilla/menio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

menio

production-quality template convention to scrap web sites based on cheerio

Installation

$ npm install menio

A Menio template

module.exports = {
    // Selector
    context: {
        selector: 'table > tr',
        index: 4
    },
    // Properties
    props: {
        
        render: function ($scope) {
            var numeros = $scope.$find('tr');

            // return scope and next = true if this only selects for another template, only available in callback mode
            return {
                $scope: numeros,
                next: true
            };
        }
    }
};

Another Menio template

module.exports = {
    context: 'td',
    props: {
        tipo: function ($scope) {
            // either return or assign to this.model
            return $scope.eq(0).text();
        },
        fecha: function ($scope) {
            return $scope.eq(1).text();
        },
        primero: function ($scope) {
            return {
                numero: $scope.eq(2).text(),
                serie: $scope.eq(4).text(),
                folio: $scope.eq(5).text(),
                verificacion: $scope.eq(6).text(),
                letras: $scope.eq(3).text()
            };
        },
        segundo: function ($scope) {
            return {
                numero: $scope.eq(7).text()
            };
        },
        tercero: function ($scope) {
            return {
                numero: $scope.eq(8).text()
            };
        },
        dateFormatISO: function () {
            return 'es-pa';
        },
        id: function($scope) {
          return this.tipo($scope) + this.fecha($scope);
        }
    }
};

Menio Docs

new Menio(templateDirectory)

menio.parse(htmlOrDOM, templateName, callback)

menio.render(options { template, data }, callback)

Callback(err, scope, model)

Menio Template Docs

Property(scope, cheerio, underscore)

Property(scope, cheerio, underscore)

Response(either value or options)

Response Options($scope, next), only available in callback mode

Example

        // add template directory
        var menio = new Menio(__dirname + '/templates');
        
        // async callback
        // 1st argument - either a $ element or HTML
        // 2nd argument - name of the template
        // 3rd and optional - callback
        menio.parse(html, 'loteria', function (err, $scope) {
            // callback returns error, scope and model
            var data = _.map(_.rest($scope, 3), function (row) {
                // sync returns results right away
                return menio.render({ template: 'row', data: row });
            });



            var matchWith = {
                tipo: 'Dominical',
                fecha: '01-09-2013',
                primero: {
                    numero: '0677',
                    serie: '4',
                    folio: '13',
                    verificacion: '',
                    letras: 'DDBC'
                },
                segundo: {
                    numero: '7260'
                },
                tercero: {
                    numero: '8783'
                },
                dateFormatISO: 'es-pa',
                id : 'Dominical01-09-2013'
            };
            expect(data[0]).toEqual(matchWith);
        });

    });

Changelog

  • 0.2.0: Better API with $scope.$find, context selector index and added render (same as parse) which accepts an options argument

License

Copyright 2015 Rogelio Morrell C. MIT License

About

production-quality template convention to scrap web sites based on cheerio

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published