EnjinScraper

Scrapes an Enjin site via the Enjin API

API calls used are described in detail here.

Usage

Note that this is still a work in progress and as such installation and usage requires manual installation of dependencies and configuration of the config.json file. When done, I may opt to publish as a global npm package or distribute as a CLI electron app.

Installation

git clone https://github.com/Kas-tle/EnjinScraper.git
cd EnjinScraper
yarn

Configuration

Obtaining an API key

Per Enjin's instructions:

To enable your API, visit your admin panel / settings / API area. The content on this page includes your base API URL, your secret API key, and the API mode. Ensure that the API mode is set to "Public".

Obtaining Forum Module IDs

This can be obtained in the admin panel of your site under "Modules". Using the left side panel, you can filter to the type "Forum Board". Make a list of the Module IDs you wish to scrape in the config.json file as shown below.

Obtaining News Module IDs

This can be obtained in the admin panel of your site under "Modules". Using the left side panel, you can filter to the type "News / Blog". Make a list of the Module IDs you wish to scrape in the config.json file as shown below.

Configuring the `config.json` file

Create a config.json file in the root directory of the project. The file should look like this:

{
    "apiKey": "someapiKey", // Required
    "domain": "www.example.com", // Required
    "email": "someemail@email.com", // Required
    "password": "somepassword", // Required
    "sessionID": "someSessionID", // Optional, otherwise it will be fetched automatically
    "forumModuleIDs": [ // Optional, otherwise no forums will be scraped
        "1000001",
        "1000002"
    ],
    "newsModuleIDs": [ // Optional, otherwise no news will be scraped
        "1000001",
        "1000002"
    ],
    "disabledModules": {
        "forums": false,
        "news": false,
        "tickets": false,
        "applications": false,
        "users": false,
        "usertags": true
    }
}

Running

npx ts-node index.ts

Outputs

The scraper will output a single json file for each module scraped in the target directory. This may be improved in the future with some sort of database export type.

TODO

Add support for scraping wikis (Kas-tle#5)
Add support for downloading referenced images and attachments (Kas-tle#4)
Add more options for user data scraping (Kas-tle#8)
Add support for scraping galleries (Kas-tle#6)
Add support for scraping ticket replies (Kas-tle#1)
Export to database

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.ts		index.ts
package.json		package.json
site.sqlite		site.sqlite
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EnjinScraper

Usage

Installation

Configuration

Obtaining an API key

Obtaining Forum Module IDs

Obtaining News Module IDs

Configuring the `config.json` file

Running

Outputs

TODO

About

Releases

Packages

Languages

License

HippieBeak/EnjinScraper

Folders and files

Latest commit

History

Repository files navigation

EnjinScraper

Usage

Installation

Configuration

Obtaining an API key

Obtaining Forum Module IDs

Obtaining News Module IDs

Configuring the config.json file

Running

Outputs

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Configuring the `config.json` file

Packages