shaho1090/import-data

The Story

Assume you have a huge file to import into a database, such as a text log file with more than 10 million lines. I experimented with many ways to handle this problem while building this app. I had a simple text file with a specific structure to import, so I needed to parse the file line by line. I created a command to fetch and parse it:

php artisan import:from-text "filename.txt"

Running this command is where our story starts. Just make sure that the file is in this directory:

"storage/app/logFiles"

This command fetches the full path of the file and passes it to a job. In an earlier version of the app, I did the entire implementation inside this job, but later I extracted separate classes to decouple the implementation and logic, following the single-responsibility principle.
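
For illustration, here is a minimal sketch of what such a command could look like. The class and job names (ImportFromText, ImportTextLogFileJob) are assumptions, not necessarily the ones used in this repository:

    <?php

    namespace App\Console\Commands;

    use App\Jobs\ImportTextLogFileJob;
    use Illuminate\Console\Command;

    class ImportFromText extends Command
    {
        protected $signature = 'import:from-text {filename}';

        protected $description = 'Parse a text log file and import it into the database';

        public function handle(): int
        {
            // Build the full path from the expected directory.
            $path = storage_path('app/logFiles/' . $this->argument('filename'));

            if (! file_exists($path)) {
                $this->error("File not found: {$path}");

                return self::FAILURE;
            }

            // Hand the heavy lifting off to a queued job.
            ImportTextLogFileJob::dispatch($path)->onQueue('import-text-log-file');

            return self::SUCCESS;
        }
    }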
The queue worker can be started before or after the previous command:

php artisan queue:listen --queue="import-text-log-file"

You may object that the better way to run queue workers is "queue:work", but trust me, I tested it many times and spent a lot of time on this. If you run the queue workers with that command, you will encounter the memory limit error.
As Mohamed Said mentioned in this link, if you want to prevent memory leaks while the queue workers are running, you should restart them before the leak builds up, and he suggests some ways to do that.
But in our case the better way is to use queue:listen instead, and you will never face this issue. Because queue:listen boots the framework fresh for every job, memory cannot accumulate; the trade-off is that it runs the jobs a little more slowly.
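
If you do want to stay with queue:work, Laravel's worker flags can bound a worker's lifetime so that a process supervisor restarts it before memory builds up; for example:

php artisan queue:work --queue="import-text-log-file" --max-jobs=1000 --max-time=3600

(This assumes something like Supervisor relaunches the worker after it exits.)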
The job is responsible for sending three parameters to a class that parses and inserts the lines: the file path, startLine, and endLine. The TextLogFileParserService imports ten lines of the file at a time. Why 10 lines? Because I tested this as well: if you set the chunk to a bigger number, such as 20 lines, the time to insert the lines into the database steadily increases, and after a while the queue workers become very slow.
After passing those three parameters to the parser class, if there are lines remaining to parse and import, the job dispatches itself again to process the next ten lines, repeating until the last line of the file.
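
Here is a minimal sketch of that self-dispatching pattern. The job name, the parser's parse() signature, and its boolean return value are assumptions made for illustration:

    <?php

    namespace App\Jobs;

    use App\Services\TextLogFileParserService;
    use Illuminate\Bus\Queueable;
    use Illuminate\Contracts\Queue\ShouldQueue;
    use Illuminate\Foundation\Bus\Dispatchable;
    use Illuminate\Queue\InteractsWithQueue;

    class ImportTextLogFileJob implements ShouldQueue
    {
        use Dispatchable, InteractsWithQueue, Queueable;

        private const CHUNK_SIZE = 10;

        public function __construct(
            private string $filePath,
            private int $startLine = 1,
        ) {
        }

        public function handle(TextLogFileParserService $parser): void
        {
            $endLine = $this->startLine + self::CHUNK_SIZE - 1;

            // Parse and insert ten lines; assume the parser reports
            // whether any lines remain after endLine.
            $moreLinesRemain = $parser->parse($this->filePath, $this->startLine, $endLine);

            // Re-dispatch for the next chunk until the file is exhausted.
            if ($moreLinesRemain) {
                self::dispatch($this->filePath, $endLine + 1)
                    ->onQueue('import-text-log-file');
            }
        }
    }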

We also have an endpoint to filter and count the imported data:

localhost:8000/api/logs/count

Example usage:

  • Filter by service name:
localhost:8000/api/logs/count?serviceName=invoice-service
  • Filter by status code:
localhost:8000/api/logs/count?statusCode=201
  • Filter by start date and end date:
localhost:8000/api/logs/count?startDate=2022-03-17&endDate=2022-03-20
  • Also, you can combine filters together:
localhost:8000/api/logs/count?serviceName=invoice-service&startDate=2022-03-17&endDate=2022-03-20
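
One way such filtering could be wired up is with conditional query clauses; the Log model, the controller name, and the column names below are assumptions, not necessarily what the repository uses:

    <?php

    namespace App\Http\Controllers;

    use App\Models\Log;
    use Illuminate\Http\JsonResponse;
    use Illuminate\Http\Request;

    class LogCountController extends Controller
    {
        public function __invoke(Request $request): JsonResponse
        {
            $count = Log::query()
                // Apply each filter only when its query parameter is present.
                ->when($request->query('serviceName'), fn ($q, $name) => $q->where('service_name', $name))
                ->when($request->query('statusCode'), fn ($q, $code) => $q->where('status_code', $code))
                ->when($request->query('startDate'), fn ($q, $date) => $q->whereDate('logged_at', '>=', $date))
                ->when($request->query('endDate'), fn ($q, $date) => $q->whereDate('logged_at', '<=', $date))
                ->count();

            return response()->json(['count' => $count]);
        }
    }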

You may want to create your own file with a specific number of lines for testing:

php artisan create:log-file <filename> <numberOfLines>

php artisan create:log-file "testFile" 100000
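
A generator for such a file could be as simple as the following sketch; the class name and the per-line format are placeholders, since the real log format in the repository likely differs:

    <?php

    namespace App\Console\Commands;

    use Illuminate\Console\Command;

    class CreateLogFile extends Command
    {
        protected $signature = 'create:log-file {filename} {numberOfLines}';

        protected $description = 'Generate a test log file with the given number of lines';

        public function handle(): int
        {
            $path = storage_path('app/logFiles/' . $this->argument('filename') . '.txt');
            $lines = (int) $this->argument('numberOfLines');

            $handle = fopen($path, 'w');

            // Write one entry per line; this format is only a placeholder.
            for ($i = 1; $i <= $lines; $i++) {
                fwrite($handle, "invoice-service 201 2022-03-17 entry {$i}\n");
            }

            fclose($handle);

            $this->info("Created {$path} with {$lines} lines.");

            return self::SUCCESS;
        }
    }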

Installation

I used PHP 8.2 and MySQL 8. Just create your own database, clone the repository, open the project directory, create your own .env file, and set the keys inside it. Do not forget to set the queue connection key:

QUEUE_CONNECTION=database
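
The related database entries in .env follow the standard Laravel set; the values below are placeholders for your local setup:

    DB_CONNECTION=mysql
    DB_HOST=127.0.0.1
    DB_PORT=3306
    DB_DATABASE=import_data
    DB_USERNAME=root
    DB_PASSWORD=secret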

Then, run the following commands:

- composer install
- php artisan key:generate
- php artisan migrate
- php artisan serve

That's it!

Notice: the app normally runs on port 8000; the exact URL is shown after running the last command.
