AWSome-NLP

This full stack application was made to translate English AWS blog posts into low-resource languages in order to improve the language accessibility for people who may not have access to good translation services.

Translation models

This application uses Amazon Translate and our own, fine tuned Transformer based model hosted on AWS SageMaker in order to translate the blogposts. (Note that the SageMaker model currently only translates the blog posts into Turkish but this can be extended into other languages).

Services

In order to build this application we use the following AWS services:

Amazon Translate to translate blog posts
AWS Amplify to host and deploy our application
AWS Lambda to build logic (for example to scrape the blog posts and save them)
DynamoDB to store translations and ratings for those translations so that it acts as a cache
AWS Step Functions for control flow within the application
AWS AppSync for GraphQL queries and to facilitate communication between different services
AWS S3 to store translations
AWS CloudSync for frontend aesthetics

Tools

The tools that we use in order to build this application are:

React for the frontend (which can be started by running $ npm install & npm start)
AWS Lambda for logic (which can be found on the AWS Console and run using the provided test templates)

Project Architecture

Below is a diagram of the project architecture:

Subdirectories

This Project has many moving parts and therefore has READMEs in each subdirectory. Below is a brief description of each subdirectory:

Frontend: This is the React frontend for the application.
Checking URL: This is the AWS Lambda function that checks if the URL is valid.
Get Blog Content: This is the AWS Lambda function that scrapes the blog content from the AWS blog.
Step Function Invoker: This is the AWS Lambda function that invokes the AWS Step Function.
Storing Translation: This is the AWS Lambda function that stores the translation in DynamoDB.
User Config Function: This is the AWS Lambda function processes the user's configuration and queries the Translation.
CDK Infrastructure as Code: This is code that creates all of the infrastucture for the application.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
.github/workflows		.github/workflows
Model		Model
__mocks__		__mocks__
amplify		amplify
cdk-init-ts		cdk-init-ts
public		public
src		src
.eslintignore		.eslintignore
.gitignore		.gitignore
.graphqlconfig.yml		.graphqlconfig.yml
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
ProjectArchitecture.png		ProjectArchitecture.png
README.md		README.md
amplify.yml		amplify.yml
aws-exports.js		aws-exports.js
babel.config.js		babel.config.js
package-lock.json		package-lock.json
package.json		package.json
react-app.iml		react-app.iml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AWSome-NLP

Translation models

Services

Tools

Project Architecture

Subdirectories

About

Releases

Packages

Contributors 5

Languages

Tezzish/awsome-nlp

Folders and files

Latest commit

History

Repository files navigation

AWSome-NLP

Translation models

Services

Tools

Project Architecture

Subdirectories

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages