
The main aim of this project is to provide career recommendations to the users based on their StackOverflow activity, present temporal trends in technology along with future projections. Finally, we perform tag predictions based on the questions asked by the user. Additionally, we also implement a semantic search technique that takes into accoun…


sampasha/stack-connect


Logo

This project aims to help individuals gain insights into the latest technologies and recommends relevant job postings based on their StackOverflow activity. The job postings are scraped from LinkedIn and Indeed and combined with reviews from Glassdoor to provide a compact solution to the end user.


Introduction

For this project, we started by scraping job postings from the Indeed and LinkedIn job portals. We then used the publicly available StackOverflow dataset hosted in BigQuery to get user information such as the questions asked, the answers provided, and overall profile statistics like upvotes and downvotes.
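As a rough illustration of the BigQuery step, the sketch below builds a parameterized query for one user's questions. The table name `bigquery-public-data.stackoverflow.posts_questions` and the column names are assumptions about the public dataset and should be verified in the BigQuery console; the helper names are hypothetical and not taken from this repository.

```python
# Sketch only: the table and column names below are assumptions about the
# public StackOverflow dataset, not confirmed by this project.
QUESTIONS_TABLE = "bigquery-public-data.stackoverflow.posts_questions"


def build_user_questions_query(limit: int = 100) -> str:
    """Return a parameterized SQL string that selects one user's questions."""
    return (
        "SELECT id, title, tags, score, creation_date\n"
        f"FROM `{QUESTIONS_TABLE}`\n"
        "WHERE owner_user_id = @user_id\n"
        "ORDER BY creation_date DESC\n"
        f"LIMIT {int(limit)}"
    )


def fetch_user_questions(user_id: int, limit: int = 100):
    """Run the query. Needs google-cloud-bigquery and valid credentials."""
    # Imported lazily so the query builder can be used without the library.
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("user_id", "INT64", user_id)
        ]
    )
    rows = client.query(
        build_user_questions_query(limit), job_config=job_config
    ).result()
    return [dict(row) for row in rows]
```

Passing the user id as a query parameter rather than formatting it into the SQL string keeps the query safe to reuse across users.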

We also consolidated salary insights and company reviews from the publicly available Kaggle Glassdoor dataset. We extracted relevant tags from the questions and answers in the user's StackOverflow profile and matched them against the key skills in the descriptions of the scraped job posts to suggest jobs to the end user. We also present temporal trends in various technologies and near-term projections of those trends, so users know what to expect and can work towards equipping themselves with the right technologies.
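The matching step can be pictured with a small sketch. It assumes a fixed vocabulary of known skills and scores each job by the Jaccard overlap between the user's tags and the skills found in the job description. Both the scoring rule and the function names are illustrative assumptions, not necessarily what the project uses.

```python
from __future__ import annotations

import re


def extract_skills(description: str, known_skills: set[str]) -> set[str]:
    """Return the known skills that appear as tokens in a job description."""
    # Keep characters such as '+', '#' and '.' so "c++", "c#", "node.js" survive.
    tokens = re.findall(r"[a-z0-9+#.]+", description.lower())
    # Drop sentence-ending periods that got attached to a token.
    cleaned = {token.strip(".") for token in tokens}
    return known_skills & cleaned


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_jobs(
    user_tags: set[str], jobs: dict[str, str], known_skills: set[str]
) -> list[tuple[str, float]]:
    """Rank job titles by overlap between user tags and required skills."""
    scored = [
        (title, jaccard(user_tags, extract_skills(description, known_skills)))
        for title, description in jobs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

For example, a user with tags `{"python", "sql"}` would rank a posting that mentions Python, SQL and Spark above one that only mentions React and JavaScript.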

We have also implemented a semantic search strategy that improves upon the existing search utility in StackOverflow by taking into account the popularity and sentiment of the user answers. Finally, we predict tags for the question asked by the user so that it reaches the right audience.
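One way to combine these signals is a weighted sum, sketched below. The bag-of-words cosine similarity, the weights, and the assumed input ranges (popularity in [0, 1], sentiment in [-1, 1]) are illustrative assumptions; the project's actual scoring may differ.

```python
import math
from collections import Counter


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words vectors of two texts."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def combined_score(query, answer_text, popularity, sentiment,
                   weights=(0.6, 0.2, 0.2)):
    """Weighted sum of similarity, popularity and sentiment.

    popularity is assumed to be pre-scaled to [0, 1] and sentiment to
    [-1, 1]; the default weights are hypothetical.
    """
    w_sim, w_pop, w_sent = weights
    similarity = cosine_similarity(query, answer_text)
    # Map sentiment from [-1, 1] to [0, 1] so all terms share one scale.
    return w_sim * similarity + w_pop * popularity + w_sent * (sentiment + 1) / 2


def rank_answers(query, answers):
    """answers: list of (text, popularity, sentiment); best match first."""
    return sorted(
        answers,
        key=lambda ans: combined_score(query, ans[0], ans[1], ans[2]),
        reverse=True,
    )
```

With equal text similarity, the answer with higher popularity or more positive sentiment is ranked first, which is the behaviour described above.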

The architecture diagram of the entire system is shown below:

architecture

Demo

HomePage

homepage

Login

homepage

Profile

homepage

Semantic Search

homepage

GettingStarted

Prerequisites

Install Streamlit

pip install streamlit

Set up third-party libraries

Run this command inside the web-interface folder:

pip install -r requirements.txt

Get the service account key for Google authentication

You can follow this link to set it up

Get the credentials JSON for pushing data to Firebase

You can follow this link to set it up

Deploying

We use Streamlit to deploy our web application.

Inside the web-interface folder, run the command below:

streamlit run app.py

Visualizations

The interactive visualizations available in the web app show temporal trends, which let the user see which technologies have been trending over time. The user can also view location-based trends, which give detailed information about the popularity of languages in different cities.

Wordcloud

The wordcloud helps us visualize the top technologies in any given year. Popularity is denoted by font size: a larger font means higher popularity.

wordcloud
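The idea behind the wordcloud can be sketched without any plotting library: count tags for a given year, then scale each count linearly to a font size. The input shape (pairs of year and pipe-delimited tags) and the size range are assumptions made for illustration.

```python
from collections import Counter


def tag_counts_for_year(questions, year):
    """Count tags for one year.

    questions: iterable of (year, "tag1|tag2") pairs; the pipe-delimited
    tag format is an assumption about how the data is stored.
    """
    counts = Counter()
    for question_year, tags in questions:
        if question_year == year:
            counts.update(tag for tag in tags.split("|") if tag)
    return counts


def font_sizes(counts, min_size=10, max_size=60):
    """Map each tag count linearly onto [min_size, max_size]."""
    if not counts:
        return {}
    lowest, highest = min(counts.values()), max(counts.values())
    span = highest - lowest
    sizes = {}
    for tag, count in counts.items():
        if span == 0:
            # All tags are equally popular, so give them the same size.
            sizes[tag] = max_size
        else:
            scaled = (count - lowest) / span
            sizes[tag] = round(min_size + scaled * (max_size - min_size))
    return sizes
```

The resulting dictionary of tag to font size could then be handed to whichever rendering library draws the cloud.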

LineChart

The linechart helps the user visualize how the popularity of any given programming language / technology has changed over time.

linechart
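A minimal sketch of the data behind such a line chart is shown below, together with a simple projection. The projection uses an ordinary least-squares line as a stand-in; the document does not state which forecasting method the project uses, so this is an assumption, as are the function names and input shape.

```python
def yearly_share(counts_by_year, tag):
    """Share of questions carrying `tag` per year.

    counts_by_year: {year: {tag: count}} (hypothetical input shape).
    """
    shares = {}
    for year in sorted(counts_by_year):
        total = sum(counts_by_year[year].values())
        shares[year] = counts_by_year[year].get(tag, 0) / total if total else 0.0
    return shares


def linear_projection(series, target_year):
    """Fit a least-squares line to {year: value} and evaluate it at target_year."""
    years = list(series)
    values = [series[year] for year in years]
    n = len(years)
    if n < 2:
        raise ValueError("need at least two points to fit a line")
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    return mean_y + slope * (target_year - mean_x)
```

A straight-line fit is only reasonable for short horizons, which matches the "immediate future" scope described earlier.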

Location Based Trends

The location-based trends help users visualize the top technologies in every city.

location
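The per-city aggregation can be sketched as a simple group-and-count. Here we assume the input is a list of (city, skills) pairs, for example derived from job postings; the actual data source and field names in the project may differ.

```python
from collections import Counter, defaultdict


def top_tech_by_city(postings, top_n=3):
    """Return the most frequent technologies per city.

    postings: iterable of (city, [skills]) pairs (hypothetical shape).
    """
    per_city = defaultdict(Counter)
    for city, skills in postings:
        per_city[city].update(skills)
    return {
        city: [skill for skill, _ in counts.most_common(top_n)]
        for city, counts in per_city.items()
    }
```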

TechStack

Summary

The main aim of our project was to provide career recommendations to users based on their StackOverflow activity. We also committed to presenting temporal trends in technology along with future projections of those trends. Finally, we aimed to perform tag prediction based on the question asked by the user. We accomplished these aims using various data science tools and techniques, and the resulting recommendations and analyses are provided to the user. Additionally, we implemented a semantic search technique that takes into account the popularity of the user, the sentiment of the answers, and cosine similarity to improve search results.

FutureWork

In the future, we plan to integrate other platforms such as HackerRank and GeeksforGeeks to fetch company interview questions and create a more robust platform.

We also plan to use different deep learning models and feature drill-downs to understand feature importances and improve our prediction models.

References

Authors
