Data Engineer/ Backend Engineer 🔨💻
-
Data Engineer / Solutions Engineer
- Pakistan
-
08:09
(UTC -12:00) - https://danielox.github.io/
- in/danial-shabbir
Highlights
Pinned Loading
-
Data-Engineering-Workflow-DUCKDB
Data-Engineering-Workflow-DUCKDB PublicA Complete Data Engineering Workflow, Data Modelling and Advanced Analytics using Python, DuckDB
-
factory-design-pattern
factory-design-pattern PublicAn Implementation of Factory Design Pattern in Python. It does not use static calling of subclasses but rather automatically gets the name of the concrete class as keys to invoke the factory creato…
Python
-
Natural Language Feature Extraction ...
Natural Language Feature Extraction | Bag of Words (with | using) Scikit-Learn Python 1from sklearn.feature_extraction.text import CountVectorizer
23# corpus source [ https://en.wikipedia.org/wiki/Baseball ]
45corpus = """Baseball is a bat-and-ball game played between two opposing teams who take turns batting and fielding. The game proceeds when a player on the fielding team, called the pitcher, throws a ball which a player on the batting team tries to hit with a bat. The objective of the offensive team (batting team) is to hit the ball into the field of play, allowing its players to run the bases, having them advance counter-clockwise around four bases to score what are called "runs". The objective of the defensive team (fielding team) is to prevent batters from becoming runners, and to prevent runners' advance around the bases.[2] A run is scored when a runner legally advances around the bases in order and touches home plate (the place where the player started as a batter). The team that scores the most runs by the end of the game is the winner."""
-
Natural Language Feature Extraction ...
Natural Language Feature Extraction | Bag of Words (with | using) NLTK Python 1import nltk
2import string
3from collections import defaultdict
45# Sample gutenberg corpus loaded from nltk.corpus
-
markov-chain-problem
markov-chain-problem PublicI am using Markov transition probabilities as a feature matrix for a machine learning algorithm but the problem is for a single document D it will create R rows for each state S R = D x S whereas t…
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.