Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finished 100/100 alex tsankov #6

Open
wants to merge 5 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added 20140913.export.tde
Binary file not shown.
3,239 changes: 3,239 additions & 0 deletions Book2.twb

Large diffs are not rendered by default.

36 changes: 19 additions & 17 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,70 +1,72 @@
# Name

write-your-name
Alex Tsankov

# How many points have you earned?

0/100
100/100

(Make your own calculation and replace the number 0 with the points you think you've earned.)

# How many hours have you spent on this?

fill-in-your-answer
2.5

# What is the most difficult part about this week's challenge?

fill-in-your-answer
Getting the headers into the CSV file.

# Show and tell (10 points)

## Link (2 points)

[title-of-the-article](http://link-to-an-article-using_data_to_solve_a_social_problem)
[Using Big Data For Social Good](http://www.forbes.com/sites/ashoka/2014/08/27/using-big-data-for-social-good/)

## Explain why you found the project interesting. (8 points)

fill-in-your-answer
I think the idea of using actual data to predict whether philanthropic organizations will be effective might be useful for encouraging potential donors.

# GDELT (I) (5 points x 6 + 20 points x 3 = 90 points)

## Checkpoints

### 1 (5 points)

![image](image.png?raw=true)
![image](cp1.png)

### 2 What types of questions do you think this database could provide insight into? (5 points)

fill-in-your-answer

Some of the questions the DB could provide insight for:
- What are some areas that have higher than normal amounts of conflict.
- What are some of the most prevalent actors in the world.
- What are the realtions between individuals and different countries.
### 3 (5 points)

![image](image.png?raw=true)
![image](cp3.png)

### 4 (5 points)

![image](image.png?raw=true)
![image](cp4.png)

### 5a (5 points)

![image](image.png?raw=true)
![image](cp5.png)

### 5b Do you have any thoughts on why these events are missing geographic information? (5 points)

fill-in-your-answer
These events seem to be missing geographic data becasue of errors in the input system. With as much analysis as the GDELT project is doing, there is bound to be some mistakes occassionally with the parser or inputting into the database.

## Challenges

### 1 (20 points)
{{one-paragraph-discussion-of-your-answer}}
We can check to see if the climate change march had any impact by seeing if there is an uptick in the number of environmentals related events and by analyzing the general tone of the articles before and after the marches. This is indicitive of an increase in positive awareness. We can also check to see if there is a rise in climate change related events in other countries besides the US in the aftermath of the marches.

### 2 (20 points)

![image](image.png?raw=true)
![image](ch2.png)

### 3 (20 points)

![image](image.png?raw=true)
![image](ch3.png)

{{one-paragraph-discussion-of-your-answer}}
Even with a relatively small dataset of just 5 dates, we can see that there was an increase in awareness with Environemtal events after 9/23, and an increased positivity of the articles, as seen with the deeper green color. Unfortunately, this interest seemed to fizzle out by 10/5, with very few articles about environmental issues in any place outside of South America.
8 changes: 8 additions & 0 deletions appender.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
#!/bin/bash

cat 20140810.export.CSV >> 20140913.export.CSV
cat 20140913.export.CSV >> 20140923.export.CSV
cat 20140923.export.CSV >> 20140924.export.CSV
cat 20140924.export.CSV >> 20141005.export.CSV

cat header| cat - 20141005.export.CSV | sponge 20141005.export.CSV
Binary file added ch2.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added ch3.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added cp1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added cp3.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added cp4.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added cp5.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added full.tde
Binary file not shown.
3 changes: 3 additions & 0 deletions head.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
#replace the .csv with your file
#you need to brew install moreutils if you are on a mac
cat header| cat - 20141005.export.CSV | sponge 20141005.export.CSV
1 change: 1 addition & 0 deletions header
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
GLOBALEVENTID SQLDATE MonthYear Year FractionDate Actor1Code Actor1Name Actor1CountryCode Actor1KnownGroupCode Actor1EthnicCode Actor1Religion1Code Actor1Religion2Code Actor1Type1Code Actor1Type2Code Actor1Type3Code Actor2Code Actor2Name Actor2CountryCode Actor2KnownGroupCode Actor2EthnicCode Actor2Religion1Code Actor2Religion2Code Actor2Type1Code Actor2Type2Code Actor2Type3Code IsRootEvent EventCode EventBaseCode EventRootCode QuadClass GoldsteinScale NumMentions NumSources NumArticles AvgTone Actor1Geo_Type Actor1Geo_FullName Actor1Geo_CountryCode Actor1Geo_ADM1Code Actor1Geo_Lat Actor1Geo_Long Actor1Geo_FeatureID Actor2Geo_Type Actor2Geo_FullName Actor2Geo_CountryCode Actor2Geo_ADM1Code Actor2Geo_Lat Actor2Geo_Long Actor2Geo_FeatureID ActionGeo_Type ActionGeo_FullName ActionGeo_CountryCode ActionGeo_ADM1Code ActionGeo_Lat ActionGeo_Long ActionGeo_FeatureID DATEADDED SOURCEURL