Scraping comprehensive tweets #113

bigleeth · 2023-10-26T14:53:10Z

Hi,

I am trying to collect whole tweets metioning "starbucks" in 2019. Although I have changed page number from pages=10 to pages= 30, the total number of rows from the outcome is limited approximately 86 all the time regardless of the page constraints. The number of tweets is expected to be around 50,000, if the process had worked right.

How can I scrapping whole tweets for the time horizon without missing texts?

`
from tweety import Twitter
import pandas as pd

app = Twitter("session")
app.start()

all_tweets = app.search("(Starbucks) lang:en until:2019-01-11 since:2019-01-01 -filter:links -filter:replies", pages=30, wait_time=2)

df_tweets = pd.DataFrame(columns=["Date","Text", "Author","Likes", "Retweets"])

for tweet in all_tweets:
new_row = pd.DataFrame({
"Date": [tweet.date],
"Text": [tweet.text],
"Author": [tweet.author.username],
"Likes": [tweet.likes],
"Retweets": [tweet.retweet_counts]
})
df_tweets = pd.concat([df_tweets, new_row], ignore_index=True)

print(f"Total rows in the DataFrame: {df_tweets.shape[0]}")

df_tweets.to_csv('tweets_data.csv', index=False)

print(df_tweets)
`

mahrtayyab · 2023-10-29T07:50:56Z

Searching on Twitter Web gives same number

bigleeth · 2023-10-29T11:25:42Z

get_next_page() does not work. Would you sugguest correct approach to scarp entire pages?

bigleeth · 2023-11-01T12:45:12Z

The objective is to scrap whole posts about Starbucks in the year of 2019. Can I apply get_next_page() for the syntax?

mahrtayyab · 2023-11-02T06:31:09Z

better approach would be to use iter_search , with significant wait_time

mahrtayyab · 2023-11-02T06:32:19Z

get_next_page will return the List of Tweet

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scraping comprehensive tweets #113

Scraping comprehensive tweets #113

bigleeth commented Oct 26, 2023 •

edited

Loading

mahrtayyab commented Oct 29, 2023

bigleeth commented Oct 29, 2023

bigleeth commented Nov 1, 2023 •

edited

Loading

mahrtayyab commented Nov 2, 2023

mahrtayyab commented Nov 2, 2023

Scraping comprehensive tweets #113

Scraping comprehensive tweets #113

Comments

bigleeth commented Oct 26, 2023 • edited Loading

mahrtayyab commented Oct 29, 2023

bigleeth commented Oct 29, 2023

bigleeth commented Nov 1, 2023 • edited Loading

mahrtayyab commented Nov 2, 2023

mahrtayyab commented Nov 2, 2023

bigleeth commented Oct 26, 2023 •

edited

Loading

bigleeth commented Nov 1, 2023 •

edited

Loading