This repository has been archived by the owner on Mar 30, 2023. It is now read-only.

No more data! Scraping will stop now. - Cannot fetch more tweets using twint #1295

Open
Logenleedev opened this issue Oct 31, 2021 · 17 comments

Comments

@Logenleedev

Logenleedev commented Oct 31, 2021

Make sure you've checked the following:

  • [] Python version is 3.6;
  • [] Updated Twint with pip3 install --user --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint;
  • [] I have searched the issues and there are no duplicates of this issue/question/request.

Command Ran

Please provide the exact command ran including the username/search/code so I may reproduce the issue.

In jupyter notebook, I enter the following config code:

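import twint

c = twint.Config()
c.Limit = 3200              # requested number of tweets
c.Username = "jerallaire"   # account to fetch tweets from
c.Pandas = True             # keep results in twint's pandas storage
c.Retweets = True
c.Output = "Tweets.csv"     # output file path

twint.run.Search(c)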

Description of Issue

Please use as much detail as possible.

According to the official twint documentation, it should not be hard to fetch the most recent 3200 tweets from a specific user. However, after I run the config code above, I only get the most recent 20-60 tweets. Something must be wrong, and I hope to get some help.

Environment Details

Using Windows, Linux? What OS version? Running this in Anaconda? Jupyter Notebook? Terminal?

Mac. I am running this code on Jupyter Notebook

@dominikbenk

Same issue.

@christineeeeee

Same issue here. I could only fetch ~90 tweets containing "apple" and "$aapl" for one day, which cannot be the full set...

@Dentordev

Have we tried Selenium or requests-html? They can scroll down the page and render more of the content that is hidden.

@johnguirgis

I am also facing the same issue.

@iamthelabhras

iamthelabhras commented Nov 23, 2021

Same issue here. Running Twint on MacOS (Big Sur 11.4) from the command line.

@gansanay

Same issue here with Python 3.9.6 on Windows 10 with roughly the same code (get all tweets for one username). Having just run pip3 install --user --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint, things got even worse.

@MarionJacquet

Same issue here, using Python 3.10.0 on MacOS (Big Sur 11.6) from the command line.
Twint only retrieves the last ≈2050 tweets, published in the past 10 days. I modified url.py (lines 70 and 92) according to the instructions, but the problem remains.

I also tried fetching tweets from 2018 that I know exist: Twint immediately returns the "no more data" message.

@richardpickett

+1

Ubuntu, Python 3.8.10. Both pip3 install --user --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint and a normal checkout + install always get about 200 tweets.

I've tried --resume, doesn't matter.

cmd I'm running: twint -u balajis -o t.csv --csv --count
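
To check how many tweets a run actually saved and which dates they cover, the CSV output can be summarized with a few lines of Python. This is only a minimal sketch and not part of twint: the `summarize_csv` helper is hypothetical, and it assumes the file has a `date` column whose values start with `YYYY-MM-DD`, which may differ between twint versions.

```python
import csv
from datetime import datetime


def summarize_csv(path, date_column="date", delimiter=","):
    """Return (row_count, earliest_date, latest_date) for a tweets CSV.

    Assumes `date_column` values start with YYYY-MM-DD; adjust the column
    name or delimiter if the file uses different ones.
    """
    count = 0
    earliest = latest = None
    with open(path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f, delimiter=delimiter):
            count += 1
            # Keep only the leading YYYY-MM-DD part in case a time follows.
            day = datetime.strptime(row[date_column][:10], "%Y-%m-%d").date()
            if earliest is None or day < earliest:
                earliest = day
            if latest is None or day > latest:
                latest = day
    return count, earliest, latest
```

Calling `summarize_csv("t.csv")` returns the row count together with the earliest and latest dates, which makes it easy to compare runs. An empty file yields `(0, None, None)`.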

@19barsav

Seems like it's only pulling the last 7 days for me, so I believe I have the same issue.

@CharlesXu123

Same issue here. I checked out the latest version and it only gives 20-80 tweets when running on Linux.

@queerterpreter

I was having the same issue, where a search with the exact same parameters would pull a random number of tweets between 0 and 60 (same top tweets each time, so what was random was where it stopped).

As per #1319, I deleted the hashtag in line 92 of url.py and it now works with no problem.
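
Line numbers in url.py may not match between installed versions, so before editing it can help to print the lines in question from the copy Python actually imports. The following is a minimal sketch under stated assumptions: `find_module_file` and `show_lines` are hypothetical helpers, not part of twint, and the module name `twint.url` is assumed from the file name mentioned above.

```python
import importlib.util


def find_module_file(module_name):
    """Return the source path of an importable module, or None if not found."""
    try:
        spec = importlib.util.find_spec(module_name)
    except ImportError:
        # Raised when the parent package (e.g. "twint") cannot be imported.
        return None
    return spec.origin if spec is not None else None


def show_lines(path, line_numbers):
    """Return {line_number: text} for the requested 1-based line numbers."""
    wanted = set(line_numbers)
    found = {}
    with open(path, encoding="utf-8") as f:
        for number, text in enumerate(f, start=1):
            if number in wanted:
                found[number] = text.rstrip("\n")
    return found
```

For example, `show_lines(find_module_file("twint.url"), [92])` would return the text of line 92 in the installed file, provided twint is importable in the current environment. Line numbers that do not exist in the file are simply left out of the result.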

@2spoopy4me

I'm only getting a limited number of tweets as well... any ideas?

@weirdyang

#1266 (comment)

@eminekahveci

eminekahveci commented Jan 27, 2022

Friends, I changed the typed_query on lines 135 and 136 in url.py and twint started to work, but only the last week's data comes back. How can we remove this limit? How can I get more history and more

@eminekahveci

[image]

@sarahdewolf

Did anyone find a solution or workaround?

@2spoopy4me

I'd like to spend some time doing a deep dive on the code to figure out why it's not working. I don't have much info on how twint works; maybe, given the way it interacts with Twitter, Twitter has limited its functionality so twint can't work properly?
