readability drops some <h5> content #65

clockfly · 2024-07-07T09:58:05Z

link:

https://webscraping.ai/faq/colly/how-do-i-handle-redirects-in-colly

after parsing

...

 adjusting Colly's redirect handling settings, you can ensure that your web scraper behaves exactly as needed when encountering redirects during the scraping process.

            Related Questions
  !!!!!!!! HERE SOME h5 part text is missing !!!!!!!!

missed part:

It seems the h5 tags are not kept after readability parsing.

<h2 class="mt-5">Related Questions</h2>

      <div class="card mb-3">
        <div class="card-body">
          <h5 class="card-title mb-0"><a href="/faq/colly/how-can-i-integrate-colly-with-a-database-to-store-scraped-data">How can I integrate Colly with a database to store scraped data?</a></h5>
        </div>
      </div>
      <div class="card mb-3">
        <div class="card-body">
          <h5 class="card-title mb-0"><a href="/faq/colly/is-it-possible-to-scrape-images-or-files-with-colly">Is it possible to scrape images or files with Colly?</a></h5>
        </div>
      </div>
      <div class="card mb-3">
        <div class="card-body">
          <h5 class="card-title mb-0"><a href="/faq/colly/how-do-i-use-colly-s-callback-functions-effectively">How do I use Colly&#39;s callback functions effectively?</a></h5>
        </div>
      </div>
</div>

expected behavior

The h5 tags and their link text should be kept in the parsed output.
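
A rough way to check this, as a sketch only: the snippet below uses Python's standard `html.parser` to collect the `<h5>` titles from the original HTML and reports which of them are absent from the parsed text. It does not call the readability library itself; `h5_titles` and `missing_titles` are hypothetical helper names, and `parsed_text` is assumed to be the plain-text output of whatever parser is being tested.

```python
from html.parser import HTMLParser


class H5Collector(HTMLParser):
    """Collect the text content of every <h5> element."""

    def __init__(self):
        super().__init__()  # convert_charrefs=True, so &#39; becomes '
        self._depth = 0     # > 0 while inside an <h5>
        self._buf = []
        self.titles = []

    def handle_starttag(self, tag, attrs):
        if tag == "h5":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "h5" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                self.titles.append("".join(self._buf).strip())
                self._buf = []

    def handle_data(self, data):
        if self._depth:
            self._buf.append(data)


def h5_titles(html):
    """Return the text of each <h5> in the source HTML (hypothetical helper)."""
    collector = H5Collector()
    collector.feed(html)
    collector.close()
    return collector.titles


def missing_titles(source_html, parsed_text):
    """Return the <h5> titles that do not appear in the parsed output."""
    return [t for t in h5_titles(source_html) if t not in parsed_text]
```

For the page above, `missing_titles(page_html, parsed_text)` would be expected to return an empty list once the h5 content is kept.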
