
feedparser-rs handles entities in GUIDs differently from feedparser-py #59

@fazalmajid


Bug Description

When parsing the Sidequested RSS feed (https://sidequested.com/feed/), I noticed that feedparser-py and feedparser-rs return different GUIDs for the same item. My feed reader therefore treats already-seen items as new and shows duplicate posts. The root cause is that the feedparser-rs code does not decode the entities emitted by quick-xml, so the &#038; in the GUID is dropped instead of becoming &:

  • feedparser-py: https://sidequested.com/?post_type=webcomic1&p=3172
  • feedparser-rs: https://sidequested.com/?post_type=webcomic1p=3172
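
The two results differ only in how the numeric character reference &#038; is treated. As a minimal sketch using only Python's standard library (this is not code from either parser), the snippet below contrasts decoding the reference with discarding it. The variable names are illustrative, and the `dropped` line only imitates the observed feedparser-rs output; it is an assumption about the symptom, not the library's actual code path.

```python
import html
import re

# Raw GUID text exactly as it appears in the feed sample.
raw_guid = "https://sidequested.com/?post_type=webcomic1&#038;p=3172"

# Decoding the reference: &#038; is decimal code point 38, i.e. "&".
decoded = html.unescape(raw_guid)

# Imitating the observed symptom: the reference is discarded, not decoded.
dropped = re.sub(r"&#\d+;", "", raw_guid)

print(decoded)  # matches the feedparser-py result
print(dropped)  # matches the feedparser-rs result
```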

Steps to Reproduce

fptest.py

Expected Behavior

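Same behavior as feedparser-py: the &#038; in the GUID should be decoded to &, giving https://sidequested.com/?post_type=webcomic1&p=3172.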

Feed Sample

<?xml version="1.0"?>
    <rss version="2.0">
        <channel>
            <item>
                <guid isPermaLink="false">https://sidequested.com/?post_type=webcomic1&#038;p=3172</guid>
            </item>
        </channel>
    </rss>
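
As an independent check of what a conforming XML parser returns for this sample, here is a short sketch using Python's built-in `xml.etree.ElementTree`. It does not involve feedparser-py or feedparser-rs; `FEED_SAMPLE` is just the sample above with its indentation normalized.

```python
import xml.etree.ElementTree as ET

# The feed sample from this report, with indentation normalized.
FEED_SAMPLE = """<?xml version="1.0"?>
<rss version="2.0">
    <channel>
        <item>
            <guid isPermaLink="false">https://sidequested.com/?post_type=webcomic1&#038;p=3172</guid>
        </item>
    </channel>
</rss>
"""

root = ET.fromstring(FEED_SAMPLE)

# The XML parser resolves the character reference while reading text content.
guid = root.findtext("channel/item/guid")
print(guid)  # https://sidequested.com/?post_type=webcomic1&p=3172
```

The decoded value contains a literal & between the two query parameters, which is the GUID feedparser-py reports.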

Platform

Python (feedparser-rs-py)

Version

0.4.3

Environment

  • OS: Ubuntu Linux 25.10
  • Rust: 1.93.1
  • Python: 3.12

Additional Context

I am preparing a PR with the fix.


Labels: bug
