widont line break/newline behavior #38

ryneeverett · 2014-10-02T23:52:26Z

If a string ends with a <br> and a single word, widont does nothing:

>>> widont('blah<br>blah')
'blah<br>blah'

This makes sense to me. But if a string ends with a <br>\n and a single word, widont replaces the newline with a &nbsp;:

>>> widont('blah<br>\nblah')
'blah<br>&nbsp;blah'

This doesn't seem right. While the first would render:

blah
blah

the second would render with a leading non-breaking space on the second line:

blah
 blah

ryneeverett · 2014-10-25T20:18:56Z

>>> re.match(r'\s', '\n')
<_sre.SRE_Match object; span=(0, 1), match='\n'>
>>> re.match(r'\s', r'\n')
>>>

This result came as a surprise to me, but explains why widont has this behavior with newlines. But is this the desired behavior? That is, is the text passed in supposed to be escaped already?

I believe this would be the easiest way to get the correct behavior in the above example:

text = 'blah<br>\nblah'
text = text.encode('unicode-escape')  # b'blah<br>\\nblah'
text = text.decode('utf-8')  # 'blah<br>\\nblah'
text = widont(text)  # 'blah<br>\\nblah'
text = text.encode('utf-8')  # b'blah<br>\\nblah'
text = text.decode('unicode-escape')  # 'blah<br>\nblah'

It seems like it would be preferable for typogrify to deal with this, and I think it can be done without any encoding/decoding.
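
As a rough sketch of what "without any encoding/decoding" could look like: match only spaces and tabs instead of \s, so a real newline is never a candidate for replacement. The name widont_sketch and its pattern are hypothetical and heavily simplified (plain text only, no handling of inline or closing tags). This is not typogrify's actual regex, and not necessarily what the linked pull request does.

```python
import re

# Hypothetical, simplified stand-in for widont (NOT typogrify's real pattern).
# [ \t]+ matches horizontal whitespace only, so a newline between the last
# two words is left alone instead of being turned into &nbsp;.
_last_gap = re.compile(r"(\S)[ \t]+(\S+\s*)$")


def widont_sketch(text):
    """Join the last two words with &nbsp; unless a newline separates them."""
    return _last_gap.sub(r"\1&nbsp;\2", text)


print(widont_sketch("one two three"))
print(widont_sketch("blah<br>\nblah"))
```

With this pattern the string from the example above passes through unchanged, while an ordinary trailing space is still replaced.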

ryneeverett linked a pull request Oct 25, 2014 that will close this issue

Fix widont newline behavior. #39

Open
