Wrap a text node in to current or parent node #49

raylua2566 · 2017-07-04T07:55:02Z

Feature: Wrap a text node in to current or parent node
新特征: 封装一个纯文本到一个当前或者父节点里
TODO:  ... et. not handle yet, create a ElementNode function to
handle them?
TODO: 一些特殊字符还没有处理, 考虑创建一个ElementNode实例方法处理特殊字符?

Feature: Wrap a text node in to current or parent node 新特征: 封装一个纯文本到一个当前或者父节点里 TODO:  ... et. not handle yet, create a ElementNode function to handle them? TODO: 一些特殊字符还没有处理, 考虑创建一个ElementNode实例方法处理特殊字符?

msva · 2017-07-04T15:16:46Z

Could you, please:

describe the use-case for this?
provide a test code, that will show this usecase (and so fail on current master, but will work after your PR)?
strip Chinese comments (keeping only english ones)?

Well, 3rd one is pretty cosmetic and have no consequences on how library works,
but I'm asking about first two, because I am missing the end purpose (real-world example, when it would be useful) of that changes.

raylua2566 · 2017-07-04T18:47:30Z

Please forgive my grammar mistakes.

From README.md

Limitations

Textnodes are no separate tree elements; in local root = htmlparser.parse("<p>line1<br />line2</p>"), root.nodes[1]:getcontent() is "line1<br />line2", while root.nodes[1].nodes[1].name is "br"

Now

My PR will wrap the Text line1 and line2 into a node named text, after that while root.nodes[1].nodes[1].name is "text", and #root.nodes[1].nodes is 3

real-world example, when it would be useful

xpath has a func text() to get literals text . Do not you want it?
Get ordered tags within a a tag contains <text> and <img>.

Why test case failed?

Test case file "tst/init.lua" that function test_order() case will not true at line 292, because the text 1 performance for <text>1</text> implicitly, the same as texts 2 3 ...10. so the :not(n)'s result is 14 instead of 4

Discussion

the origin str contains <text>some text</text> ^_^

msva

ping?

msva · 2017-07-14T09:31:41Z

src/htmlparser.lua

+				textcontent = string.gsub(textcontent, "&nbsp;", '')
+				if textcontent ~= '' then
+					index = index + 1
+					local textTag = ElementNode:new(index, 'text', node, descend, textstart+1, textend)


I think, that it would be better to rename it to something that will not collide.

What if user will use parser on <text>moo</text>?

On the other hand, we already have _text. So, I guess, it can be something like _textonly, or something like that.

Although, maybe better way would be to move current _text to _content, and make this to be a _text, but it will brake the API 😿

Which variant is more acceptable for you to implement?

Which variant would be more acceptable for you to implement?

I think _content is suitable for _text.
_text is a subset of _content, what do you think?

msva · 2019-06-09T11:03:35Z

Ping?

Sorry for long disappearing, I was having a lot of personal issues :-/

Let's discuss further implementation of that idea?

msva · 2021-08-25T05:23:01Z

I'll close it for now, since it is incomptible with current code base now.

If you (or someby else) want to continue work on that - feel free to open new PR.

fix tests

0bcc11e

msva reviewed Jul 16, 2017

View reviewed changes

msva closed this Aug 25, 2021

remysucre mentioned this pull request Nov 20, 2025

Add text nodes #68

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Wrap a text node in to current or parent node #49

Wrap a text node in to current or parent node #49

Uh oh!

raylua2566 commented Jul 4, 2017 •

edited

Loading

Uh oh!

msva commented Jul 4, 2017 •

edited

Loading

Uh oh!

raylua2566 commented Jul 4, 2017 •

edited

Loading

Uh oh!

msva left a comment

Uh oh!

msva Jul 14, 2017 •

edited

Loading

Uh oh!

raylua2566 Jul 27, 2017

Uh oh!

msva commented Jun 9, 2019

Uh oh!

msva commented Aug 25, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Wrap a text node in to current or parent node #49

Wrap a text node in to current or parent node #49

Uh oh!

Conversation

raylua2566 commented Jul 4, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

msva commented Jul 4, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

raylua2566 commented Jul 4, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Limitations

Uh oh!

msva left a comment

Choose a reason for hiding this comment

Uh oh!

msva Jul 14, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

raylua2566 Jul 27, 2017

Choose a reason for hiding this comment

Uh oh!

msva commented Jun 9, 2019

Uh oh!

msva commented Aug 25, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

raylua2566 commented Jul 4, 2017 •

edited

Loading

msva commented Jul 4, 2017 •

edited

Loading

raylua2566 commented Jul 4, 2017 •

edited

Loading

msva Jul 14, 2017 •

edited

Loading