/bin/bash: gsed: command not found #99

eliot-akira · 2022-12-23T15:39:56Z

When running the task build:wp, I'm seeing the following error.

/bin/bash: gsed: command not found
The command '/bin/bash -c echo '<!doctype html>' > wordpress-static/wp-includes/empty.html && gsed -E 's#srcDoc:"[^"]+"#src:"/wp-includes/empty.html"#g' -i wordpress-static/wp-includes/js/dist/block-editor.min.js &&     gsed -E 's#srcDoc:"[^"]+"#src:"/wp-includes/empty.html"#g' -i wordpress-static/wp-includes/js/dist/block-editor.js' returned a non-zero code: 127

It's coming from src/wordpress-playground/wordpress/Dockerfile.

RUN echo '<!doctype html>' > wordpress-static/wp-includes/empty.html &&  \
    gsed -E 's#srcDoc:"[^"]+"#src:"/wp-includes/empty.html"#g' -i wordpress-static/wp-includes/js/dist/block-editor.min.js && \
    gsed -E 's#srcDoc:"[^"]+"#src:"/wp-includes/empty.html"#g' -i wordpress-static/wp-includes/js/dist/block-editor.js

From a quick search, it seems gsed is GNU sed renamed by Homebrew on macOS. Inside the Docker container, I believe the above lines should be calling sed instead. If so, I'd be happy to make a little pull request.
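As a rough sketch of the idea, and not the actual patch, the substitution from the Dockerfile can be tried outside Docker with plain `sed`. The `SED` variable, the temporary directory, and the sample file contents below are made up for illustration. The script assumes that `sed` is GNU sed whenever `gsed` is not installed, which holds in typical Linux images but not on stock macOS.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: use GNU sed under whichever name it is installed.
if command -v gsed >/dev/null 2>&1; then
  SED=gsed   # Homebrew's name for GNU sed on macOS
else
  SED=sed    # assumed to be GNU sed, as in a typical Linux container
fi

# Made-up sample standing in for block-editor.js.
workdir="$(mktemp -d)"
printf '%s\n' 'a={srcDoc:"<!doctype html>",title:"x"}' > "$workdir/block-editor.js"

# Same expression as in the Dockerfile: swap the inline srcDoc for a src URL.
"$SED" -E -i 's#srcDoc:"[^"]+"#src:"/wp-includes/empty.html"#g' "$workdir/block-editor.js"

result="$(cat "$workdir/block-editor.js")"
echo "$result"
rm -rf "$workdir"
```

Inside the Dockerfile itself, the equivalent change would simply be to call `sed` where `gsed` appears in the `RUN` step.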


adamziel · 2022-12-23T15:57:34Z

You're exactly right. I'm not sure how or why it worked on my end. I'd really appreciate your pull request!

eliot-akira · 2022-12-30T15:10:35Z

In case you haven't seen it, pull request #100 resolves this issue.

Resolves #99

Prototypes a `wp_rewrite_urls()` URL rewriter for block markup to migrate the content from, say, `<a href="https://adamadam.blog">` to `<a href="https://adamziel.com/blog">`.

* URL rewriting works to perhaps the greatest extent it ever did in WordPress migrations.
* The URL parser requires PHP 8.1. This is fine for some Playground applications, but we'll need PHP 7.2+ compatibility to get it into WordPress core.
* This PR features `WP_HTML_Tag_Processor` and `WP_HTML_Processor` to enable usage outside of WordPress core.

### Details

This PR consists of code ported from https://github.com/adamziel/site-transfer-protocol. It uses a cascade of parsers to pierce through the structured data in a WordPress post and replace the URLs matching the requested domain.

The data flow is as follows:

Parse HTML -> Parse block comments -> Parse attributes JSON -> Parse URLs

On a high level, this parsing cascade is handled by the `WP_Block_Markup_Url_Processor` class:

```php
$p = new WP_Block_Markup_Url_Processor( $block_markup, $base_url );
while ( $p->next_url() ) {
    $parsed_matched_url = $p->get_parsed_url();
    // .. do processing
    $p->set_raw_url($new_raw_url);
}
```

Getting more into details, the `WP_Block_Markup_Url_Processor` extends the `WP_HTML_Tag_Processor` class and walks the block markup token by token. It then drills down into:

* Text nodes – where it matches URLs using regexps. This part can be improved to avoid regular expressions.
* Block comments – where it parses the block attributes and iterates through them, looking for ones that contain valid URLs
* HTML tag attributes – where it looks for ones that are reserved for URLs (such as `<a href="">`), looking for ones that contain valid URLs

The `next_url()` method moves through the stream of tokens, looking for the next match in one of the above contexts, and the `set_raw_url()` method knows how to update each node type, e.g. block attribute updates are `json_encode()`-d.

### Processing tricky inputs

When this code is fed into the migrator:

```html
🚀-science.com/science has the best scientific articles on the internet! We're also
available via the punycode URL:

https://xn---science-7f85g.com/%73%63ience/.

This isn't migrated: https://🚀-science.comcast/science
Or this: super-🚀-science.com/science

<img src="https://xn---science-7f85g.com/science/wp-content/image.png">
```

This actual output is produced:

```html
science.wordpress.com has the best scientific articles on the internet! We're also
available via the punycode URL:

https://science.wordpress.com/.

This isn't migrated: https://🚀-science.comcast/science
Or this: super-🚀-science.com/science

<img src="https://science.wordpress.com/wp-content/image.png">
```

## Remaining work

- [x] Add PHPCBF
- [x] Get to zero CBF errors
- [x] Get the unit tests to run in CI (e.g. run `composer install`)
- [x] Add relevant unit tests coverage

## Follow-up work

- [x] Patch `WP_HTML_Tag_Processor` in WordPress core, see WordPress/wordpress-develop#7007 (comment)
- [ ] Package our copy of `WP_HTML_Tag_Processor` as a "WordPress polyfill" for standalone usage.
- [ ] Make it compatible with PHP 7.2+

## Testing Instructions (or ideally a Blueprint)

CI runs the PHP unit tests. To run this on your local machine, do this:

```sh
cd packages/playground/data-liberation
composer install
cd ../../../
nx test:watch playground-data-liberation
```

eliot-akira mentioned this issue Dec 23, 2022

Use sed instead of gsed #100

Merged

adamziel closed this as completed in #100 Dec 30, 2022

adamziel pushed a commit that referenced this issue Dec 30, 2022

Use sed instead of gsed (#100)

70cc160

Resolves #99
