Refactor FuncLexr to use iterator functions #39

johnhuichen · 2023-10-23T15:07:28Z

I am making a minor change to crates/vimfuncs/build.rs.

It doesn't impact the functionality but use more idiomatic iterator. I also made some changes to keep abstraction level consistent. Please let me know if that's too many changes.

Changes:

FuncLexer only has one field chars which is an iterator
FuncToken has a new special token Skip
FuncLexer::lex function delegates processing of characters to other struct methods
Removed enum methods for FuncToken (string, number...) and instead use pattern matching to get enum values

Testing:

cargo run test is green
I used data/global_functions.txt to generate a list of tokens using the original script and the refactored script. They produced identifical content

crates/vimfuncs/build.rs

tjdevries · 2023-10-25T02:43:33Z

crates/vimfuncs/build.rs

-                None => break,
+        while let Some(ch) = self.chars.next() {
+            let tok = match ch {
+                '{' => self.process_left_brace(),


I don't think there is any particular reason we need to make these all functions if they just return a value, i think it muddies the code

It's true that these functions just return a value.

I did it for consistency and decoupling. If the logic of process_left_brace should change in the future, it's easier for another contributor to say with confidence that the logic is probably contained in the struct method.

Of course it's debatable that inlining the logic is also obvious that the logic is contained there.

I can revert back if you feel strongly about it

tjdevries · 2023-10-25T02:44:41Z

crates/vimfuncs/build.rs

-        if !self.ch().is_whitespace() {
-            return;
-        }
+    fn skip_whitespaces(&mut self) {


skip_whitespace is a better name

Ok I will revert back

tjdevries · 2023-10-25T02:45:59Z

crates/vimfuncs/build.rs

-        }
-    }
-
-    fn identifier(self) -> String {


i don't get why you removed these

because these functions used on incorrect types (e.g. number on FuncToken::String) will throw an error, and it's not really unreachable

I replaced them with pattern matching the enum because I think it's more idiomatic.

Let me know what you think. I can revert back if needed

tjdevries · 2023-10-25T02:46:58Z

There's kind of a lot of extra changes in the PR. It's not a huge deal but in general it's nice if the changes can be made smaller so as to be more easily reviewed (for example, some of the name changes and things I don't really like as much as the names that were there before)

johnhuichen · 2023-10-25T13:44:56Z

There's kind of a lot of extra changes in the PR. It's not a huge deal but in general it's nice if the changes can be made smaller so as to be more easily reviewed (for example, some of the name changes and things I don't really like as much as the names that were there before)

100% agreed. I started making too many changes as part of refactoring.

johnhuichen · 2023-10-27T17:57:28Z

I cleaned the changes up by

renaming skip_whitespaces to skip_whitespace
revert deletion of FuncToken methods
revert changes to anything outside FuncLexer

I made a new change so that now lex method iterates through characters by peek() instead of next(). See L64. I think this would make the design more flexible to change in the future. Let me know what you think

@tjdevries

johnhuichen · 2023-10-27T17:58:24Z

crates/vimfuncs/build.rs

-        let ch = self.chars.get(self.position).cloned();
-        self.position += 1;
+    fn process_left_brace(&mut self) -> FuncToken {
+        self.chars.next();


making it explicit that the iterator advances by one step when creating a token

johnhuichen commented Oct 23, 2023

View reviewed changes

crates/vimfuncs/build.rs Show resolved Hide resolved

Refactor FuncLexr to use iterator functions

a3c6d26

tjdevries reviewed Oct 25, 2023

View reviewed changes

johnhuichen added 2 commits October 27, 2023 13:34

Revert changes including deleted methods and method name changes

baeea04

Loop using peek instead of next

72b69ed

johnhuichen commented Oct 27, 2023

View reviewed changes

Rename process_alphabets to process_identifier

cdd9638

johnhuichen closed this by deleting the head repository Jan 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor FuncLexr to use iterator functions #39

Refactor FuncLexr to use iterator functions #39

johnhuichen commented Oct 23, 2023 •

edited

Loading

tjdevries Oct 25, 2023

johnhuichen Oct 25, 2023

tjdevries Oct 25, 2023

johnhuichen Oct 25, 2023

tjdevries Oct 25, 2023

johnhuichen Oct 25, 2023

tjdevries commented Oct 25, 2023

johnhuichen commented Oct 25, 2023

johnhuichen commented Oct 27, 2023

johnhuichen Oct 27, 2023

Refactor FuncLexr to use iterator functions #39

Refactor FuncLexr to use iterator functions #39

Conversation

johnhuichen commented Oct 23, 2023 • edited Loading

tjdevries Oct 25, 2023

Choose a reason for hiding this comment

johnhuichen Oct 25, 2023

Choose a reason for hiding this comment

tjdevries Oct 25, 2023

Choose a reason for hiding this comment

johnhuichen Oct 25, 2023

Choose a reason for hiding this comment

tjdevries Oct 25, 2023

Choose a reason for hiding this comment

johnhuichen Oct 25, 2023

Choose a reason for hiding this comment

tjdevries commented Oct 25, 2023

johnhuichen commented Oct 25, 2023

johnhuichen commented Oct 27, 2023

johnhuichen Oct 27, 2023

Choose a reason for hiding this comment

johnhuichen commented Oct 23, 2023 •

edited

Loading