-
Notifications
You must be signed in to change notification settings - Fork 16
kyx0r/nextvi
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Nextvi(1) General Commands Manual Nextvi(1) NAME Nextvi - A small vi/ex editor for editing bidirectional UTF-8 text SYNOPSIS vi [-emsv] [file ...] DESCRIPTION Nextvi is a vi(1)-like text editor based on Ali Gholami Rudi's Neatvi(1). VI is a legendary command-line text editor, first introduced by Bill Joy in 1976, renowned for its modal interface, which allows users to switch between insert, command, and visual modes for seamless text manipulation. This powerful and efficient editor has stood the test of time, remaining a staple tool for developers and system administrators in Unix and Linux environments, thanks to its flexibility, customization options, and syntax similar to other vi/ex editors. To mark its 50th anniversary, Nextvi emerges as the pinnacle of VI's evolution. This masterpiece of efficiency and elegance boasts unparalleled startup speed, unmatched portability, exceptional hackability, and an incredibly robust macro system, among other features. Nextvi is truly the next generation of VI, ready to elevate your text editing experience. OPTIONS -e Enter Ex mode on startup -m Disable initial file read message -s Enter raw Ex mode on startup -v Enter visual mode on startup (Default) MANPAGE NOTATION <x> A closure where x represents character literal [x] A closure where x represents optional argument {x} A closure where x represents required argument "x" A closure where x represents a string <^X> Represents a ctrl key X # Represents a positive number in a closure * Represents any character(s) in a closure < > Separates alternatives in a closure x-x Range from x to x VI NORMAL [#]j Move # lines down [#]k Move # lines up [#]+ [#]<^M> [#]<Newline> Move # lines down, cursor after indent [#]- Move # lines up, cursor after indent [#]h Move # cols left [#]l Move # cols right f{arg} Move to arg character found forward F{arg} Move to arg character found backward t{arg} Move until arg character found forward T{arg} Move until arg character found backward [#], Repeat last <f F t T> move backward # times [#]; Repeat last <f F t T> move forward # times [#]E Move to end of word # times, skip punctuation [#]e Move to end of word # times [#]B Move to start of word backward # times, skip punctuation [#]b Move to start of word backward # times [#]W Move to start of word forward # times, skip punctuation [#]w Move to start of word forward # times vw Toggle line mode for <E e B b W w> { Move to next <{> section down } Move to next <{> section up [ Move to next <Newline> section down ] Move to next <Newline> section up ^ Move to start of line after indent 0 Move to start of line $ Move to end of line [#]| Goto # col [#]<Space> Move # characters forward [#]<^H> [#]<Backspace> Move # characters backward % Move to closest <] ) }><[ ( {> pair {#}% Move to # percent line number '{arg} Move to a line mark arg `{arg} Move to a line mark arg with cursor position gg Goto first line in buffer [#]G Move to last line in buffer or # line H Move to highest line on a screen L Move to lowest line on a screen M Move to middle line on a screen [#]z. Center screen at cursor. # is xtop [#]z<^M> [#]z<Newline> Center screen at top row. # is xtop [#]z- Center screen at bottom row. # is xtop [#]<^E> Scroll down 1 or # lines, retain # and cursor position [#]<^Y> Scroll up 1 or # lines, retain # and cursor position [#]<^D> Scroll down half a screen size. If [#], set scroll to # lines [#]<^U> Scroll up half a screen size. If [#], set scroll to # lines <^B> Scroll up full screen size <^F> Scroll down full screen size # Show global and relative line numbers 2# Toggle show global line numbers permanently 4# Toggle show relative line numbers after indent permanently 8# Toggle show relative line numbers permanently V Toggle show hidden characters:<Space Tab Newline> <^C> Toggle show line motion numbers for <h l e b E B w W> {1-5}<^C> Switch to line motion number mode # <^V> Loop through line motion number modes [#]<^R> Redo # times [#]u Undo # times <^I> <Tab> Open file path from cursor to end of line <^K> Write current buffer to file. Force write on 2nd attempt [#]<^W>{arg} Unindent arg region # times [#]<{arg} Indent left arg region # times [#]>{arg} Indent right arg region # times "{arg}{arg1} Operate on arg register according to arg1 motion R Print registers and their contents [#]&{arg} Execute arg register macro in non-blocking mode # times [#]@{arg} Execute arg register macro in blocking mode # times [#]@@ [#]&& Execute a last executed register macro # times [#]. Repeat last normal command # times [#]v. Repeat last normal command moving down across # lines : Enter ex prompt [#]!{arg} Enter pipe ex prompt based on region # or arg vv Enter ex prompt with the last line from history buffer b-1 [#]vr Enter %s/ ex prompt. Insert # words from cursor [#]vt[#arg] Enter .,.+0s/ ex prompt. Insert # of lines from cursor. Insert #arg words from cursor [#]v/ Enter v/ xkwd ex prompt to set search keyword. Insert # words from cursor v; Enter ! ex prompt [#]vi Enter %s/ ex prompt. Contains regex for changing spaces to tabs. # modifies tab width [#]vI Enter %s/ ex prompt. Contains regex for changing tabs to spaces. # modifies tab width vo Remove trailing white spaces and <\r> line endings va Toggle ai ex option <^G> Print buffer status infos 1<^G> Enable permanent status bar row 2<^G> Disable permanent status bar row ga Print character info 1ga Enable permanent character info bar row 2ga Disable permanent character info bar row [#]gw Hard line wrap a line to # col limit. Default: 80 [#]gq Hard line wrap a buffer to # col limit. Default: 80 [#]g~{arg} Switch character case for arg region # times [#]gu{arg} Switch arg region to lowercase # times [#]gU{arg} Switch arg region to uppercase # times [#]~ Switch character case # times forward i Enter insert mode I Enter insert mode at start of line after indent A Enter insert mode at end of line a Enter insert mode 1 character forward [#]s Enter insert mode and delete # characters S Enter insert mode and delete all characters o Enter insert mode and create a new line down O Enter insert mode and create a new line up [#]c{arg} Enter insert mode and delete arg region # times C Enter insert mode and delete from cursor to end of line [#]d{arg} Delete arg region # times D Delete from cursor to end of line [#]x Delete # characters from cursor forward [#]X Delete # characters from cursor backward di{arg} Delete around arg which can be <( ) "> ci{arg} Change around arg which can be <( ) "> [#]r{arg} Replace # characters with arg from cursor forward K Split a line {#}K Split a line without creating blank <Newline> [#]J Join # lines vj Toggle space padding when joining lines [#]y{arg} Yank arg region # times [#]Y Yank # lines [#]p Paste default register # times [#]P Paste default register below current line or behind cursor position # times m{arg} Set buffer local line mark arg <^T> Set global line mark 0. Global marks are always valid {0 2 4 6 8}<^T> Set a global line mark # {1 3 5 7 9}<^T> Switch to a global line mark # [#]<^7>{0-9} [#]<^_>{0-9} Show buffer list and switch based on # or 0-9 index when prompted <^^> <^6> Swap to previous buffer [#]<^N> Swap to next buffer, # changes direction [forward backward] \ Swap to /fm/ buffer b-2 {#}\ Swap from /fm/ buffer b-2 and backfill directory listing vb Recurse into b-1 history buffer. Insert current line into ex prompt after a quit command z1 Set alternative keymap to Farsi keymap z2 Set alternative keymap to Russian keymap ze Switch to English keymap zf Switch to alternative keymap zL Set td ex option to 2 zl Set td ex option to 1 zr Set td ex option to -1 zR Set td ex option to -2 [#]/ Regex search, move down 1 or # matches [#]? Regex search, move up 1 or # matches [#]n [#]N Repeat regex search, move [down up] 1 or # matches <^A> Regex search 1 word from cursor, no center, wraparound move [up down] {#}<^A> Regex search, set keyword to # words from cursor <^]> Filesystem search forward based on directory listing in b-2 {#}<^]> Filesystem search forward, set keyword to # words from cursor <^P> Filesystem search backward based on directory listing in b-2 {#}<^P> Filesystem search backward, set keyword to # words from cursor <^Z> Suspend vi <^L> Force redraw whole screen and update terminal dimensions qq Force quit and clean terminal zz Force quit, submit a command when recursive ZZ Soft quit, write to a file before exit VI REGIONS Regions are vi normal commands that define [h v]range for vi motions. Commands described with the word "move" define a region. <+ j ^M Newline - k h l f F t T , ; B E b e W w { } [ ] ^ 0 $ Space ^H Backspace % ' ` G H L M / ? n N ^A> All regions VI MOTIONS Motions are vi normal commands that run in a [h v]range. Commands described with the word "region" consume a region. <^W < > ! c d y "> g~ gu gU All motions <"> Special motions that consume a motion dd yy cc g~~ guu gUU >> << <^W><^W> !! Special motions that can use [#] as number of lines Examples: 3d/int Delete text until the 3rd instance of "int" keyword d3w Delete 3 words "ayl Yank a character into <a> register "Ayw Append a word to <a> register VI/EX INSERT <^H> <Backspace> Delete a character <^U> Delete util <^X> mark or everything <^W> Delete a word <^T> Increase indent <^D> Decrease indent <^]> Switch default paste register to registers 0-9 <^\>{arg} Select paste register arg. <^\> selects default register <^P> Paste a register <^X> Mark autocomplete and <^U> starting position. <^X> resets the mark <^G> Index buffer for autocomplete <^Y> Reset autocomplete data <^R> Loop through autocomplete options backward <^N> Loop through autocomplete options forward <^B> Print autocomplete options when in vi insert <^B> Recurse into b-1 history buffer when in ex prompt. Insert current line into ex prompt after a quit command <^A> Loop through lines in a history buffer b-1 <^Z> Suspend vi/ex <^L> Redraw screen in vi mode, clean terminal in ex <^O> Switch between vi and ex modes recursively <^E> Switch to english keymap <^F> Switch to alternative keymap <^V>{arg} Read a literal character arg <^K>{arg} Read a digraph sequence arg <^C> <ESC> Exit insert mode EX Ex is a powerful line editor for Unix systems, initially developed by Bill Joy in 1976. This essential tool serves as the backbone of vi, enabling it to execute commands, macros and even transform into a purely command-line interface (CLI) when desired. In Ex, each command is initiated and separated by <:> prefix. EX EXPANSION Characters <# %> in ex prompt substitute the buffer pathname. <%> substitutes current buffer and <#> last swapped buffer. It is possible to expand any arbitrary buffer by using <# %> followed by the buffer number. Example: print the pathname for buffer 69. :!echo "%69" Every ex command is be able to receive data from external process through a special expansion character <!> which pipes the data into the command itself. If the closing <!> is not specified, the end of the line becomes a terminator. Example: Substitute the value of env var $SECRET to the value of $RANDOM. In this demo, we set the value of SECRET to "int" ourselves. :%s/!export SECRET="int" && printf "%s" $SECRET!/!printf "%s" $RANDOM! EX ESCAPES Special characters will become regular when escaped with <\>. <( ^ ! = [ ] \> Special characters in regex "[]" bracket exp <( ) { } + * ? ^ $ [ ] | \< \> . \> Special characters in regex <# % ! :> Special characters in ex EX RANGES Some ex commands can be prefixed with ranges. Examples: . current position , vertical range separator ; horizontal range separator :1,5p print lines 1,5 :.-5,.+5p print 5 lines around xrow :/int/p print first occurrence of int :?int?p print first occurrence of int in reverse :.,/int/p print until int is found :?int?,.p print until int is found in reverse :'d,'ap print lines from mark <d> to mark <a> :%p print all lines in the buffer :$p print last line in the buffer :;50 goto character offset 50 :10;50 goto line 10 character offset 50 :10;.+5 goto line 10 +5 character offset :'a;'a goto line mark <a> offset mark <a> :;$ goto end of the line :5;/int/ search for int on line 5 :.;?int? search for int in reverse on the current line EX COMMANDS [range]f{/?}[kwd] Ranged search (stands for find) Example: no range given, current line only :f/int Example: reverse :f?int Example: range given :10,100f/int Subsequent commands within the range will move to the next match just like vi n/N commands. b[#] Print currently active buffers state or switch to a buffer Example: switch to the 5th buffer :b5 There are 2 temporary buffers which are separate from the main buffers. b-1 = /hist/ ex history buffer b-2 = /fm/ directory listing buffer Example: switch to the b-1 buffer :b-1 Example: switch to the b-2 buffer :b-2 bp [path] Set current buffer path bs[#] Set current buffer saved. If any arg given, reset undo/redo history [range]p Print line(s) from the buffer Example: utilize character offset ranges :1,10;5;5p Example: print current line from offset 5 to 10 :.;5;10p ea [kwd] [#] Open file based on filename substring from dir listing in b-2 Requires directory listing in b-2 backfilled prior. Example: backfill b-2 using :fd :fd Example: backfill b-2 using find :b-2:1,$!find . If the substring matches more than 1 filename, a prompt will be shown. Submit using numbers 0-9 (higher ascii values work too (<^C> to cancel)). Passing an extra arg to :ea in form of a number will bypass the prompt and open the corresponding file. Example: open filename containing "v" :ea v Example: open first match containing "v" :ea v 0 ea![kwd] [#] Forced version of ea [#]a [str] [#]i [str] [#]c [str] Enter ex {append insert change} mode # determines insertion line number. str determines initial input into the insertion buffer. Example: insert "hello" in vi/ex :i hello<^M><ESC> Example: discard changes in vi/ex :i hello<^C> Example: immediately insert "hello" :i hello<^V><^M><^V><ESC> Example: insert "hello" in raw ex mode i hello<^M>.<^M> [range]d Delete line(s) e [path] Open a file at a path No argument opens "unnamed" buffer. e![path] Force open a file at a path No argument re-reads the current buffer from the filesystem. [range]g{*}[kwd]{*}{cmd} Global command Execute an ex command on a range of lines that matches an enclosed regex. Example: remove empty lines :g/^$/d Multiple ex commands can be chained in one global command. To chain commands, the ex separator <:> must be escaped once. Example: yank matches appending to reg 'a' and print them out. :g/int/ya A\:p It is possible to nest global commands inside of global commands. Example: find all lines with int and a semicolon and append "has a semicolon" :g/int/:.g/;/& A has a semicolon Example: extract/print data enclosed in "()" :g/\(.+\)/;0;/\(.+\)/\:.;.+1k a\:se grp=2\:;/\)*(\))/\: se nogrp\:k s\:.;'a;'sp [range]g!{*}[kwd]{*}{cmd} Inverted global command [range]= Print the current range line number [range]k [mark] Set a line mark The character offset is set to the current position. &{macro} Global non-blocking macro Execute any sequence of vi/ex commands or macros. A non-blocking macro shall not wait for input when the end of the sequence is reached. A non-blocking macro executing other macros will always reach a terminating point. Example: execute vi insert statement :& ihello Example: execute :hello :& \:hello Example: execute ci(int macro :& ci(int Example: turn non-blocking into blocking macro :& \:@ \\:run as non-blocking but blocking<^V><^M> @{macro} Global blocking macro Execute any sequence of vi/ex commands or macros. A blocking macro shall wait for input when the end of the sequence is reached. A blocking macro executing other macros may result in congestion. Example: execute vi insert statement :@ ihello Example: insert "hello" into <:> prompt :@ \:hello Example: execute ci(int macro :@ ci(int Example: execute ci(int exiting insert mode :@ ci(int<^V><^C> Example: execute ci)INT as a follow-up :@ ci(int<^V><^C>ci)INT Example: execute dw after user exits insert :@i:@dw pu [register] [cmd] Paste a register To pipe register data to an external process use :pu \![cmd] Example: copy default register to X11 clipboard :pu \!xclip -selection clipboard [range]r [path cmd] Read a file or a pipe To read data from a pipe use :[range]r \![cmd] Example: pipe in only the first line :r \!ls Example: pipe in only lines 3,5 :3,5r \!ls Example: pipe in all data :%r \!ls [range]w [path cmd] Write to a file or a pipe To pipe buffer data to external process use :[range]w \![cmd] Example: pipe out all data into less :w \!less Example: pipe out only first 10 lines :1,10w \!less [range]w! [path] Force write to a file q Soft quit q! Force quit wq[!] x[!] Write and force/soft quit u[# $] Undo # times or all with $ rd[# $] Redo # times or all with $ se {exp} Set ex option variable Example: set using implications :se hll :se nohll Example: set using exact values :se hll=1 :se hll=0 Example: set using ascii character :se hll=a [range]s{*}[kwd]{*}{str}[*][opts] Substitute Find and replace text in a range of lines that matches an enclosed regex with an enclosed replacement string. Example: global replacement :%s/term1/term2/g Substitution backreference inserts the text of matched group specified by \x where x is group number. Example: substitution backreference this is an example text for subs and has int or void :%s/(int)|(void)/pre\0after this is an example text for subs and has preintafter or void :%s/(int)|(void)/pre\2after/g this is an example text for subs and has prepreafterafter or prevoidafter [range]ya [register][append] Yank a region To append to the register, pass in its uppercase version. To append to any of the non-alphabetical registers add any extra character to the command. Example: append to register <1> :ya 1x ya![register] Reset register value [range]![cmd] Run external program When ex range specified, pipes the buffer data to an external process and pipes the output back into current buffer replacing the affected range. Example: infamously sort the buffer :1,$!sort ft [filetype] Set a filetype No argument prints the current file type. Reloads the highlight ft, which makes it possible to reset dynamic highlights created by options like "hlw". cm [keymap] Set a keymap No argument prints the current keymap name. cm![keymap] Set an alternative keymap fd [path] Set a secondary directory (stands for file dir) Recalculates the directory listing in b-2 buffer. No argument implies current directory. fp [path] Set a directory path for :fd (stands for file path) cd [path] Set a working directory (stands for change dir) Currently open buffers' file paths will be automatically adjusted to reflect a newly set working directory. inc [regex] Include regex for :fd calculation Example: include only files in submodule directory that end with .c :inc submodule.*\.c$ Example: exclude the .git and submodule folders :inc (^[\!.git\!submodule]+[^\/]+$) No argument disables the filter. reg[hscroll] Print registers and their contents Printing position is determined by xcols / 2 * [hscroll] bx[#] Set max number of buffers allowed Buffers will be deallocated if the number specified is lower than the number of buffers currently in use. No argument will reset to the default value of 10. ac [regex] Set autocomplete filter regex Example: autocomplete using whole lines from the buffer :ac .+ No argument resets to the default word filter regex as defined in led.c. uc Toggle multibyte utf-8 decoding This command is particularly useful when editing files with mixed encodings, binary files, or when the terminal does not support UTF-8 or lacks the necessary fonts to display UTF-8 characters. Typically to be used along with :ph for the full effect. ph [#clow] [#chigh] [#width] [#blen][*char] Create new placeholders Example: render 8 bit ascii (Extended ASCII) as <~> :ph 128 255 1 1~ Example: flawless ISO/IEC 8859-1 (latin-1) support :uc:ph 128 160 1 1~ Example: reset to default as in conf.c :ph EX OPTIONS ai If set, indent new lines. ic If set, ignore case in regular expressions. ish Interactive shell Run every <!> command through an interactive shell. The shell will source the .rc file before command execution. This makes it possible to use predefined functions, aliases and ENV variables from the .rc file. Precondition: There must be no stdout output created by .rc file for <!> commands to return expected results. grp Regex search group Defines a target search group for any regex search operation. This becomes necessary when the result of regex search is to be based on some group rather than default match group. Example: ignore tabs at the start of the line :se grp=2:1,$f/^[ ]+(.+):se nogrp The value of grp is calculated using (group number * 2). The default group number is 0. hl If set, highlight text based on rules defined in conf.c. hlr If set, highlight text in reverse direction. hll If set and defined in hl, highlight current line. hlp If set and defined in hl, highlight "[]" "()" "{}" pairs. hlw If set and defined in hl, highlight current word from cursor. led If unset, all terminal output is disabled. vis Control startup flags Example: disable :e message in ex mode :se vis=12 Example: disable :e message in vi mode :se vis=8 Example: enable raw ex mode :se vis=6 Example: disable raw ex mode :se vis=4 mpt Control vi prompts When set to 0 after an ex command is called from vi, disables the "[any key to continue]" prompt. If mpt is negative, the prompt will remain disabled. order If set, reorder characters based on rules defined in conf.c. shape If set, perform Arabic script letter shaping. pac If set, print autocomplete suggestions on the fly. tbs Number of spaces used to represent a tab. td Current text direction context. This option accepts four meaningful values: +2 Exclusively left-to-right. +1 Follow dircontexts[] (in conf.c), defaulting to left-to- right. -1 Follow dircontexts[], defaulting to right-to-left. -2 Exclusively right-to-left. pr Print register Set a special register using a character or a number. Once the register is set, all data passed into ex_print will be stored in the register. If the register is uppercase, <Newline> characters are added to match the exact output that was printed. Example: paste current buffer list exactly like from :b command :se pr=A:ya! a🅱️ pu a Example: store a line printed with :p :se pr=A:ya! a:p sep Ex separator Set Ex command separator character. Default <:> Built-in commands using <:> separator will seize to function. Example: disable separator :se nosep EXINIT ENV VAR EXINIT defines a sequence of vi/ex commands to be performed at startup. Consequently, this is the primary way for scripting and customizing Nextvi outside of C. Many standard text processing utils such as grep, awk, sed can be replaced by Nextvi with EXINIT in mind. Examples: export EXINIT=$'e ./vi.c:& i\x7\x3:bx 1:bx' Index vi.c for autocomplete export EXINIT='b-1:%r ./vi.c:b-1' Load vi.c into a history buffer export EXINIT=$'e:& io{\n}\x16\x3kA\x3:& 1G:& 2\"ayy' Setup @ macro in register <a> @a macro creates <{> and closing <}> below the cursor leaving cursor in insert mode in between the braces. REGEX Nextvi's regex syntax is akin to that of Plan 9. Disregard manpage notation for <{ } [ ]> in this section. . match any single char ^ assert start of the line $ assert end of the line {N,M} match N to M times () grouping (?:) non capture grouping [N-M] match a set of alternate ranges N to M * repeated zero or more times + repeated one or more times | union, alternative branch \< assert start of the word \> assert end of the word ? one or zero matches greedy ?? one or zero matches lazy Additionally, Nextvi supports static lookahead expressions. /[=abc] find <a> followed by "bc" /[!abc] find anything except <a> followed by "bc" /[!abc!cda!qwe] multiple lookahead alternatives in one bracket exp /[!abc^=123] disable lookahead using "^=" specifier and match [123] SPECIAL MARKS * position of the previous change [ first line of the previous change ] last line of the previous change ' position of the previous line region ` position of the previous line region SPECIAL REGISTERS / previous search keyword : previous ex command 0 previous value of default register (atomic) Atomic means the operation did not include a <Newline>. 1-9 previous value(s) of default register (nonatomic) CODE MAP +--------------+----------------------+ | 476 vi.h | definitions/aux | | 537 kmap.h | keymap translation | +--------------+----------------------+ | 294 conf.c | hl/ft/td config | | 336 term.c | low level IO | | 379 ren.c | positioning/syntax | | 590 lbuf.c | file/line buffer | | 652 uc.c | UTF-8 support | | 658 regex.c | extended RE | | 668 led.c | insert mode/output | | 1231 ex.c | ex options/commands | | 1953 vi.c | normal mode/general | | 6761 total | wc -l *.c|sort | +--------------+----------------------+ COMPILING export CC='g++ -x c' set compiler, g++ example export CFLAGS='-s' set CFLAGS, strip example ./cbuild.sh build once ./cbuild.sh build build ./cbuild.sh debug build with -O0 -g ./cbuild.sh pgobuild PGO build can lead to a significant performance boost on some application specific tasks ./cbuild.sh install install vi to $DESTDIR$PREFIX/bin ./cbuild.sh fetch merge commits from upstream repository valgrind --tool=cachegrind --cache-sim=yes --branch-sim=yes ./vi vi.c performance bench test PHILOSOPHY In most text editors, flexibility is a minor or irrelevant design goal. Nextvi is designed to be flexible where the editor adapts to the user needs. This flexibility is achieved by heavily chaining basic commands and allowing them to create new ones with completely different functionality. Command reuse keeps the editor small without infringing on your freedom to quickly get a good grasp on the code. If you want to customize anything, you should be able to do it using only core commands or a mix with some specific C code for more difficult tasks. Simple and flexible design allows for straight forward solutions to any problem long term and filters bad inconsistent ideas. "All software sucks, but some do more than others." - Kyryl Melekhin SEE ALSO New functionality can be obtained through optional patches provided in the patches branch. If you have a meaningful contribution and would love to be made public the patch can be submitted via email or github pull request. https://github.com/kyx0r/nextvi/tree/patches Scripts used to generate this manual are located in the manual branch. https://github.com/kyx0r/nextvi/tree/manual Q: What is pikevm? A: Pikevm is a complete rewrite of Nextvi's regex engine for the purposes of getting rid of backtracking and severe performance and memory constraints. Pikevm guarantees that all regular expressions are computed in constant space and O(n+k) time where n is size of the string and k is some constant for the complexity of the regex i.e. number of state transitions. It is important to understand that it does not mean that we run at O(n) linear speed, but rather the amount of processing time & memory usage is distributed evenly and linearly throughout the string, the k constant plays a big role. If you are familiar with radix sort algorithms this follows the same idea. Q: What are the other benefits? A: For example, now it is possible to compute a C comment /* n */ where n can be an infinite number of characters. Of course this extends to every other valid regular expression. Q: New features pikevm supports? A: Additionally, pikevm supports PCRE style non capture group (?:) and lazy quantifiers like .*? and .+?? because they were easy to implement and allow for further regex profiling/optimization. Q: NFA vs DFA (identify) A: pikevm = NFA backtrack = DFA Q: What's wrong with original implementation? A: Nothing except it being slow and limited. My improved version of Ali's DFA implementation ran 3.5X faster in any case, however I found a bug with it where zero quantifier "?" nested groups compute wrong submatch results. To fix this problem, it would require to undo a lot of optimization work already done, basically going back to how slow Ali's implementation would be. The reason this was spotted so late was because this kind of regex wasn't used before, so I never tested it. Other than that I think submatch extraction is correct on other cases. Pikevm does not have this bug, so it will be used as main regex engine from now on, unless dfa ever finds a proper fix. Honestly, this change isn't so surprising, as I was working on pikevm a few months prior, to favor a superior algorithm. You can still find that code here (likely with no updates): https://github.com/kyx0r/nextvi/tree/dfa_dead As a downside, NFA simulation loses the DFA property of being able to quickly short circuit a match, as everything runs linearly and at constant speed, incurring match time overhead. Well optimized DFA engine can outperform pikevm, but that is rather rare as they got problems of their own. For example as independently benchmarked, dfa_dead runs only 13% faster than pikevm and that is stretching the limit of what is physically possible on a table based matcher. Can't cheat mother nature, and if you dare to try she's unforgiving at best. Supplementary reading by Russ Cox: https://swtch.com/~rsc/regexp/regexp1.html Original Neatvi repository: https://github.com/aligrudi/neatvi AUTHORS Nextvi was written by Kyryl Melekhin <k.melekhin@gmail.com>. It is based on neatvi(1), which was written by Ali Gholami Rudi <ali@rudi.ir>. This manual page was inspired by nepeta <nepeta@canaglie.net> Linux 6.11.9-arch1-1 November 21, 2024 Linux 6.11.9-arch1-1