local wtxtdiff=require("wetgenes.txt.diff")

lua.wetgenes.txt.diff.find

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/diff.lua

Given two tables of strings, return the length, starta, startb of the longest common subsequence in table indexes, or nil if not similar.
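As an illustration only, here is a minimal standalone sketch of this kind of search. It assumes that returning a single start index per table means the match is one contiguous run of equal strings. The name longest_run is hypothetical and this is not the library's implementation.

```lua
-- Hypothetical helper, not wtxtdiff.find itself.
-- Finds the longest contiguous run of equal strings shared by a and b.
local function longest_run(a, b)
	local best, starta, startb = 0, nil, nil
	local prev = {} -- prev[j] = length of the run ending at a[i-1], b[j]
	for i = 1, #a do
		local cur = {}
		for j = 1, #b do
			if a[i] == b[j] then
				local n = (prev[j - 1] or 0) + 1
				cur[j] = n
				if n > best then
					best, starta, startb = n, i - n + 1, j - n + 1
				end
			end
		end
		prev = cur
	end
	if best == 0 then return nil end -- nothing in common
	return best, starta, startb
end

local a = { "x\n", "one\n", "two\n", "three\n" }
local b = { "one\n", "two\n", "three\n", "y\n" }
local len, sa, sb = longest_run(a, b) -- 3, 2, 1
```

The run "one", "two", "three" has length 3 and starts at index 2 in a and index 1 in b.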

lua.wetgenes.txt.diff.match

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/diff.lua

Given two tables of strings, return two tables of strings of the same length where as many strings as possible match.

lua.wetgenes.txt.diff.split

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/diff.lua

Use the delimiter to split a string into a table of strings such that each string ends in the delimiter (except for possibly the final string) and a table.concat on the result will recreate the input string exactly.

table = wtxtdiff.split(string,delimiter)

String is the string to split and delimiter is a Lua pattern, so any special characters should be escaped.

For example:

st = wtxtdiff.split(s) -- split on newline (default)
st = wtxtdiff.split(s,"\n") -- split on newline (explicit)

st = wtxtdiff.split(s,"%s+") -- split on white space
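The splitting rule described above (each piece keeps its trailing delimiter, and table.concat restores the input) can be sketched without the library as follows. The function name split_keep is hypothetical, and the handling of empty pattern matches is an assumption rather than documented behaviour.

```lua
-- Hypothetical stand-in for wtxtdiff.split, for illustration only.
local function split_keep(s, delimiter)
	delimiter = delimiter or "\n" -- default to newline, as documented
	local out = {}
	local pos = 1
	while pos <= #s do
		local from, to = s:find(delimiter, pos)
		if not from or to < from then
			-- no further delimiter (or an empty match): keep the remainder
			out[#out + 1] = s:sub(pos)
			break
		end
		out[#out + 1] = s:sub(pos, to) -- piece ends with its delimiter
		pos = to + 1
	end
	return out
end

local s = "one\ntwo\nthree"
local st = split_keep(s)          -- { "one\n", "two\n", "three" }
local ws = split_keep("a  b c", "%s+") -- { "a  ", "b ", "c" }
```

Because no characters are dropped, concatenating either result gives back the original string exactly.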

lua.wetgenes.txt.diff.trim

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/diff.lua

Given two tables of strings, return the length at the start and at the end that are the same. This tends to be a good first step when comparing two chunks of text.
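A rough sketch of the idea, assuming the two counts are returned as start length then end length and that the shared tail is never allowed to overlap the shared head. The name common_ends is hypothetical.

```lua
-- Hypothetical helper, not wtxtdiff.trim itself.
-- Counts how many strings match at the start and at the end of a and b.
local function common_ends(a, b)
	local max = math.min(#a, #b)
	local head = 0
	while head < max and a[head + 1] == b[head + 1] do
		head = head + 1
	end
	local tail = 0
	-- stop before the tail overlaps the part already counted in head
	while tail < max - head and a[#a - tail] == b[#b - tail] do
		tail = tail + 1
	end
	return head, tail
end

local a = { "a", "b", "c", "d" }
local b = { "a", "x", "y", "d" }
local head, tail = common_ends(a, b) -- 1, 1
```

Only the middle part (indexes head+1 to #t-tail of each table) then needs a more expensive comparison.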

lua.wetgenes.txt.edit

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/edit.lua

Generic text modifying functions.

lua.wetgenes.txt.lex

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/lex.lua

Some useful lex files from other editors, to be used as starting points and for checking that we did not miss anything.

https://github.com/vim/vim/tree/master/runtime/syntax
https://github.com/sublimehq/Packages

lua.wetgenes.txt.lex_js

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/lex_js.lua

lua.wetgenes.txt.lex_lua

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/lex_txt.lua

lua.wetgenes.txt.undo

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/undo.lua

undo / redo code for a text editor with persistence to disk

lua.wetgenes.txt.utf

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

local wutf = require("wetgenes.txt.utf")

helper functions to manage a string as a stream of utf8 tokens.

lua.wetgenes.txt.utf.char

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

string = wutf.char(number)

convert a single unicode value to a utf8 string of 1-4 bytes
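For reference, the standard UTF-8 encoding of one code point can be sketched as below. This is a standalone illustration with a hypothetical name (utf8_encode), not the source of wutf.char, and it does no range or surrogate checking.

```lua
-- Hypothetical helper showing the 1-4 byte UTF-8 layouts.
local function utf8_encode(code)
	if code < 0x80 then
		return string.char(code)
	elseif code < 0x800 then
		return string.char(
			0xC0 + math.floor(code / 0x40),
			0x80 + code % 0x40)
	elseif code < 0x10000 then
		return string.char(
			0xE0 + math.floor(code / 0x1000),
			0x80 + math.floor(code / 0x40) % 0x40,
			0x80 + code % 0x40)
	else
		return string.char(
			0xF0 + math.floor(code / 0x40000),
			0x80 + math.floor(code / 0x1000) % 0x40,
			0x80 + math.floor(code / 0x40) % 0x40,
			0x80 + code % 0x40)
	end
end

local euro = utf8_encode(0x20AC) -- "\226\130\172", three bytes
```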

lua.wetgenes.txt.utf.charpattern

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

string:gmatch(wutf.charpattern)

lua pattern to match each utf8 character in a string
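The exact value of wutf.charpattern is not shown here. The sketch below uses an assumed pattern of the same shape as the standard utf8.charpattern (one lead byte followed by any continuation bytes), written with decimal escapes and excluding the NUL byte so it runs on older Lua versions too.

```lua
-- Assumed pattern, not necessarily identical to wutf.charpattern.
local charpattern = "[\1-\127\194-\244][\128-\191]*"

local chars = {}
for ch in ("h\195\169llo"):gmatch(charpattern) do
	chars[#chars + 1] = ch -- one whole utf8 character per iteration
end
-- #chars is 5 even though the string is 6 bytes long
```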

lua.wetgenes.txt.utf.chars

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

string = wutf.chars(number,number,...)
string = wutf.chars({number,number,...})

convert one or more unicode values into a utf8 string

lua.wetgenes.txt.utf.length

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

unicode = wutf.ncode(string,index)

get the utf8 value at the given code index.

Note that this is slower than wutf.code as we must search the string to find the byte index of the code.
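The extra cost comes from the scan sketched below: to reach the n-th code we have to step over every earlier code, one lead byte at a time. The helper name byte_index_of is hypothetical and the input is assumed to be valid utf8.

```lua
-- Hypothetical helper: byte index of the n-th utf8 code in s, or nil.
local function byte_index_of(s, n)
	local index = 1
	for _ = 1, n - 1 do
		local b = s:byte(index)
		if not b then return nil end -- ran off the end of the string
		if b >= 0xF0 then index = index + 4
		elseif b >= 0xE0 then index = index + 3
		elseif b >= 0xC0 then index = index + 2
		else index = index + 1 end
	end
	if index > #s then return nil end
	return index
end

local s = "h\195\169llo"
local third = byte_index_of(s, 3) -- 4, because the second code takes two bytes
```

Once the byte index is known, a byte-indexed lookup at that position gives the value.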

lua.wetgenes.txt.utf.map_latin0_to_unicode

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

unicode = wutf.map_latin0_to_unicode[latin0] or latin0

lua.wetgenes.txt.utf.map_unicode_to_latin0

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

latin0 = wutf.map_unicode_to_latin0[unicode] or unicode

I prefer the coverage of latin0 (ISO/IEC 8859-15) for font layout, as it takes just a few small differences to get most of the glyphs needed for western European languages into the first 256 codes.
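To make the size of the difference concrete, the sketch below lists the eight positions where ISO/IEC 8859-15 differs from ISO/IEC 8859-1. The table names are hypothetical and the real library tables may hold more or differently arranged entries; only the "lookup or fall back to the input" idiom is taken from the usage lines above.

```lua
-- Hypothetical tables: the eight codes where latin0 differs from latin1.
local latin0_to_unicode = {
	[0xA4] = 0x20AC, -- euro sign
	[0xA6] = 0x0160, -- S with caron
	[0xA8] = 0x0161, -- s with caron
	[0xB4] = 0x017D, -- Z with caron
	[0xB8] = 0x017E, -- z with caron
	[0xBC] = 0x0152, -- OE ligature
	[0xBD] = 0x0153, -- oe ligature
	[0xBE] = 0x0178, -- Y with diaeresis
}

-- Build the reverse direction from the same data.
local unicode_to_latin0 = {}
for latin0, unicode in pairs(latin0_to_unicode) do
	unicode_to_latin0[unicode] = latin0
end

local unicode = latin0_to_unicode[0xA4] or 0xA4 -- 0x20AC
local same    = latin0_to_unicode[0x41] or 0x41 -- unchanged, 0x41
```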

lua.wetgenes.txt.utf.size

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

size = wutf.size(string,index)

get the size in bytes of the utf8 value at the given byte index.

size = wutf.size(string)

get the size in bytes of the utf8 value at the start of this string

The return value will be 1-4 as 4 is the biggest utf8 code size.
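The size can be read from the lead byte alone, as in this standalone sketch. The name utf8_size is hypothetical, and treating a stray continuation byte as size 1 is an assumption, not documented behaviour.

```lua
-- Hypothetical helper: size in bytes of the utf8 code starting at index.
local function utf8_size(s, index)
	local byte = s:byte(index or 1)
	if not byte then return nil end   -- index is outside the string
	if byte < 0x80 then return 1 end  -- plain ASCII
	if byte >= 0xF0 then return 4 end
	if byte >= 0xE0 then return 3 end
	if byte >= 0xC0 then return 2 end
	return 1 -- continuation byte: assumed to count as a single byte
end

local size = utf8_size("\226\130\172") -- 3
```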

lua.wetgenes.txt.utf.string

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/utf.lua

unicode = wutf.code(string,index)

get the utf8 value at the given byte index.

unicode = wutf.code(string)

get the utf8 value at the start of this string
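A minimal decoding sketch for a byte index, assuming well-formed input. The name utf8_code is hypothetical, and returning nil for truncated or misplaced bytes is an assumption.

```lua
-- Hypothetical helper: decode the utf8 code that starts at byte index.
local function utf8_code(s, index)
	index = index or 1
	local b1 = s:byte(index)
	if not b1 then return nil end
	if b1 < 0x80 then return b1 end
	local size, code
	if b1 >= 0xF0 then size, code = 4, b1 % 0x08
	elseif b1 >= 0xE0 then size, code = 3, b1 % 0x10
	elseif b1 >= 0xC0 then size, code = 2, b1 % 0x20
	else return nil end -- stray continuation byte
	for i = 1, size - 1 do
		local b = s:byte(index + i)
		if not b then return nil end -- truncated sequence
		code = code * 0x40 + b % 0x40 -- append the low six bits
	end
	return code
end

local code = utf8_code("\226\130\172") -- 0x20AC
```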

lua.wetgenes.txt.words

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/words.lua

local wtxtwords=require("wetgenes.txt.words")

See https://github.com/xriss/engrish for source of words and possible alternative licenses.

lua.wetgenes.txt.words.load

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/words.lua

yes = wtxtwords.check(word)

This is a fast check if the word exists.

May call wtxtwords.load() to auto load data.

lua.wetgenes.txt.words.transform

https://github.com/xriss/gamecake/blob/master/lua/wetgenes/txt/words.lua

list = wtxtwords.transform(word,count,addletters,subletters)

Returns a table of up to count correctly spelled words that you may have misspelt given the input word, ordered by probability.

If the input word is spelled correctly then it will probably be the first word in this list but that is not guaranteed.

addletters is the maximum number of additive transforms. The higher this number, the slower this function; it defaults to 4.

subletters is the maximum number of subtractive transforms and will not have much impact on speed. It defaults to the same value as addletters.

We run subletters subtractive transforms on our starting word, then scan all possible words, perform addletters subtractive transforms on each of them, and see if they match any of the transforms we built from our starting word (removing a letter from a candidate word is the reverse of adding a letter to ours). A match then means we can add up the number of transforms on both sides, and that is how many steps it would take to get from one word to the other by adding and subtracting letters.
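The matching step can be sketched for a single pair of words as below. This is a simplified illustration under stated assumptions: the function names are hypothetical, words are treated as plain bytes (so ASCII only), and there is no word list or probability ordering, both of which the real function would need.

```lua
-- Hypothetical helper: every string reachable from word by deleting up to
-- max letters, mapped to the smallest number of deletions that produces it.
local function deletions(word, max)
	local seen = { [word] = 0 }
	local frontier = { word }
	for depth = 1, max do
		local next_frontier = {}
		for _, w in ipairs(frontier) do
			for i = 1, #w do
				local shorter = w:sub(1, i - 1) .. w:sub(i + 1)
				if seen[shorter] == nil then
					seen[shorter] = depth
					next_frontier[#next_frontier + 1] = shorter
				end
			end
		end
		frontier = next_frontier
	end
	return seen
end

-- Hypothetical helper: fewest add/subtract steps from word to candidate,
-- or nil if the two deletion sets never meet.
local function steps(word, candidate, subletters, addletters)
	local from_word = deletions(word, subletters)
	local best
	for variant, cost_c in pairs(deletions(candidate, addletters)) do
		local cost_w = from_word[variant]
		if cost_w and (not best or cost_w + cost_c < best) then
			best = cost_w + cost_c
		end
	end
	return best
end

local one = steps("helo", "hello", 4, 4) -- 1: add one "l"
local two = steps("cat", "cut", 1, 1)    -- 2: drop "a", add "u"
```

Deleting from both sides and meeting in the middle avoids ever generating insertions, which would otherwise mean trying every possible letter at every position.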