Aadhavan Srinivasan
e36310b32d
Added function (and helper functions) to generate a regex that matches all numbers in a range
2 months ago
Aadhavan Srinivasan
298285e44c
Added more test cases
2 months ago
Aadhavan Srinivasan
0de3a94ce3
Fixed bug with lookaheads: f(?=f) would not match anything in 'ffa', because of the 'a' at the end of the string. Fixed by checking if there are other last states when an assertion fails, rather than immediately aborting
2 months ago
Aadhavan Srinivasan
fe1136c54c
Fixed bug with parentheses in lookaround regex; fixed bug with reading last line of test string (if it doesn't end in a newline)
2 months ago
Aadhavan Srinivasan
25c333bea4
Added function to determine if a state is a lookaround
2 months ago
Aadhavan Srinivasan
74c177324b
Added more test cases
2 months ago
Aadhavan Srinivasan
7916629c4d
Added substitute flag - substitute matched text with given text
2 months ago
Aadhavan Srinivasan
ee02e7575e
Added function to generate all case variations of a rune
2 months ago
Aadhavan Srinivasan
c87a4b7136
Added case-insensitve flag
2 months ago
Aadhavan Srinivasan
924e2a8dbc
Added some AI-generated test cases (llama3.1:405b)
2 months ago
Aadhavan Srinivasan
21b2d5a2a9
Added lookaround-related fields to postfixNode struct
2 months ago
Aadhavan Srinivasan
77d19cd84e
Added lookaround-related fields to State struct, added lookaround support to checkAssertion()
2 months ago
Aadhavan Srinivasan
051a8551f3
Match zero-length match at end of string, even if the start node is an assertion (end of string, lookarounds, etc.)
2 months ago
Aadhavan Srinivasan
11c0a0552f
Added support for lokarounds; parsing and adding nodes for different lookarounds
2 months ago
Aadhavan Srinivasan
c807d6664e
Changed generation of characters for non-whitespace, non-digit and non-word characters - it's basically an inverted character class now
2 months ago
Aadhavan Srinivasan
d986999001
Added more tests
2 months ago
Aadhavan Srinivasan
ea64ddc88a
Removed unnecessary duplication of assertion checking
2 months ago
Aadhavan Srinivasan
1ba871d618
Removed dotChars() function, moved notDotChars() setting to main()
2 months ago
Aadhavan Srinivasan
1a1a8f4f9c
Moved flag-checking after flag.Parse()
2 months ago
Aadhavan Srinivasan
e882f41400
Added fields to denote all the characters that an 'allChars' postfixNode _shouldn't_ represent (useful for inverting character classes)
2 months ago
Aadhavan Srinivasan
b3ee1fe5e8
Convert an inverting character class into an 'allChars' node, with the characters marked as exceptions
2 months ago
Aadhavan Srinivasan
708a9e1303
Added field to denote all characters which an 'allChars' node _shouldn't_ match (useful for invertinc character classes
2 months ago
Aadhavan Srinivasan
c694c47be7
Added flag to print match indices, and to enable multi-line mode
2 months ago
Aadhavan Srinivasan
2569f52552
Wrote toString function for MatchIndex
2 months ago
Aadhavan Srinivasan
160b2f9215
Added newline character as an escaped node
2 months ago
Aadhavan Srinivasan
992c5a9300
Replaced isAlphaNum() with isNormalChar(), which returns true if the character isn't special (also returns true for unicode characters, which the previous function didn't
2 months ago
Aadhavan Srinivasan
1e0502c6aa
Added unicode tests
2 months ago
Aadhavan Srinivasan
c56d81a335
Added unicode support to dot metacharacter - it now matches _any_ unicode character (almost)
2 months ago
Aadhavan Srinivasan
8a1f1dc621
Added unicode support
...
Replaced strings with rune-slices, which capture unicode codepoints more
accurately.
2 months ago
Aadhavan Srinivasan
805766a5ba
Added support for -l : only print lines with at least one match (or with exactly 0 matches, if -v is enabled
2 months ago
Aadhavan Srinivasan
dcd712dceb
Added support for -o flag: only print matching content
2 months ago
Aadhavan Srinivasan
f2b8812b05
Added support for -v flag, to invert which values are printed in color. Also got rid of unecessary 'else' clause
2 months ago
Aadhavan Srinivasan
11641596fa
Read multiple lines from stdin and apply regex to each one; Convert the array of matchIndex structs into a flat array of indices; speeds up process of checking if we have to print a character in color
2 months ago
Aadhavan Srinivasan
b55b80ec6c
Updated TODO
...
I didn't like the existing capturing group implementation, so I moved
that to a separate branch. This branch does not (at the moment) any code
relating to capturing groups.
2 months ago
Aadhavan Srinivasan
137ea3c746
Made findAllMatchesHelper non-recursive, added pruneIndices (improved performance) and more changes
...
I made findAllMatchesHelper a non-recursive function. It now only
returns the first match it finds in the string (so I should probably
rename it).
These indices are collected by findAllMatches and pruned (to
remove overlaps). The overlap function has also been rewritten, to make
it (I believe) less than O(n^2). I also used the uniq_arr type to make
checking for uniqueness O(1) instaed of O(n) (as it was with
unique_append()). This has resulted in massive performance gains.
There's been a lot of changes here, and I probably haven't documented
all of them.
3 months ago
Aadhavan Srinivasan
9201ed49bd
Changed type from matchIndex to MatchIndex
3 months ago
Aadhavan Srinivasan
9a073aa514
Added node types for left and right parentheses
3 months ago
Aadhavan Srinivasan
7d265495f5
Got rid of list for uniq_arr (O(n) deletion) and instead have separate method to create list (O(n) list creation)
3 months ago
Aadhavan Srinivasan
e2e99ff6a9
Added fnunction to generate numbers in a range; added capacity to some slices to prevent unnecessary reallocations
3 months ago
Aadhavan Srinivasan
8a69ea8cb7
Added unique array data structure - O(1) addition and retrieval (I think)
3 months ago
Aadhavan Srinivasan
ea17251bf8
Might have made a change to improve performance
3 months ago
Aadhavan Srinivasan
e8aca8606a
Added test cases
3 months ago
Aadhavan Srinivasan
9698c4f1d8
Fixed error in calculating word boundary (off-by-one)
3 months ago
Aadhavan Srinivasan
c032dcb2ea
Added more test cases
3 months ago
Aadhavan Srinivasan
269e2d0e1c
Updated go.mod
3 months ago
Aadhavan Srinivasan
21142e6e13
Wrote function to clone the NFA starting at a given state, and a function to find question mark operator (a? == (a|))
3 months ago
Aadhavan Srinivasan
b602295bee
Added support for specifying how often a postfixNode is repeated
3 months ago
Aadhavan Srinivasan
1d9d1a5b81
Fixed calculation of overlapping (used to check for subset instead)
3 months ago
Aadhavan Srinivasan
d8f52b8ccc
Added support for numeric specifiers, moved question mark operator to its own function
3 months ago
Aadhavan Srinivasan
dca81c1796
Replaced rune-slice parameters with string parameters in functions; avoids unnecessary conversion from strings to rune-slices
3 months ago