ucode - The ucode Scripting Language

Age	Commit message (Collapse)	Author
2022-03-02	Merge pull request #45 from jow-/lib-add-uloop	Jo-Philipp Wich
	lib: introduce uloop binding
2022-03-02	lib: introduce uloop binding	Jo-Philipp Wich
	The uloop module allows controlling the uloop event loop and supports adding timeouts, processes and file descriptors. Example: #!ucode -RS let fs = require("fs"); let uloop = require("uloop"); let fd1 = fs.popen("echo 1; sleep 1; echo 2; sleep 1; echo 3", "r"); let fd2 = fs.popen("echo 4; sleep 1; echo 5; sleep 1; echo 6", "r"); function fd_read_callback(flags) { if (flags & uloop.ULOOP_READ) { let line = this.handle().read("line"); printf("Line from fd <%s/%d>: <%s>\n", this, this.fileno(), trim(line)); } } uloop.init(); uloop.timer(1500, function() { printf("Timeout after 1500ms\n"); }); uloop.handle(fd1, fd_read_callback, uloop.ULOOP_READ); uloop.handle(fd2, fd_read_callback, uloop.ULOOP_READ); uloop.process("date", [ "+%s" ], { LC_ALL: "C" }, function(exitcode) { printf("Date command exited with code %d\n", exitcode); }); uloop.run(); Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-03-02	vm: release this context on exception in managed method call	Jo-Philipp Wich
	When attempting to invoke a non-function value as method or when the the internal recursion limit was exceeded, `uc_vm_call_function()` emitted and internal runtime exception and freed the function value but not the `this` context associated with the method call. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-15	tests: fix proto() testcase	Jo-Philipp Wich
	Fixes: 4ce69a8 ("fs: implement access(), mkstemp(), file.flush() and proc.flush()") Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-15	fs: implement access(), mkstemp(), file.flush() and proc.flush()	Jo-Philipp Wich
	The `access()` function allows testing the path given in the first argument for accessibility according to the permissions specified in the second mode string argument. The mode string characters may be `r`, `w`, `x` or `f` which correspond to `R_OK` - path is readable, `W_OK` - path is writable, `X_OK` - path is executable or `F_OK` - path exists respectively. The `mkstemp()` function creates a secure temporary file, unlinks it and returns the open file handle. The temporary path is constructed based on the optional template argument. If the template argument contains a slash, the path is taken as-is, if it contains no slashes, `/tmp/` is prepended. If the template does not end with `XXXXXX`, a `.XXXXXX` suffix is appended to the path. If the template is omitted, `/tmp/XXXXXX` is used. The `file.flush()` and `proc.flush()` functions call `fflush()` on the underlying file handle respectively. They take no arguments. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-11	compiler: fix patchlist corruption on switch statement syntax errors	Jo-Philipp Wich
	When compiling a switch statement with duplicate `default` cases or a switch statement with syntax errors before the body block, two error handling cases were hit in the code that prematurely returned from the function without resetting the compiler's patchlist pointer away from the on-stack patchlist that had been set up for the switch statement. Upon processing a subsequent break or continue control statement, a realloc was performed on the then invalid patchlist contents, triggering a segmentation fault or libc assert. Solve this issue by not returning from the function but breaking the switch body parsing loop. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-11	lib: change `ord()` to always return single byte value	Jo-Philipp Wich
	The most common usecase is extracting the value of a single byte at a specific offset, e.g. to scan a string char-by-char to construct a hash. Furthermore, constructing an array which contains the results of multiple `ord()` invocations is trivial while efficiently extracting a single byte value without the overhead of an intermediate array is not. Due to that, change `ord()` to always return a single integer byte value at the offset specified as second argument or at offset 0 in case no argument was supplied. That means that `ord("Abc", 0, 1, 2)` will now return `65` instead of the former `[ 65, 98, 99 ]` result. Code relying on the former behaviour should either perform multiple calls to `ord()`, passing different offsets each time or switch to the `struct` module which allows efficient unpacking of string data. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-11	vallist: fix storing/retrieving short strings with 8bit byte values	Jo-Philipp Wich
	Due to using signed byte values when writing/reading short strings to/from pointer addresses, 8 bit characters where incorrectly clamped to `-1` (`255`). Fix this issue by treating the input string as `uint8_t` array. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-08	compiler: fix incorrect loop break targets	Jo-Philipp Wich
	When patching jump targets for break statments while compiling for-loop statments, we need jump beyond the instructions popping intermediate loop variables off the stack but before the pop instructions removing local loop body variables to prevent a stack position mismatch between compiler and vm. Before that change, local loop body variables remained on the stack, breaking the expected stack layout. Fixes: b3d758b compiler: ("fix for/break miscompilation") Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-07	Merge pull request #38 from jow-/function-memory-model	Jo-Philipp Wich
	treewide: rework function memory model
2022-02-07	source: convert source objects into proper uc_value_t type	Jo-Philipp Wich
	Instead of implementing a custom limited refcount logic, turn uc_source_t instances into proper uc_value_t objects. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-07	treewide: rework function memory model	Jo-Philipp Wich
	- Instead of treating individual program functions as managed ucode types, demote uc_function_t values to pointers into a uc_program_t entity - Promote uc_program_t to a managed type - Let uc_closure_t claim references to the owning program of the enclosed uc_function_t - Redefine public APIs uc_compile() and uc_vm_execute() APIs to return and expect an uc_program_t object respectively - Remove vallist indirection for function loading and let the compiler emit the function id directly when producing function construction code Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	Merge pull request #37 from jow-/stdlib-tests	Jo-Philipp Wich

2022-02-03	tests: add functional tests for builtin functions	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	lib: fix leaking tokener in uc_json() on parse exception	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	lib: fix infinite loop on empty regexp matches in uc_replace()	Jo-Philipp Wich
	The regular expression `/()/` will match the empty string, causing the match loop to never advance. Add extra logic to deal with this case, similar to the empty separator string logic. Apply a similar exception to replacements of empty search strings, those should yield the same result as empty regexp matches. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	lib: fix infinite loop on empty regexp matches in uc_match()	Jo-Philipp Wich
	The regular expression `/()/` will match the empty string, causing the match loop to never advance. Add extra logic to deal with this case. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	lib: fix infinite loop on empty regexp matches in uc_split()	Jo-Philipp Wich
	The regular expression `/()/` will match the empty string, causing the match loop to never advance. Add extra logic to deal with this case, similar to the empty separator string logic. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	vm: ensure consistent trace output between gcc and clang compiled ucode	Jo-Philipp Wich
	Clang emits code which evaluates function call argument expressions in a different order, causing `uc_dump_insn()` to receive the instruction pointer address after decoding the instruction, not before. Avoid that problem by explicitly caching the pre-decode address in a temporary variable which is then passed to `uc_dump_insn()`. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	vm: fix leaking function value on call exception	Jo-Philipp Wich
	The internal uc_vm_call_function() helper may fail in different ways before the stack frame has been set up, e.g. if the provided function value was not actually a callable function. In such cases an exception is raised but the actual function value is leaked since there's not yet a stackframe referring to it. Solve the issue by freeing the function value explicitly in these exit cases. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	vm: NULL-initialize pointer to make cppcheck happy	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	source: zero-initialize conversion union to make cppcheck happy	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	run_tests.sh: change workdir to testcase directory during execution	Jo-Philipp Wich
	Ensure that that the testcase files are executed within the temporary testcase work directory to simplify testing relative path resolution. Also fixup the duplicate resource regression test breaking due to that. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	run_tests.sh: support placing supplemental testcase files	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-02-03	run_tests.sh: always treat outputs as text data	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-30	Merge pull request #36 from jow-/bytecode-fixes	Jo-Philipp Wich
	Bytecode fixes
2022-01-29	program: rename bytecode load/write functions, track path of executed file	Jo-Philipp Wich
	Extend source objects with a `runpath` field which contains the original path of the source being executed by the VM. When instantiating source objects from file paths, the `runpath` will be set to the `filename`. When instantiating source buffers using `uc_source_new_buffer()`, the runpath is initially unset. A new function `uc_source_runpath_set()` can be used to adjust the runtime path being associated with a source object. Extend bytecode loading logic to set the source buffer runtime path to the precompiled bytecode file path being loaded and executed. This is required for `sourcepath()` and relative paths in `include()` to function correctly when executing precompiled programs. Finally rename `uc_program_from_file()` and `uc_program_to_file()` to `uc_program_load()` and `uc_program_write()` respectively since the load part now operates on an `uc_source_t` input buffer instead of a plain `FILE *` handle. Adjust users of these API functions accordingly. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-29	lib: fix memory leak in uc_require_ucode()	Jo-Philipp Wich
	We need to release the compiled module function after we executed it in our VM context. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	vm: fix NaN strict equality tests	Jo-Philipp Wich
	A performance shortcut in `ucv_is_equal()` incorrectly led to `NaN === NaN` being true. Fix the issue by only comparing pointers when the involved types are not doubles. Due to fixing `NaN !== NaN`, the `uniq()` function now requires a special case to treat multiple NaNs equal for the sake of generating an array of unique values. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	Merge pull request #35 from jow-/various-fixes	Jo-Philipp Wich
	Various fixes
2022-01-26	vallist: uc_number_parse(): parse empty strings as `0`, not `NaN`	Jo-Philipp Wich
	Fixes: b605dbf ("treewide: rework numeric value handling") Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	vm: fix `null` loose equality/inequality checks	Jo-Philipp Wich
	The current implementation incorrectly yielded `true` for `0 == null` but only `null` must be equal to `null`. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	lib: fix exists() error return value	Jo-Philipp Wich
	The current implementation incorrectly returned `false` which got treated as `NULL` instead of a boolean `false` value. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	tests: reorganize testcase files	Jo-Philipp Wich
	- Rename 03_bugs to 04_bugs - Rename 26_invalid_sparse_array_set to 27_invalid_sparse_array_set Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	run_tests.sh: add ability to define environment variables for testcases	Jo-Philipp Wich
	Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-26	run_tests.sh: fix exitcode evaluation	Jo-Philipp Wich
	The `touch` command result incorrectly shadowed the testcase exit code. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-25	Merge pull request #34 from jow-/struct-variable-length-string-format	Jo-Philipp Wich

2022-01-24	struct: implement `*` format, fix invalid memory accesses	Jo-Philipp Wich
	Implement a new `` format which acts like `s` on unpack but accepts input records which are shorter than the specified length, e.g. the following call will yield "abc" while an equivalent "10s" format would fail: unpack("2", "abc") // [ "ab" ] unpack("10", "abc") // [ "abc" ] unpack("10s", "abc") // null The `` format is primarily useful to extract the remainder of a variable length record without having to encode the specific length of the record directly into the format string. When packing records, the `` format takes at most as many bytes as specified in the format string repeat count. If the input string is shorter than the given repeat count, only as many bytes as present in the input string are taken. A bare `` without any repeat count will take all bytes from the given input string: pack("2", "abc") // "ab" pack("10", "abc") // "abc" pack("*", "abc") // "abc" pack("10s", "abc") // "abc\0\0\0\0\0\0\0" Additionally prevent invalid memory accesses when unpacking a buffer shorter than the length expected by the format string. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-23	Merge pull request #33 from jow-/lib-format-string-rework	Jo-Philipp Wich
	lib: rework format string handling
2022-01-23	lib: rework format string handling	Jo-Philipp Wich
	Instead of extracting and forwarding recognized conversion directives from the user supplied format string, properly parse the format string into its components and reassemble a canonical representation of the conversion directive internally before handing it to the libc's sprintf() implementation. Also take care of selecting the proper conversion specifiers for signed and unsigned 64bit integer values to fix broken `%d`, `%i`, `%u`, `%o`, `%x` and `%X` formats on 32bit systems. While reworking the format logic, also slightly improve `%s` argument handling by not duplicate the given value if it already is a string, which reduces the amount of required heap memory. Ref: https://bugs.openwrt.org/index.php?do=details&task_id=4234 Ref: https://git.openwrt.org/3d3d03479d5b4a976cf1320d29f4bd4937d5a4ba Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-20	lib: fix %J string formats with precision specifier	Jo-Philipp Wich
	Previous refactoring of the code led to an invalid internal format pettern being used to output the formatted JSON data. Fixes: 9041e24 ("lib: fix uninitialized memory access on handling %J string formats") Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	syntax: drop legacy syntax support	Jo-Philipp Wich
	Drop support for the `local` keyword and `delete` function calls. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	lib: replace usages of vasprintf() with xvasprintf()	Jo-Philipp Wich
	This avoid triggering -Werror=unused-result compilation failures with certain toolchains. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	Merge pull request #32 from jow-/precompile	Jo-Philipp Wich
	Introduce ucode precompilation support
2022-01-18	build: support building without compile capabilities	Jo-Philipp Wich
	Introduce a new default enable CMake option "COMPILE_SUPPORT" which allows to disable source code compilation in the ucode interpreter. Such an interpreter will only be able to load precompiled ucode files. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	program: implement support for precompiling source files	Jo-Philipp Wich
	- Introduce new command line flags `-o` and `-O` to write compiled program code into the specified output file - Add support for transparently executing precompiled files, the lexical analyzing and com,pilation phase is skipped in this case Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	source: refactor source file handling	Jo-Philipp Wich
	- Move source object pointer into program entity which is referenced by each function - Move lineinfo related routines into source.c and use them from lexer.c since lineinfo encoding does not belong into the lexical analyzer. - Implement initial infrastructure for detecting source file type, this is required later to differentiate between plaintext and precompiled bytecode files Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	compiler, vm: use a program wide constant list	Jo-Philipp Wich
	Instead of storing constant values per function, maintain a global program wide list for all constant values within the current compilation unit. Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-18	types: add initial infrastructure for function serialization	Jo-Philipp Wich
	- Introduce a new "program" entity which holds the list of functions created during compilation - Instead of storing pointers to the in-memory function representation in the constant list, store the index of the function within the program's function list - When loading functions from the constant list, retrieve the function by index from the program entity Signed-off-by: Jo-Philipp Wich <jo@mein.io>
2022-01-07	Merge pull request #31 from jow-/add-uniq	Jo-Philipp Wich
	lib: implement uniq() function