Commit graph

350 commits

Author SHA1 Message Date
Marko Mikulicic
068525a6d7 Use a YamlDecoder builder to implement optional encoding_trap parameter. 2024-03-19 15:26:16 +01:00
Marko Mikulicic
7705e87ff6 Implement load_from_bytes
Also helps in some cases with #142, when the BOM is at the beginning of
the file (common), but not in corner case where the BOM is at the start
of a document which is not the first one.

Closes: #155
2024-03-19 15:26:16 +01:00
Jim Turner
c962741719 Add byte a offset to the error message 2024-03-19 15:26:16 +01:00
larkbyte
e1119bb3bf Replace linked_hash_map with hashlink 2024-03-19 15:26:16 +01:00
David Aguilar
ef1a2d1d5c ci: replace travis with github actions 2024-03-19 15:26:16 +01:00
David Aguilar
07113c6e1c Cargo.toml: enable the Rust 2021 Edition 2024-03-19 15:26:16 +01:00
David Aguilar
99fb843550 Cargo.toml: add myself to the authors list 2024-03-19 15:26:16 +01:00
David Aguilar
a6c8dfe5b8 Emit multi-line string values as block scalars 2024-03-19 15:26:16 +01:00
Ethiraric
e4ae1d0546 Update to v0.6. 2024-03-15 20:14:26 +01:00
Ethiraric
dc88910c23 Add bench_compare tool. 2024-03-15 18:29:54 +01:00
Ethiraric
6b27316586 Prepare for benchmarks. 2024-03-15 17:34:39 +01:00
Ethiraric
f44cb1dd22 Create a deeper YAML in nested.yaml.
This requires heavily reducing the number of nodes since they are on
average more indented. Leaving 5M nodes results in files larger than
1GB.
2024-03-15 12:48:04 +01:00
Ethiraric
0a05d3c0ed Add gen_large_yaml to justfile's before_commit. 2024-03-15 12:42:10 +01:00
Ethiraric
b7755e119c Remove tools from examples.
Add documentation for those tools, and make it so that `gen_large_yaml`
generates a predetermined set of files instead of outputting to its
standard output.
2024-03-14 19:20:56 +01:00
Ethiraric
3d8a54d384 Add a generator for nested objects. 2024-02-14 00:35:41 +01:00
Ethiraric
765f2bb672 Replace VecDeque with ArrayDeque.
This removes all allocations in the `Scanner` code. The downside is that
the buffer is now stored in the `Scanner` structure, making it 48 bytes
larger. This however makes the code much more performant.
2024-02-13 23:10:32 +01:00
Ethiraric
8a80e547c6 Fix some clippy lints. 2024-02-13 23:10:17 +01:00
Ethiraric
7b431c77d4 Improve comments. 2024-02-08 07:04:38 +01:00
Ethiraric
671a498bf4 Minor improvements. 2024-01-31 22:02:53 +01:00
Ethiraric
5d94b8174d Add big string array bench generation. 2024-01-30 23:11:00 +01:00
Ethiraric
1dcb1eee34 Rename generator tool. 2024-01-30 22:37:32 +01:00
Ethiraric
99fb05c937 Improve scan_plain_scalar readability.
Take whitespace checking out of the innermost loop for performance.
2024-01-25 03:06:18 +01:00
Ethiraric
f535e505a7 Avoid a trip to self.buffer.
`self.buffer` is a `VecDeque<char>`, meaning that characters are stored
on 4B. When reading as we used to do, this means that every 1 byte
character we read was turned into 4 bytes, which was turned into 1 byte
in `String::extend`.

Instead of going through `self.buffer`, use a local `String` to store
the characters before pushing them to `string`.
2024-01-24 23:02:02 +01:00
Ethiraric
cfbf287b3d Buffer block scalar lines.
Instead of doing a loop that goes:
  * fetch from input stream
  * push char into string

Make a loop that fetches characters while they're not a breakz and
_then_ extend the string. This avoids a bunch of reallocations.
2024-01-24 21:45:18 +01:00
Ethiraric
26ef839cd3 Split skip into more specific variants.
This function is a hotpath and sometimes removing the conditional jump
to detect line breaks saves a bunch of instructions.
2024-01-24 20:20:52 +01:00
Ethiraric
6babe8de60 Doing this leads to worse performance. 2024-01-24 19:42:18 +01:00
Ethiraric
aaa9288842 Pre-load chars in skip_block_scalar_indent. 2024-01-24 19:31:45 +01:00
Ethiraric
2cebf0af1d Help the compiler inline read_break. 2024-01-24 17:14:52 +01:00
Ethiraric
5789169ceb Remove debug prints code from release builds.
If building release mode, remove debug code. Now, the `debug_print!`
macro resolves to nothing in release build.

In debug build, don't check the environment for each print. This has a
huge overhead. The environment is only checked once and next checks are
made against a simple boolean value.
2024-01-24 02:22:02 +01:00
Ethiraric
e6fae1c679 Add justfile for my pre-commit todo-list. 2024-01-24 01:11:53 +01:00
Ethiraric
b4f66c457a Move char is_xxx fn to their own file. 2024-01-24 01:02:20 +01:00
Ethiraric
36e8b06e82 Add benchmarking tools. 2024-02-08 08:15:36 +01:00
Ethiraric
6e5286c2e1 Use OR for dual licensing.
https://doc.rust-lang.org/book/ch14-02-publishing-to-crates-io.html#adding-metadata-to-a-new-crate
2024-01-23 15:20:12 +01:00
Ethiraric
eed3433841 Update licence, readme, doc. 2024-02-08 07:12:14 +01:00
Ethiraric
68c7afcb4c Fix null nodes with tags in flow constructs. 2024-01-23 00:19:04 +01:00
Ethiraric
7f7919748a Fix tag scanning. 2024-01-23 00:19:04 +01:00
Ethiraric
76b3773ffd Fix indent in block scalars. 2024-01-23 00:19:04 +01:00
Ethiraric
9f4b1480df Fix dquote indentation. 2024-01-23 00:19:04 +01:00
Ethiraric
2e94a4217b Fix dquote string escape sequences. 2024-01-23 00:19:04 +01:00
Ethiraric
7145ee8282 Fix tests related to anchor/alias indentation.
I have no idea what I'm doing.
2024-01-23 00:19:04 +01:00
Ethiraric
9281e35bbb Fix flow adjacent value with complex key. 2024-01-23 00:19:04 +01:00
Ethiraric
7c3e8ed7c4 Lint and improve formatting. 2024-01-23 00:19:04 +01:00
Ethiraric
5dbd6f9a55 Fix use of dashes in flow contexts. 2024-01-23 00:19:04 +01:00
Ethiraric
628549085c Fix block scalar / eof interactions. 2024-01-23 00:19:04 +01:00
Ethiraric
c0caeb84e9 Fix block scalars and document end interaction. 2024-01-23 00:19:04 +01:00
Ethiraric
ff28b55d22 Fix duplicate version directive. 2024-01-23 00:19:04 +01:00
Ethiraric
7b744d092e Fixes towards implicit document end. 2024-01-23 00:19:04 +01:00
Ethiraric
f27a7c25a8 Fixes towards spaces before comments. 2024-01-23 00:19:04 +01:00
Ethiraric
0b58be1458 Fix possible misindent in block scalar. 2024-01-23 00:19:04 +01:00
Ethiraric
0a1aebaf3c Fix towards multiple documents in a single stream. 2024-01-23 00:19:04 +01:00