Marmelator/pdf.js.mirror

mirror of https://github.com/mozilla/pdf.js.git synced 2026-06-24 08:55:48 +02:00

Author	SHA1	Message	Date
Jonas Jenwald	ffa7ac7a91	Use `Map.prototype.getOrInsertComputed()` more in the code-base	2026-06-12 23:21:16 +02:00
Jonas Jenwald	e6dba6ee34	Enable the `radix` ESLint rule Many `parseInt` call-sites already provide the `radix` argument, and this rule helps improve consistency in the code-base; see https://eslint.org/docs/latest/rules/radix Please note: The rule is disabled in `src/scripting_api/util.js` for now, since it's not obvious at a glance (at least to me) what the correct `radix` argument should be there.	2026-04-25 12:13:12 +02:00
Jonas Jenwald	e52f2d1d67	Convert the internal `Map` to a properly private field in the `Dict` class	2026-03-19 09:36:29 +01:00
Jonas Jenwald	6c6bb19324	Use proper access methods in `Dict.merge`, rather than modifying the `_map` field manually	2026-03-19 09:36:29 +01:00
Jonas Jenwald	f4aadea001	Reduce duplication in the `Dict.prototype.{get, getAsync, getArray}` methods These methods are all very similar, so let's introduce a private helper method to reduce unnecessary code duplication.	2026-03-18 11:15:23 +01:00
Calixte Denizet	0fca64f01e	Check for having Ref before adding them in a RefSet (bug 2023106)	2026-03-15 22:03:39 +01:00
Jonas Jenwald	229e3642be	Change the `Dict.prototype.getRawValues` method to return an iterator This method is usually used with loops, and it should be a tiny bit more efficient to use an iterator directly rather than first iterating through `Map`-values to create a temporary `Array` that we finally iterate through at the call-site. Note that the `getRawValues` method is old code, and originally the `Dict` class stored its data in a regular `Object`, hence why the old code was written that way.	2026-03-04 16:07:49 +01:00
Jonas Jenwald	58996f21b2	Change the `Dict.prototype.getKeys` method to return an iterator This method is usually used with loops, and it should be a tiny bit more efficient to use an iterator directly rather than first iterating through `Map`-keys to create a temporary `Array` that we finally iterate through at the call-site. Note that the `getKeys` method is old code, and originally the `Dict` class stored its data in a regular `Object`, hence why the old code was written that way.	2026-03-04 16:07:49 +01:00
Jonas Jenwald	50d66d7d34	Use the `Dict.prototype.getRawEntries` method more This changes a number of loops currently using `Dict.prototype.{getKeys, getRaw}`, since it should be a tiny bit more efficient to use an iterator directly rather than first iterating through `Map`-keys to create a temporary `Array` that we finally iterate through at the call-site. Note that the `getKeys` method is much older than `getRawEntries`, and originally the `Dict` class stored its data in a regular `Object`, hence why the old code was written that way.	2026-03-04 12:46:25 +01:00
Jonas Jenwald	7fd939763e	Remove unnecessary class constructors in the `src` folder There's a number of classes where the constructors can be removed completely by instead using class fields, which help to slightly shorten the code. It seems that `unicorn/prefer-class-fields` ESLint plugin, see PR 20657, unfortunately isn't able to detect all of these cases.	2026-02-19 00:08:57 +01:00
Calixte Denizet	37f4712f7e	Update the named page destinations when some pdfs are combined (bug 1997379) and remove link annotations pointing to a deleted page.	2025-11-07 18:22:19 +01:00
Calixte Denizet	bc87f4e8d6	Add the possibility to create a pdf from different ones (bug 1997379) For now it's only possible to create a single pdf by selecting some pages from different pdf sources. The merge is pretty basic for now (which is why it's still a WIP); none of the following data are merged yet: - the struct trees - the page labels - the outlines - named destinations There are 2 new ref tests where some new pdfs are created: one with some extracted pages and another one (encrypted) which is just rewritten. The ref images are generated from the original pdfs by selecting the pages we want, and the new images are taken from the generated pdfs.	2025-11-07 14:57:48 +01:00
Calixte Denizet	2d5794f79d	[Editor] Fix saving a deleted popup	2025-09-08 15:36:41 +02:00
Calixte Denizet	63b37b4371	Add a few methods to the Dict class in order to simplify the code when writing an annotation	2025-07-08 21:23:29 +02:00
Jonas Jenwald	2c0cc48d1b	Replace the `forEach` method in `Dict` with "proper" iteration support	2024-11-17 12:45:32 +01:00
Jonas Jenwald	691be77f65	Convert the `Dict`-implementation to use a `Map` internally With all the recent work happening under https://bugzilla.mozilla.org/show_bug.cgi?id=1851662, the performance of `Map` is already good enough that I believe that we should now be able to utilize it in the `Dict`-class without problem. This patch was tested in Firefox Nightly, specifically build https://hg.mozilla.org/mozilla-central/rev/6c508a387477e3b72db913a9e1761e9a433d06a2, with the following manifest file: ``` [ { "id": "tracemonkey-eq", "file": "pdfs/tracemonkey.pdf", "md5": "9a192d8b1a7dc652a19835f6f08098bd", "rounds": 100, "type": "eq" }, { "id": "issue2618", "file": "pdfs/issue2618.pdf", "md5": "2c554a99a52288ca1a44a422eeafb8fb", "rounds": 100, "type": "eq" } ] ``` which gave the following results, indicating no significant regression, when comparing this patch against the `master` branch: - Overall ``` -- Grouped By browser, pdf, stat -- browser \| pdf \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| -------------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ----- \| ------------- firefox \| issue2618 \| Overall \| 100 \| 678 \| 678 \| 0 \| 0.04 \| firefox \| issue2618 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -3.88 \| firefox \| issue2618 \| Rendering \| 100 \| 677 \| 677 \| 0 \| 0.05 \| firefox \| tracemonkey-eq \| Overall \| 1400 \| 35 \| 36 \| 0 \| 0.96 \| firefox \| tracemonkey-eq \| Page Request \| 1400 \| 1 \| 1 \| 0 \| -8.08 \| firefox \| tracemonkey-eq \| Rendering \| 1400 \| 34 \| 35 \| 0 \| 1.26 \| ``` - Page-specific ``` -- Grouped By browser, pdf, page, stat -- browser \| pdf \| page \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| -------------- \| ---- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ------ \| ------------- firefox \| issue2618 \| 0 \| Overall \| 100 \| 678 \| 678 \| 0 \| 0.04 \| firefox \| issue2618 \| 0 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -3.88 \| firefox \| issue2618 \| 0 \| Rendering \| 100 \| 677 \| 677 \| 0 \| 0.05 \| firefox \| tracemonkey-eq \| 0 \| Overall \| 100 \| 23 \| 24 \| 0 \| 1.24 \| firefox \| tracemonkey-eq \| 0 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 19.77 \| firefox \| tracemonkey-eq \| 0 \| Rendering \| 100 \| 23 \| 23 \| 0 \| 0.40 \| firefox \| tracemonkey-eq \| 1 \| Overall \| 100 \| 32 \| 32 \| -1 \| -1.89 \| firefox \| tracemonkey-eq \| 1 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -28.13 \| firefox \| tracemonkey-eq \| 1 \| Rendering \| 100 \| 31 \| 31 \| 0 \| -0.77 \| firefox \| tracemonkey-eq \| 2 \| Overall \| 100 \| 17 \| 18 \| 1 \| 4.60 \| firefox \| tracemonkey-eq \| 2 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 23.53 \| slower firefox \| tracemonkey-eq \| 2 \| Rendering \| 100 \| 17 \| 17 \| 1 \| 3.71 \| firefox \| tracemonkey-eq \| 3 \| Overall \| 100 \| 23 \| 24 \| 0 \| 1.71 \| firefox \| tracemonkey-eq \| 3 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 7.79 \| firefox \| tracemonkey-eq \| 3 \| Rendering \| 100 \| 23 \| 23 \| 0 \| 1.55 \| firefox \| tracemonkey-eq \| 4 \| Overall \| 100 \| 31 \| 31 \| 1 \| 2.49 \| firefox \| tracemonkey-eq \| 4 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 48.96 \| firefox \| tracemonkey-eq \| 4 \| Rendering \| 100 \| 30 \| 30 \| 0 \| 1.05 \| firefox \| tracemonkey-eq \| 5 \| Overall \| 100 \| 31 \| 30 \| -1 \| -2.42 \| firefox \| tracemonkey-eq \| 5 \| Page Request \| 100 \| 2 \| 1 \| -1 \| -49.33 \| firefox \| tracemonkey-eq \| 5 \| Rendering \| 100 \| 29 \| 29 \| 0 \| -0.03 \| firefox \| tracemonkey-eq \| 6 \| Overall \| 100 \| 27 \| 27 \| 0 \| 1.81 \| firefox \| tracemonkey-eq \| 6 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 4.94 \| firefox \| tracemonkey-eq \| 6 \| Rendering \| 100 \| 26 \| 27 \| 0 \| 1.68 \| firefox \| tracemonkey-eq \| 7 \| Overall \| 100 \| 26 \| 26 \| 1 \| 3.13 \| firefox \| tracemonkey-eq \| 7 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 6.98 \| firefox \| tracemonkey-eq \| 7 \| Rendering \| 100 \| 25 \| 25 \| 1 \| 2.92 \| firefox \| tracemonkey-eq \| 8 \| Overall \| 100 \| 25 \| 26 \| 1 \| 5.16 \| firefox \| tracemonkey-eq \| 8 \| Page Request \| 100 \| 1 \| 1 \| -1 \| -41.84 \| firefox \| tracemonkey-eq \| 8 \| Rendering \| 100 \| 23 \| 25 \| 2 \| 8.19 \| firefox \| tracemonkey-eq \| 9 \| Overall \| 100 \| 33 \| 33 \| 0 \| 0.03 \| firefox \| tracemonkey-eq \| 9 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 0.79 \| firefox \| tracemonkey-eq \| 9 \| Rendering \| 100 \| 32 \| 32 \| 0 \| -0.10 \| firefox \| tracemonkey-eq \| 10 \| Overall \| 100 \| 144 \| 144 \| 1 \| 0.52 \| firefox \| tracemonkey-eq \| 10 \| Page Request \| 100 \| 2 \| 1 \| -1 \| -43.52 \| firefox \| tracemonkey-eq \| 10 \| Rendering \| 100 \| 141 \| 143 \| 2 \| 1.18 \| firefox \| tracemonkey-eq \| 11 \| Overall \| 100 \| 24 \| 25 \| 1 \| 2.51 \| firefox \| tracemonkey-eq \| 11 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -4.71 \| firefox \| tracemonkey-eq \| 11 \| Rendering \| 100 \| 23 \| 24 \| 1 \| 2.78 \| firefox \| tracemonkey-eq \| 12 \| Overall \| 100 \| 40 \| 39 \| -1 \| -1.67 \| firefox \| tracemonkey-eq \| 12 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 14.71 \| firefox \| tracemonkey-eq \| 12 \| Rendering \| 100 \| 39 \| 38 \| -1 \| -1.98 \| firefox \| tracemonkey-eq \| 13 \| Overall \| 100 \| 19 \| 20 \| 1 \| 3.09 \| firefox \| tracemonkey-eq \| 13 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 24.79 \| firefox \| tracemonkey-eq \| 13 \| Rendering \| 100 \| 18 \| 19 \| 0 \| 1.70 \| ```	2024-11-17 12:44:06 +01:00
Calixte Denizet	4bf7787084	Simplify saving added/modified annotations. Having this map to collect the different changes will allow to know if some objects have already been modified.	2024-11-12 10:59:38 +01:00
Calixte Denizet	6711123f68	[Editor] Update the freetext annotation dictionary instead of creating a new one when updating an existing freetext	2024-07-11 10:44:21 +02:00
Calixte Denizet	45fa867577	Allow to insert several annotations under the same parent in the structure tree While testing stamp insertion with the added pdf, I noticed that the tags using a MCID weren't considered when trying to attach an annotation to it.	2024-04-24 16:23:05 +02:00
Calixte Denizet	a8573d4e1b	[Editor] Add the ability to create/update the structure tree when saving a pdf containing newly added annotations (bug 1845087) When there is no tree, the tags for the new annotations are just put under the root element. When there is a tree, we insert the new tags at the right place using the value of structTreeParentId (added in PR #16916).	2023-09-16 18:34:58 +02:00
Calixte Denizet	1a047f843c	[Editor] Add the possibility to update an existing annotation with some new properties when saving or printing	2023-06-09 17:14:53 +02:00
Jonas Jenwald	1b4a7c5965	Introduce more optional chaining in the `src/core/` folder After PR 12563 we're now free to use optional chaining in the worker-thread as well. (This patch also fixes one previously "missed" case in the `web/` folder.) For the MOZCENTRAL build-target this patch reduces the total bundle-size by `1.6` kilobytes.	2023-05-15 12:38:28 +02:00
Jonas Jenwald	d950b91c4e	Introduce some logical assignment in the `src/core/` folder	2023-04-29 13:49:37 +02:00
Jonas Jenwald	9cb3236ac0	Remove the remaining unnecessary closures in the `src/core/primitives.js` file	2023-04-22 15:33:04 +02:00
Jonas Jenwald	804aa896a7	Stop using the `PRODUCTION` build-target in the JavaScript code This special build-target is very old, and was introduced with the first pre-processor that only uses comments to enable/disable code. When the new pre-processor was added `PRODUCTION` effectively became redundant, at least in JavaScript code, since `typeof PDFJSDev === "undefined"` checks now do the same thing. This patch proposes that we remove `PRODUCTION` from the JavaScript code, since that simplifies the conditions and thus improves readability in many cases. Please note: There's not, nor has there ever been, any gulp-task that set `PRODUCTION = false` during building.	2023-04-17 12:04:34 +02:00
Jonas Jenwald	dcc73423e5	Enable the `unicorn/prefer-logical-operator-over-ternary` ESLint plugin rule This leads to ever so slightly more compact code, and can in some cases remove the need for a temporary variable. Please find additional information here: https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-logical-operator-over-ternary.md	2022-07-12 10:52:37 +02:00
Jonas Jenwald	c0736647f9	Add general iteration support in the `RefSet` and `RefSetCache` classes This patch removes the existing `forEach` methods, in favor of making the classes properly iterable instead. Given that the classes are using a `Set` respectively a `Map` internally, implementing this is very easy/efficient and allows us to simplify some existing code.	2022-03-18 14:27:34 +01:00
Jonas Jenwald	ec87995050	Ensure that `Cmd`/`Name` is only initialized with string arguments Trying to use a non-string argument in either a `Cmd` or a `Name` is not intended, and would basically be an implementation error. Hence we can add a non-PRODUCTION check to enforce this, similar to the existing one used e.g. in the `Dict.set` method.	2022-02-23 22:39:12 +01:00
Jonas Jenwald	a2f9031e9a	Ensure that `Dict.set` only accepts string `key`s Trying to use a non-string `key` in a `Dict` is not intended, and would basically be an implementation error. Hence we can add a non-PRODUCTION check to enforce this, complementing the existing `value` check added in PR 11672.	2022-02-22 16:35:20 +01:00
Jonas Jenwald	2cb2f633ac	Remove the `isRef` helper function This helper function is not really needed, since it's just a wrapper around a simple `instanceof` check, and it only adds unnecessary indirection in the code.	2022-02-19 15:33:42 +01:00
Jonas Jenwald	1a31855977	Remove the `isStream` helper function At this point all the various Stream-classes extend an abstract base-class, hence this helper function is no longer necessary and only adds unnecessary indirection in the code.	2022-02-17 13:51:36 +01:00
Jonas Jenwald	a807ffe907	Prevent circular references in XRef tables from hanging the worker-thread (issue 14303) Please note: This patch on its own is sufficient to prevent the worker-thread from hanging; however, in combination with PR 14311 these PDF documents will both load and render correctly. Rather than focusing on the particular structure of these PDF documents, it seemed (at least to me) to make sense to try and prevent all circular references when fetching/looking-up data using the XRef table. To avoid a solution that required tracking the references manually everywhere, the implementation settled on here instead handles that internally in the `XRef.fetch`-method. This should work, since that method and the `Parser`/`Lexer`-implementations are completely synchronous. Note also that the existing `XRef`-caching, used for all data-types except Streams, should hopefully help to lessen the performance impact of these changes. One potential problem with these changes could be certain browser exceptions, since those are generally not catchable in JavaScript code, however those would most likely "stop" worker-thread parsing anyway (at least I hope so). Finally, note that I settled on returning dummy-data rather than throwing an exception. This was done to allow parsing, for the rest of the document, to continue such that one bad reference doesn't prevent an entire document from loading. Fixes two of the issues listed in issue 14303, namely the `poppler-91414-0.zip-2.gz-53.pdf` and `poppler-91414-0.zip-2.gz-54.pdf` documents.	2021-11-27 23:50:26 +01:00
Jonas Jenwald	ea1c348c67	Always prefer abbreviated keys, over full ones, when doing any dictionary lookups (issue 14256) Note that issue 14256 was specifically about inline images, please refer to: - https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G7.1852045 - https://www.pdfa.org/safedocs-unearths-pdf-inline-image-issue/ - https://pdf-issues.pdfa.org/32000-2-2020/clause08.html#H8.9.7 However, during review of the initial PR in https://github.com/mozilla/pdf.js/pull/14257#issuecomment-964469710, it was suggested that we instead do this unconditionally for all dictionary lookups. In addition to re-ordering the existing call-sites in the `src/core`-code, and adding non-PRODUCTION/TESTING asserts to catch future errors, for consistency a number of existing `if`/`switch`-blocks were re-factored to also check the abbreviated keys first.	2021-11-10 11:56:18 +01:00
Jonas Jenwald	3369f9a783	Move some validation, in `Dict.merge`, used during merging of sub-dictionaries (PR 13775 follow-up) By not adding any additional non-`Dict` entries to the list of candidates for merging of sub-dictionaries, we can very slightly reduce the amount of parsing required by not having to again iterate through unmergeable data.	2021-08-12 11:32:11 +02:00
Jonas Jenwald	766299016f	Remove the `isEOF` helper function and slightly re-factor `EOF` Given how trivial the `isEOF` function is, we can simply inline the check at the various call-sites and remove the function (which ought to be ever so slightly more efficient as well). Furthermore, this patch also changes the `EOF` primitive itself to a `Symbol` instead of an Object since that has the nice benefit of making it unclonable (thus preventing accidentally trying to send `EOF` from the worker-thread).	2021-08-03 20:19:32 +02:00
Jonas Jenwald	e1ee3835cd	Remove some duplication in the `Dict.merge` method Currently the `!mergeSubDicts` code-path is essentially just duplicated code, which we can easily avoid by simply moving that check. (This may lead to ever so slightly more parsing for this case, but the difference ought to be negligible in practice.)	2021-07-22 14:01:43 +02:00
Jonas Jenwald	3838c4e27c	Re-factor the handling of empty `Name`-instances (PR 13612 follow-up) When working on PR 13612, I mostly prioritized a simple solution that didn't require touching a lot of code. However, while working on PR 13735 I started to realize that the static `Name.empty` construction really wasn't a good idea. In particular, having a special `Name`-instance where the `name`-property isn't actually a String is confusing (to put it mildly) and can easily lead to issues elsewhere. The only reason for not simply allowing the `name`-property to be an empty string, in PR 13612, was to avoid having to touch a lot of existing code. However, it turns out that this is only limited to a few methods in the `PartialEvaluator` and a few of the `BaseLocalCache`-implementations, all of which can be easily re-factored to handle empty `Name`-instances. All-in-all, I think that this patch is even an overall improvement since we're now validating (what should always be) `Name`-data better in the `PartialEvaluator`. This is what I ought to have done from the start, sorry about the code churn here!	2021-07-15 12:00:42 +02:00
Jonas Jenwald	6467907318	Support corrupt documents with empty `Name`-entries (issue 13610) Apparently some really bad PDF software can create documents with empty `Name`-entries, which we thus need to somehow deal with. While I don't know if this patch is necessarily the best solution, it should at least ensure that the empty `Name`-instance cannot accidentally match a proper `Name`-instance (and it doesn't require changes to a lot of existing code).[1] --- [1] I briefly considered using a `Symbol` rather than an Object, but quickly decided against that since the former one [is not clonable](https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Structured_clone_algorithm#supported_types) and `Name`-instances may be sent to the API.	2021-06-22 16:55:44 +02:00
Jonas Jenwald	70113131de	Inline the data lookup in the `Dict.getArray` method Similar to the `get`/`getAsync` methods, this should be a tiny bit more efficient which cannot hurt considering that `getArray` is now used a lot more than when initially added.	2021-05-14 11:24:27 +02:00
Jonas Jenwald	757636d519	Convert the remaining functions in `src/core/primitives.js` to use standard classes This patch was tested using the PDF file from issue 2618, i.e. https://bug570667.bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file: ``` [ { "id": "issue2618", "file": "../web/pdfs/issue2618.pdf", "md5": "", "rounds": 50, "type": "eq" } ] ``` which gave the following results when comparing this patch against the `master` branch: ``` -- Grouped By browser, stat -- browser \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ---- \| ------------- firefox \| Overall \| 50 \| 3417 \| 3426 \| 9 \| 0.27 \| firefox \| Page Request \| 50 \| 1 \| 1 \| 0 \| 5.41 \| firefox \| Rendering \| 50 \| 3416 \| 3426 \| 9 \| 0.27 \| ``` Based on these results, there's no significant performance regression from using standard classes and this patch should thus be OK.	2021-05-12 09:36:28 +02:00
Jonas Jenwald	67415bfabe	Add an abstract base-class, which all the various Stream implementations inherit from By having an abstract base-class, it becomes a lot clearer exactly which methods/getters are expected to exist on all Stream instances. Furthermore, since a number of the methods are identical for all Stream implementations, this reduces unnecessary code duplication in the `Stream`, `DecodeStream`, and `ChunkedStream` classes. For e.g. `gulp mozcentral`, the built `pdf.worker.js` file decreases from `1 619 329` to `1 616 115` bytes with this patch-series.	2021-04-28 13:44:45 +02:00
Tim van der Meij	24f80f1e38	Enable the `no-var` linting rule in `src/core/primitives.js`	2021-02-27 12:51:01 +01:00
Jonas Jenwald	81525fd446	Use ESLint to ensure that `export`s are sorted alphabetically There's a built-in ESLint rule, see `sort-imports`, to ensure that all `import`-statements are sorted alphabetically, since that often helps with readability. Unfortunately there's no corresponding rule to sort `export`-statements alphabetically, however there's an ESLint plugin which does this; please see https://www.npmjs.com/package/eslint-plugin-sort-exports The only downside here is that it's not automatically fixable, but the re-ordering is a one-time "cost" and the plugin will help maintain a consistent ordering of `export`-statements in the future. Note: To reduce the possibility of introducing any errors here, the re-ordering was done by simply selecting the relevant lines and then using the built-in sort-functionality of my editor.	2021-01-09 20:37:51 +01:00
Jonas Jenwald	082cd8fc6c	Add global caching, for /Resources without blend modes, and use it to reduce repeated fetching/parsing in `PartialEvaluator.hasBlendModes` The `PartialEvaluator.hasBlendModes` method is necessary to determine if there's any blend modes on a page, which unfortunately requires synchronous parsing of the /Resources of each page before its rendering can start (see the "StartRenderPage"-message). In practice it's not uncommon for certain /Resources-entries to be found on more than one page (referenced via the XRef-table), which thus leads to unnecessary re-fetching/re-parsing of data in `PartialEvaluator.hasBlendModes`. To improve performance, especially in pathological cases, we can cache /Resources-entries when it's absolutely clear that they do not contain any blend modes at all[1]. This way, subsequent `PartialEvaluator.hasBlendModes` calls can be made significantly more efficient. This patch was tested using the PDF file from issue 6961, i.e. https://github.com/mozilla/pdf.js/files/121712/test.pdf: ``` [ { "id": "issue6961", "file": "../web/pdfs/issue6961.pdf", "md5": "a80e4357a8fda758d96c2c76f2980b03", "rounds": 100, "type": "eq" } ] ``` which gave the following results when comparing this patch against the `master` branch: ``` -- Grouped By browser, page, stat -- browser \| page \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ---- \| ------------ \| ----- \| ------------ \| ----------- \| ---- \| ------ \| ------------- firefox \| 0 \| Overall \| 100 \| 1034 \| 555 \| -480 \| -46.39 \| faster firefox \| 0 \| Page Request \| 100 \| 489 \| 7 \| -482 \| -98.67 \| faster firefox \| 0 \| Rendering \| 100 \| 545 \| 548 \| 2 \| 0.45 \| firefox \| 1 \| Overall \| 100 \| 912 \| 428 \| -484 \| -53.06 \| faster firefox \| 1 \| Page Request \| 100 \| 487 \| 1 \| -486 \| -99.77 \| faster firefox \| 1 \| Rendering \| 100 \| 425 \| 427 \| 2 \| 0.51 \| ``` --- [1] In the case where blend modes are found, it becomes a lot more difficult to know if it's generally safe to skip /Resources-entries. Hence we don't cache anything in that case, however note that most document/pages do not utilize blend modes anyway.	2020-11-05 16:59:08 +01:00
Jonas Jenwald	9416b14e8b	Re-factor how the ESLint `no-var` rule is enabled in the `src/` folder This simplifies/consolidates the ESLint configuration slightly in the `src/` folder, and prevents the addition of any new files where `var` is being used.[1] Hence we no longer need to manually add `/* eslint no-var: error */` in files, which is easy to forget, and can instead disable the rule in the `src/core/` files where `var` is still in use. --- [1] Obviously the `no-var` rule can, in the same way as every other rule, be disabled on a case-by-case basis where actually necessary.	2020-10-03 20:15:29 +02:00
Jonas Jenwald	a531c98cd2	Ensure that the empty dictionary won't be accidentally modified Currently there's nothing that prevents modification of the `Dict.empty` primitive, which obviously needs to be truly empty to prevent any future (hard to find) bugs.	2020-09-15 09:29:00 +02:00
Jonas Jenwald	784a420027	Add support, in `Dict.merge`, for merging of "sub"-dictionaries This allows for merging of dictionaries one level deeper than previously. This could be useful e.g. for /Resources dictionaries, where you want to e.g. merge their respective /Font dictionaries (and other) together rather than picking just the first one.	2020-08-30 23:18:32 +02:00
Jonas Jenwald	ea8e432c45	Add a `getRawValues` method, to `Dict` instances, to provide an easier way of getting all raw values When the old `Dict.getAll()` method was removed, it was replaced with a `Dict.getKeys()` call and `Dict.get(...)` calls (in a loop). While this pattern obviously makes a lot of sense in many cases, there's some instances where we actually want the raw `Dict` values (i.e. `Ref`s where applicable). In those cases, `Dict.getRaw(...)` calls are instead used within the loop. However, by introducing a new `Dict.getRawValues()` method we can reduce the number of (strictly unnecessary) function calls by simply getting the raw `Dict` values directly.	2020-07-17 16:32:00 +02:00
Jonas Jenwald	6381b5b08f	Add a `size` getter, to `Dict` instances, to provide an easier way of checking the number of entries This removes the need to manually call `Dict.getKeys()` and check its length.	2020-07-17 16:06:11 +02:00
Tim van der Meij	b19a1796ac	Convert `RefSetCache` to a proper class and to use a `Map` internally Using a `Map` instead of an `Object` provides some advantages such as cheaper ways to get the size of the cache, to find out if an entry is contained in the cache and to iterate over the cache. Moreover, we can clear and re-use the same `Map` object now instead of creating a new one.	2020-07-17 13:35:29 +02:00
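
Several of the commits listed above move `Dict` onto an internal `Map`, keep that `Map` in a private field, return iterators from `getKeys`/`getRawValues`/`getRawEntries`, add a `size` getter, and replace `forEach` with iteration support. The following is a minimal sketch of that general pattern, not the pdf.js source: the class name `MiniDict`, the error message, and the method bodies are assumptions made for illustration, and the real `Dict` also resolves references through an `XRef`, which is left out here.

```javascript
// Hypothetical, simplified stand-in for the `Dict` class discussed in the log.
// It only illustrates the Map-backed, iterator-returning shape; it is NOT the
// pdf.js implementation.
class MiniDict {
  // Private field holding the data, so callers cannot touch the Map directly.
  #map = new Map();

  // Number of entries, without building an array of keys first.
  get size() {
    return this.#map.size;
  }

  set(key, value) {
    // Non-string keys are treated as an implementation error in this sketch.
    if (typeof key !== "string") {
      throw new Error('MiniDict.set: The "key" must be a string.');
    }
    this.#map.set(key, value);
  }

  getRaw(key) {
    return this.#map.get(key);
  }

  // Hand out the Map's own iterators rather than temporary arrays.
  getKeys() {
    return this.#map.keys();
  }

  getRawValues() {
    return this.#map.values();
  }

  getRawEntries() {
    return this.#map.entries();
  }

  // Makes instances usable in `for...of` loops and spread expressions.
  *[Symbol.iterator]() {
    yield* this.#map;
  }
}

const dict = new MiniDict();
dict.set("Type", "Page");
dict.set("Rotate", 90);

// Iterators can be consumed directly in a loop or spread into an array.
const keys = [...dict.getKeys()];
const entries = [...dict];
```

Call-sites that previously looped over an array of keys and called `getRaw` for each one can iterate `getRawEntries()` directly in this shape, which avoids the temporary array those commit messages mention.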
