pdf.js.mirror

Marmelator/pdf.js.mirror

mirror of https://github.com/mozilla/pdf.js.git synced 2026-05-31 07:11:00 +02:00

Author	SHA1	Message	Date
calixteman	2381ac6b16	Update the internal viewer to use a new debugger. It has few cool features: - all the canvas used during the rendering can be viewed; - the different properties in the graphics state can be viewed; - the different paths can be viewed.	2026-03-12 22:38:08 +01:00
Tim van der Meij	decbce7b9b	Merge pull request #20856 from Snuffleupagus/fix-font-clearData Fix the `FontInfo.prototype.clearData` method to actually remove the data as intended (PR 20197 follow-up)	2026-03-12 20:39:02 +01:00
Jonas Jenwald	e88a5652de	Fix the `FontInfo.prototype.clearData` method to actually remove the data as intended (PR 20197 follow-up) The purpose of PR 11844 was to reduce memory usage once fonts have been attached to the DOM, since the font-data can be quite large in many cases. Unfortunately the new `clearData` method added in PR 20197 doesn't actually remove anything, it just replaces the font-data with zeros which doesn't help when the underlying `ArrayBuffer` itself isn't modified. The method does include a commented-out `resize` call[1], but uncommenting that just breaks rendering completely. To address this regression, without having to make large or possibly complex changes, this patch simply changes the `clearData` method to replace the internal buffer/view with its contents before the font-data. While this does lead to a data copy, the size of this data is usually orders of magnitude smaller than the font-data that we're removing. --- [1] Slightly off-topic, but I don't think that patches should include commented-out code since there's a very real risk that those things never get found/fixed. At the very least such cases should be clearly marked with `// TODO: ...` comments, and should possibly also have an issue filed about fixing the TODO.	2026-03-12 18:15:42 +01:00
Jonas Jenwald	9aa1ce8f14	Move the `PagesMapper` class into its own file The `PagesMapper` class currently makes up one third of the `src/display/display_utils.js` file size, and since its introduction it's grown (a fair bit) in size. Note that the intention with files such as `src/display/display_utils.js` was to have somewhere to place functionality too small/simple to deserve its own file.	2026-03-11 12:28:13 +01:00
Jonas Jenwald	79df166e06	Merge pull request #20846 from Snuffleupagus/internal-viewer-followup A couple of small improvements of the new internal viewer	2026-03-11 12:01:11 +01:00
calixteman	9d093d9607	Merge pull request #20626 from nicolo-ribaudo/images-right-click Add support for right-clicking on images (bug 1012805)	2026-03-11 11:45:51 +01:00
Jonas Jenwald	60d6abdf4f	A couple of small improvements of the new internal viewer - Mention the internal viewer in the README, such that it's easier to find. - Implement a new `INTERNAL_VIEWER` define, such that it's easier to limit code to only the "internal-viewer" gulp target. - Only include the "GetRawData" message-handler when needed. Note that the `MessageHandler` [already throws](`eb159abd6a/src/shared/message_handler.js (L121-L123)`) for any missing handler. - Move the various new helper functions from `src/core/document.js` and into their own file. The reasons for doing this are: - That file is already quite large and complex as-is, and these helper functions are slightly orthogonal to its main functionality. - Babel isn't able to remove all of the new code, and by moving this into a separate file we can guarantee that no extra code ends up in e.g. Firefox.	2026-03-10 23:41:35 +01:00
Tim van der Meij	44a63549b0	Merge pull request #20831 from calixteman/internal_viewer Add a new internal viewer to explore the structure of PDF files.	2026-03-10 20:48:40 +01:00
Tim van der Meij	15e58f3912	Merge pull request #20830 from Snuffleupagus/validateRangeRequestCapabilities-fix-tests Improve the `validateRangeRequestCapabilities` unit-tests	2026-03-10 20:23:14 +01:00
Tim van der Meij	3f75c4e511	Merge pull request #20829 from Snuffleupagus/Blob-bytes Start using `Blob.prototype.bytes()` in the code-base	2026-03-10 20:14:49 +01:00
Jonas Jenwald	5ef582fb20	Use optional chaining a little bit more in the `src/display/api.js` file That format is preferred where possible, since it leads to ever so slightly shorter code overall.	2026-03-10 15:51:05 +01:00
Nicolò Ribaudo	4f7a025e21	Separate bbox tracking from dependencies tracking When recording bboxes for images, it's enough to record their clip box / bounding box without needing to run the full bbox tracking of the image's dependencies.	2026-03-10 14:51:03 +01:00
Nicolò Ribaudo	886c90d1a5	Add support for right-clicking on images This patch adds right-click support for images in the PDF, allowing users to download them. To minimize memory consumption, we: - Do not store the images separately, and instead crop them out of the PDF page canvas - Only extract the images when needed (i.e. when the user right-clicks on them), rather than eagery having all of them available. To do so, we layer one empty 0x0 canvas per image, stretched to cover the whole image, and only populate its contents on right click. These images need to be inside the text layer: they cannot be _behind_ it, otherwise they would be covered by the text layer's container and not be clickable, and they cannot be in front of it, otherwise they would make the text spans unselectable. This feature is managed by a new preference, `imagesRightClickMinSize`: - when it's set to `-1`, right-click support is disabled - when set to `0`, all images are available for right click - when set to a positive integer, only images whose width and height are greater than or equal to that value (in the PDF page frame of reference) are available for right click. This features is disabled by default outside of MOZCENTRAL, as it significantly degrades the text selection experience in non-Firefox browsers.	2026-03-10 14:51:03 +01:00
Jonas Jenwald	873378b718	Merge pull request #20836 from Snuffleupagus/FontFaceObject-fix-asserts Fix the `disableFontFace` and `fontExtraProperties` asserts in the `FontFaceObject` constructor (PR 20197 follow-up)	2026-03-09 20:08:58 +01:00
Jonas Jenwald	9f69617109	Fix the `disableFontFace` and `fontExtraProperties` asserts in the `FontFaceObject` constructor (PR 20197 follow-up) In PR 19548 these checks were added to ensure that the font-data sent from the worker-thread always include correct `disableFontFace` and `fontExtraProperties` data. For some reason PR 20197 then changed the code such that these checks became effectively pointless, since these properties are now checked after the fact and the new getters provide fallback values.	2026-03-09 18:00:11 +01:00
Jonas Jenwald	dbb6ffb8d5	Change the `Font.prototype.glyphCacheValues` method to return an iterator This method is only used with loops, and it should be a tiny bit more efficient to use an iterator directly rather than first iterating through the underlying data to create a temporary `Array` that we finally iterate through at the call-site. Please note: As port of these changes the chars/glyph caches, on the `Font` instances, are changed to use `Map`s rather than Objects.	2026-03-09 16:18:48 +01:00
Jonas Jenwald	8bbb7c88d3	Change the `AnnotationLayer.prototype.getEditableAnnotations` method to return an iterator This method is only used with loops, and it should be a tiny bit more efficient to use an iterator directly rather than first iterating through the underlying `Map` to create a temporary `Array` that we finally iterate through at the call-site.	2026-03-09 16:11:21 +01:00
calixteman	9d81fafa8c	Add a new internal viewer to explore the structure of PDF files. The one from pdf.js.utils is a bit too old: a lot of bugs have been fixed in the code that parses PDF files since then. It's just an internal development tool, so it doesn't need to be perfect, but it should be good enough to be useful.	2026-03-09 14:16:12 +01:00
Calixte Denizet	0e48c16c3c	Add a UI to undo cut/delete and cancel a copy (bug 2021352, bug 2010832) This happens in a bar on top of the thumbnails sidebar. The label depending on the selected thumbnails is fixed.	2026-03-09 10:44:11 +01:00
Jonas Jenwald	a1b769caea	Improve the `validateRangeRequestCapabilities` unit-tests A number of these unit-tests didn't actually cover the intended code-paths, since many of them accidentally matched the "file size is smaller than two range requests"-check. The patch also updates `validateRangeRequestCapabilities` to use return-value names that are consistent with the class fields used in the various stream implementations.	2026-03-08 18:28:50 +01:00
Jonas Jenwald	2598b0dcdd	Start using `Blob.prototype.bytes()` in the code-base Note that this isn't motivated by the miniscule reduction in code-size, but rather by wanting to unblock using this newer feature; see https://developer.mozilla.org/en-US/docs/Web/API/Blob/bytes	2026-03-08 14:06:03 +01:00
calixteman	253ce6e323	Handle outline with Structure Element (SE) destination	2026-03-08 12:28:24 +01:00
Jonas Jenwald	ddd69ce4e0	Remove the "DocProgress" `loaded` fallback from the `getPdfManager` function Falling back to use the `loaded` byteLength if the server `contentLength` is unknown doesn't make a lot of sense, since it'd lead to the `onProgress` callback reporting `percent === 100` repeatedly while the document is loading despite that being obviously wrong. Instead we'll now report `percent === NaN` in that case, thus showing the indeterminate progressBar, which seems more correct if the `contentLength` is unknown. Please note that this code-path is normally not even reached, since streaming is enabled by default (applies e.g. to the Firefox PDF Viewer).	2026-03-08 10:22:01 +01:00
Jonas Jenwald	1f69cf964c	Ensure that `percent === NaN` is consistently reported by the `onProgress` callback With these changes `0`, `NaN`, `null`, and `undefined` in the `total`-property all result in `percent === NaN` being reported by the callback, since previously e.g. `0` would result in `percent === 100` being reported unconditionally which doesn't make a lot of sense. Also, remove the "indeterminate" loadingBar (in the viewer) if the `PDFDocumentLoadingTask` fails since there won't be any more data arriving and displaying the animation thus seems wrong.	2026-03-08 10:21:55 +01:00
Tim van der Meij	98dc351cfa	Merge pull request #20824 from calixteman/bug2015853 Add the possibility to merge/update acroforms when merging/extracting (bug 2015853)	2026-03-07 20:12:02 +01:00
calixteman	baf8647b1f	Add the possibility to merge/update acroforms when merging/extracting (bug 2015853)	2026-03-07 19:03:02 +01:00
Jonas Jenwald	0c514b008b	Use `Response.prototype.bytes()` more in the code-base (PR 20651 follow-up)	2026-03-07 15:50:36 +01:00
Jonas Jenwald	49e8240c19	Use `Map.prototype.getOrInsertComputed` in the scripting implementation This adds a basic non-MOZCENTRAL polyfill for now, which we should be able to remove once the next QuickJS version is released; note the pending changelog at `f1139494d1/Changelog (L7)`	2026-03-07 13:19:40 +01:00
Jonas Jenwald	ca428aadae	Use `Math.sumPrecise` in the scripting implementation This adds a very basic non-MOZCENTRAL polyfill for now, which we should be able to remove once the next QuickJS version is released; note the pending changelog at `f1139494d1/Changelog (L8)`	2026-03-07 13:19:40 +01:00
Tim van der Meij	d34a15e03f	Merge pull request #20662 from Snuffleupagus/getPdfManager-async-read Convert the data reading in `getPdfManager` to be asynchronous	2026-03-07 13:16:22 +01:00
Jonas Jenwald	d236b517fe	Shorten the `createActionsMap` helper in the `src/scripting_api/common.js` file	2026-03-07 11:22:21 +01:00
Jonas Jenwald	efa13c5e2a	Don't duplicate the `Jbig2Error` exception Let `src/core/jbig2_ccittFax_wasm.js` import the existing exception, rather than duplicate its code.	2026-03-06 12:04:08 +01:00
Jonas Jenwald	29362e6afb	Remove the `JBig2CCITTFaxWasmImage` instance when running clean-up This follows the same pattern as the existing handling for the `JpxImage` instance.	2026-03-06 12:04:03 +01:00
Jonas Jenwald	7f4e29ed22	Change the "Terminate" worker-thread handler to an asynchronous function This is a tiny bit shorter, which cannot hurt.	2026-03-06 11:24:12 +01:00
Jonas Jenwald	e8ab3cb335	Convert the data reading in `getPdfManager` to be asynchronous This is not only shorter, but (in my opinion) it also simplifies the code. Note: In order to keep the five different `BasePDFStreamReader` implementations consistent, we purposely don't re-factor the `PDFWorkerStreamReader` class to support `for await...of` iteration.	2026-03-05 22:50:26 +01:00
Tim van der Meij	688ae9b3e5	Merge pull request #20811 from calixteman/fix_xref Add fetch** functions in the XRefWrapper	2026-03-05 22:02:08 +01:00
Tim van der Meij	01bc76e681	Merge pull request #20806 from Snuffleupagus/BinaryCMapStream-extends-Stream Let `BinaryCMapStream` extend the `Stream` class	2026-03-05 20:43:37 +01:00
Calixte Denizet	150c1e80c2	Add fetch** functions in the XRefWrapper It could fail to not have them if they're used during writing.	2026-03-05 19:21:12 +01:00
Jonas Jenwald	fccee4bffd	Let `BinaryCMapStream` extend the `Stream` class Looking at the `BinaryCMapStream` implementation, it's basically a "regular" `Stream` but with added functionality for reading compressed CMap data. Hence, by letting `BinaryCMapStream` extend `Stream`, we can remove an effectively duplicate method and simplify/shorten the code a tiny bit.	2026-03-05 11:45:29 +01:00
Jonas Jenwald	aa445877a9	Use `BaseStream.prototype.getString` in the `readPostScriptTable` function Currently the `customNames` are read one byte at a time, in a loop, and at every iteration converted to a string. This can be replaced with the `BaseStream.prototype.getString` method, which didn't exist back when this function was written.	2026-03-04 18:34:07 +01:00
Jonas Jenwald	4d0709c174	Merge pull request #20795 from Snuffleupagus/Dict-more-iterators Change the `Dict.prototype.{getKeys, getRawValues}` methods to return iterators	2026-03-04 18:26:42 +01:00
calixteman	7384359a41	Merge pull request #20781 from pengkunbin/fix/chinese-font-names-gbk Fix missing Chinese font name variants (SimFang and XiaoBiaoSong) in GBK encoding detection	2026-03-04 16:49:44 +01:00
Jonas Jenwald	229e3642be	Change the `Dict.prototype.getRawValues` method to return an iterator This method is usually used with loops, and it should be a tiny bit more efficient to use an iterator directly rather than first iterating through ` Map`-values to create a temporary `Array` that we finally iterate through at the call-site. Note that the `getRawValues` method is old code, and originally the `Dict` class stored its data in a regular `Object`, hence why the old code was written that way.	2026-03-04 16:07:49 +01:00
Jonas Jenwald	58996f21b2	Change the `Dict.prototype.getKeys` method to return an iterator This method is usually used with loops, and it should be a tiny bit more efficient to use an iterator directly rather than first iterating through ` Map`-keys to create a temporary `Array` that we finally iterate through at the call-site. Note that the `getKeys` method is old code, and originally the `Dict` class stored its data in a regular `Object`, hence why the old code was written that way.	2026-03-04 16:07:49 +01:00
Jonas Jenwald	40bd73551c	Merge pull request #20793 from Snuffleupagus/more-getRawEntries Use the `Dict.prototype.getRawEntries` method more	2026-03-04 16:05:57 +01:00
calixteman	ce5f34ba13	Merge pull request #20780 from wooorm/wooorm/dismiss-popups Add support for dismissing comment popups with click outside	2026-03-04 15:24:29 +01:00
calixteman	72f98d4e00	Merge pull request #20788 from calixteman/organize_context_menu Add the pages organization actions in the Firefox context menu (bug 2018138)	2026-03-04 15:20:00 +01:00
Jonas Jenwald	50d66d7d34	Use the `Dict.prototype.getRawEntries` method more This changes a number of loops currently using `Dict.prototype.{getKeys, getRaw}`, since it should be a tiny bit more efficient to use an iterator directly rather than first iterating through `Map`-keys to create a temporary `Array` that we finally iterate through at the call-site. Note that the `getKeys` method is much older than `getRawEntries`, and originally the `Dict` class stored its data in a regular `Object`, hence why the old code was written that way.	2026-03-04 12:46:25 +01:00
Nicolò Ribaudo	2f2d5c9e27	Add script to check license headers	2026-03-04 10:40:39 +01:00
Calixte Denizet	d90530b86c	Add the pages organization actions in the Firefox context menu (bug 2018138)	2026-03-04 09:02:39 +01:00

1 2 3 4 5 ...

7520 Commits