pdf.js.mirror

Marmelator/pdf.js.mirror

mirror of https://github.com/mozilla/pdf.js.git synced 2026-06-24 00:45:49 +02:00

Author	SHA1	Message	Date
calixteman	22b97d1741	Flush the text content chunk only on real font changes (bug 2013793)	2026-02-03 23:11:31 +01:00
calixteman	1c12b07726	Merge pull request #20613 from calixteman/ccittfax_pdfium Use the ccittfax decoder from pdfium	2026-02-02 15:07:53 +01:00
calixteman	88c2051698	Use the ccittfax decoder from pdfium The decoder is a dependency of the jbig2 one and is already included in pdf.js, so we just need to wire it up. It improves the performance of documents using ccittfax images.	2026-02-02 11:10:32 +01:00
Tim van der Meij	f4326e17c4	Merge pull request #20610 from calixteman/brotli Add support for Brotli decompression	2026-02-01 20:41:06 +01:00
calixteman	43273fde27	Add support for Brotli decompression For now, `BrotliDecode` hasn't been specified but it should be in a close future. So when it's possible we use the native `DecompressionStream` API with "brotli" as argument. If that fails or if we've to decompress in a sync context, we fallback to `BrotliStream` which a pure js implementation (see README in external/brotli).	2026-01-31 16:25:53 +01:00
Jonas Jenwald	4ca205bac3	Add an abstract `BasePDFStreamRangeReader` class, that all the old `IPDFStreamRangeReader` implementations inherit from Given that there's no less than five different, but very similar, implementations this helps reduce code duplication and simplifies maintenance.	2026-01-30 14:15:39 +01:00
Jonas Jenwald	54d8c5e7b4	Add an abstract `BasePDFStreamReader` class, that all the old `IPDFStreamReader` implementations inherit from Given that there's no less than five different, but very similar, implementations this helps reduce code duplication and simplifies maintenance. Also, remove the `rangeChunkSize` not defined checks in all the relevant stream-constructor implementations. Note how the API, since some time, always validates and provides that parameter when creating a `BasePDFStreamReader`-instance.	2026-01-30 14:15:39 +01:00
Jonas Jenwald	4a8fb4dde1	Add an abstract `BasePDFStream` class, that all the old `IPDFStream` implementations inherit from Given that there's no less than five different, but very similar, implementations this helps reduce code duplication and simplifies maintenance. Also, spotted during rebasing, pass the `enableHWA` option "correctly" (i.e. as part of the existing `transportParams`) to the `WorkerTransport`-class to keep the constructor simpler.	2026-01-30 14:15:39 +01:00
Jonas Jenwald	a80f10ff1a	Remove the `onProgress` callback from the `IPDFStreamRangeReader` interface Note how there's nowhere in the code-base where the `IPDFStreamRangeReader.prototype.onProgress` callback is actually being set and used, however the loadingBar (in the viewer) still works just fine since loading progress is already reported via: - The `ChunkedStreamManager` instance respectively the `getPdfManager` function, through the use of a "DocProgress" message, on the worker-thread. - A `IPDFStreamReader.prototype.onProgress` callback, on the main-thread. Furthermore, it would definitely not be a good idea to add any `IPDFStreamRangeReader.prototype.onProgress` callbacks since they only include the `loaded`-property which would trigger the "indeterminate" loadingBar (in the viewer). Looking briefly at the history of this code it's not clear, at least to me, when this became unused however it's probably close to a decade ago.	2026-01-30 14:15:39 +01:00
Jonas Jenwald	05b78ce03c	Stop registering an `onProgress` callback on the `PDFWorkerStreamRangeReader`-instance, in the `ChunkedStreamManager` class Given that nothing in the `PDFWorkerStreamRangeReader` class attempts to invoke the `onProgress` callback, this is effectively dead code now. Looking briefly at the history of this code it's not clear, at least to me, when this became unused however it's probably close to a decade ago. Finally, note also how progress is already being reported through the `ChunkedStreamManager.prototype.onReceiveData` method.	2026-01-30 14:15:38 +01:00
Jonas Jenwald	987265720e	Remove the unused `IPDFStreamRangeReader.prototype.isStreamingSupported` getter This getter was only invoked from `src/display/network.js` and `src/core/chunked_stream.js`, however in both cases it's hardcoded to `false` and thus isn't actually needed. This originated in PR 6879, close to a decade ago, for a potential TODO which was never implemented and it ought to be OK to just simplify this now.	2026-01-30 14:15:38 +01:00
Jonas Jenwald	62d5408cf0	Stop tracking `progressiveDataLength` in the `ChunkedStreamManager` class Currently this property is essentially "duplicated", so let's instead use the identical one that's availble on the `ChunkedStream` instance.	2026-01-30 14:15:38 +01:00
Tim van der Meij	471adfd023	Merge pull request #20596 from Snuffleupagus/FileSpec-fixes Simplify the `FileSpec` class, and remove no longer needed polyfills	2026-01-29 22:03:38 +01:00
Jonas Jenwald	5b368dd58a	Remove the `Uint8Array.prototype.toHex()`, `Uint8Array.prototype.toBase64()`, and `Uint8Array.fromBase64()` polyfills (During rebasing of the previous patches I happened to look at the polyfills and noticed that this one could be removed now.) See: - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array/toHex#browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array/toBase64#browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array/fromBase64#browser_compatibility Note that technically this functionality can still be disabled via a preference in Firefox, however that's slated for removal in [bug 1985120](https://bugzilla.mozilla.org/show_bug.cgi?id=1985120). Looking at the Firefox source-code, see https://searchfox.org/firefox-main/search?q=array.tobase64%28%29&path=&case=false&regexp=false, you can see that it's already being used unconditionally elsewhere in the browser hence removing the polyfills ought to be fine (since toggling the preference would break other parts of the browser).	2026-01-29 17:27:43 +01:00
Jonas Jenwald	b3f35b6007	Return the `rawFilename` as-is even if it's empty, from `FileSpec.prototype.serializable` It's more correct to return the `rawFilename` as-is, and limit the fallback for empty filenames to only the `filename` property.	2026-01-29 17:25:44 +01:00
Calixte Denizet	806133379e	Refactor a bit page mapping stuff in order to be able to support delete/copy pages	2026-01-26 16:53:52 +01:00
calixteman	9f660be8a2	Use DecompressionStream in async code Usually, content stream or fonts are compressed using FlateDecode. So use the DecompressionStream API to decompress those streams in the async code path.	2026-01-25 14:22:19 +01:00
Jonas Jenwald	640a3106d5	Remove caching/shadowing from the `FileSpec` getters, and simplify the code Given that only the `FileSpec.prototype.serializable` getter is ever invoked from "outside" of the class, and only once per `FileSpec`-instance, the caching/shadowing isn't actually necessary. Furthermore the `_contentRef`-caching wasn't actually correct, since it ended up storing a `BaseStream`-instance and those should generally never be cached. (Since calling `BaseStream.prototype.getBytes()` more than once, without resetting the stream in between, will return an empty TypedArray after the first time.)	2026-01-25 13:16:29 +01:00
Jonas Jenwald	84b5866853	Reduce duplication in the `pickPlatformItem` helper function Also, tweak code/comment used when handling "GoToR" destinations.	2026-01-25 13:16:06 +01:00
calixteman	ce296d8d42	Add the possibility to order the pages in an extracted pdf (bug 1997379) or in a merged one.	2026-01-19 18:58:23 +01:00
Calixte Denizet	b5ed988267	Don't use contents stream which have an image format The original bug has been filled in mupdf bug tracker: https://bugs.ghostscript.com/show_bug.cgi?id=709033 The attached pdf can be open in Chrome but not in Acrobat.	2026-01-13 18:39:17 +01:00
calixteman	eab33828a9	Fix wasm url issue for the jbig2 decoder and add a test for jbig2 decoding with the js decoder.	2026-01-04 00:08:59 +01:00
calixteman	98c1955bd4	Use the PDFium JBig2 decoder compiled into wasm The decoder is ~4x faster than the JS decoder on large images.	2026-01-03 22:05:14 +01:00
calixteman	424c7989aa	Get glyph contours when stroking using a pattern Fix issue #20513 (second part).	2025-12-28 22:55:59 +01:00
calixteman	5518c8a544	Use CIDToGIDMap when the font is a type 2 with an OpenType font It fixes #18062.	2025-12-28 14:51:06 +01:00
Tim van der Meij	1990fa7cd0	Merge pull request #20538 from calixteman/issue13425 Fix the loca table length when there is enough space for it	2025-12-28 13:52:32 +01:00
calixteman	22932f7b68	Fix the loca table length when there is enough space for it It fixes #13425.	2025-12-28 11:21:40 +01:00
calixteman	1dffcf7f25	Remove undefStack stuff in the cff parser I think it should have been removed with #2527 so it should be useless now. Because of that stuff, some commands with a wrong number of arguments weren't stripped out (see the pdf in #13850).	2025-12-27 16:59:29 +01:00
calixteman	91033c2199	Fix the encoding for some missing chinese fonts It fixes #20489.	2025-12-23 14:05:27 +01:00
Andrii Vitiv	9677798ba0	Fix `Worker was terminated` error when loading is cancelled Fixes https://github.com/mozilla/pdf.js/issues/11595, where cancelling loading with `loadingTask.destroy()` before it finishes throws a `Worker was terminated` error that CANNOT be caught. When worker is terminated, an error is thrown here: `6c746260a9/src/core/worker.js (L374)` Then `onFailure` runs, in which we throw again via `ensureNotTerminated()`. However, this second error is never caught (and cannot be), resulting in console spam. There is no need to throw any additional errors since the termination is already reported [here](`6c746260a9/src/core/worker.js (L371-L373)`), and `onFailure` is supposed to handle errors, not throw them.	2025-12-14 18:15:10 +02:00
Tim van der Meij	d946f05841	Merge pull request #20440 from Gaurang-5/master Fix infinite loop in JBIG2 decoder with >4 referred-to segments	2025-12-09 20:42:51 +01:00
calixteman	f75812b0af	Merge pull request #20346 from ryzokuken/binary-fontpath Encode FontPath data into an ArrayBuffer	2025-12-08 13:59:23 +01:00
Tim van der Meij	de5709a7cd	Merge pull request #20454 from xiaobai2017666/russian-char Extend getGlyphMapForStandardFonts with some Russian entries (issue 20453)	2025-12-07 18:28:41 +01:00
Gaurang Bhatia	ac8d80a8e4	Fix infinite loop in JBIG2 decoder with >4 referred-to segments and add regression test	2025-12-07 06:46:16 +05:30
Ujjwal Sharma	3a85770af1	Encode FontPath data into an ArrayBuffer Serialize FontPath commands into a binary format and store it in an ArrayBuffer so that it can eventually be stored in a SharedArrayBuffer.	2025-12-06 03:00:48 +05:30
Weismann	365cc69cae	Extend getGlyphMapForStandardFonts with some Russian entries (issue 20453)	2025-12-01 10:21:27 +08:00
Calixte Denizet	516aea5562	[XFA] Set default max value in occur tag to -1 (bug 1998843)	2025-11-21 17:53:38 +01:00
calixteman	c6b61a34e6	Merge pull request #20436 from calixteman/merge_struct_trees Merge the structure trees coming from different pdfs (bug 1997379)	2025-11-17 20:10:06 +01:00
Calixte Denizet	e13a618df3	Merge the structure trees coming from different pdfs (bug 1997379)	2025-11-17 19:56:36 +01:00
Calixte Denizet	50c48cf11b	Add telemetry for tagged pdfs (bug 1997134)	2025-11-17 19:47:16 +01:00
calixteman	e7288dca8e	Merge pull request #20431 from calixteman/split_merge_p4 Add a wrapper for the new xref in order to be able to get some values from cloned dictionaries	2025-11-11 21:47:42 +01:00
Tim van der Meij	bc4d90711a	Merge pull request #20432 from calixteman/version Version entry in the catalog has to be a name and not a string	2025-11-11 20:31:59 +01:00
Calixte Denizet	a98b0b1fb5	Version entry in the catalog has to be a name and not a string	2025-11-09 15:34:57 +01:00
Calixte Denizet	65881f0e21	Add a wrapper for the new xref in order to be able to get some values from cloned dictionaries	2025-11-09 15:28:43 +01:00
Calixte Denizet	37f4712f7e	Update the named page destinations when some pdf are combined (bug 1997379) and remove link annotations pointing on a deleted page.	2025-11-07 18:22:19 +01:00
Calixte Denizet	ad97c5b816	Update the page labels tree when a pdf is extracted (bug 1997379)	2025-11-07 15:59:57 +01:00
calixteman	85ed401b82	Merge pull request #20409 from calixteman/split_merge_p1 Add the possibility to create a pdf from different ones (bug 1997379)	2025-11-07 15:05:52 +01:00
Calixte Denizet	bc87f4e8d6	Add the possibility to create a pdf from different ones (bug 1997379) For now it's just possible to create a single pdf in selecting some pages in different pdf sources. The merge is for now pretty basic (it's why it's still a WIP) none of these data are merged for now: - the struct trees - the page labels - the outlines - named destinations For there are 2 new ref tests where some new pdfs are created: one with some extracted pages and an other one (encrypted) which is just rewritten. The ref images are generated from the original pdfs in selecting the page we want and the new images are taken from the generated pdfs.	2025-11-07 14:57:48 +01:00
Calixte Denizet	04db38558a	Create the number tree for the ParentTree only one time	2025-11-05 17:49:55 +01:00
Tim van der Meij	6e7a6eb52b	Merge pull request #20408 from calixteman/fix_mml_encoding Don't set the MathML namespace for attributes in MathML tags (bug 1997343)	2025-11-01 14:58:15 +01:00

... 5 6 7 8 9 ...

3722 Commits