|
| 1 | +The latest release of `pdf-lib` (`v1.0.0`) includes several breaking API changes. If you have code written for older versions of `pdf-lib` (`v0.x.x`), you can use the following instructions to help migrate your code to v1.0.0. |
| 2 | + |
| 3 | +Note that many of the API methods are now asynchronous and return promises, so you'll need to `await` on them (or use promise chaining: `.then(res => ...)`). |
| 4 | + |
| 5 | +- Rename _`PDFDocumentFactory`_ to **`PDFDocument`**. `PDFDocument.create` and `PDFDocument.load` are now async (they return promises), so you'll need to `await` on them. |
| 6 | + |
| 7 | +* To create a new PDF document: |
| 8 | + |
| 9 | + ```js |
| 10 | + const pdfDoc = await PDFDocument.create(); |
| 11 | + ``` |
| 12 | + |
| 13 | +* To retrieve and load a PDF where `pdfUrl` points to the PDF to be loaded: |
| 14 | + ```js |
| 15 | + const pdfBuffer = await fetch(pdfUrl).then((res) => res.arrayBuffer()); |
| 16 | + const pdfDoc = await PDFDocument.load(pdfBuffer); |
| 17 | + ``` |
| 18 | + |
| 19 | +- The purpose of making these methods asynchronous is to avoid blocking the event loop (especially for browser-based usage). If you aren't running this code client-side and are not concerned about blocking the event loop, you can speed up parsing times with: |
| 20 | + |
| 21 | + ```js |
| 22 | + PDFDocument.load(..., { parseSpeed: ParseSpeeds.Fastest }) |
| 23 | + ``` |
| 24 | + |
| 25 | + You can do a similar thing for save: |
| 26 | + |
| 27 | + ```js |
| 28 | + PDFDocument.save({ objectsPerTick: Infinity }); |
| 29 | + ``` |
| 30 | + |
| 31 | +- To draw content on a page in old versions of `pdf-lib`, you needed to create a content stream, invoke some operators, register the content stream, and add it to the document. Something like the following: |
| 32 | + |
| 33 | + ```js |
| 34 | + const contentStream = pdfDoc.createContentStream( |
| 35 | + drawText( |
| 36 | + timesRomanFont.encodeText('Creating PDFs in JavaScript is awesome!'), |
| 37 | + { |
| 38 | + x: 50, |
| 39 | + y: 450, |
| 40 | + size: 15, |
| 41 | + font: 'TimesRoman', |
| 42 | + colorRgb: [0, 0.53, 0.71], |
| 43 | + }, |
| 44 | + ), |
| 45 | + ); |
| 46 | + page.addContentStreams(pdfDoc.register(contentStream)); |
| 47 | + ``` |
| 48 | + |
| 49 | + However, in new versions of `pdf-lib`, this is much simpler. You simply invoke drawing methods on the page, such as [`PDFPage.drawText`](https://pdf-lib.js.org/docs/api/classes/pdfpage#drawtext), [`PDFPage.drawImage`](https://pdf-lib.js.org/docs/api/classes/pdfpage#drawimage), [`PDFPage.drawRectangle`](https://pdf-lib.js.org/docs/api/classes/pdfpage#drawrectangle), or [`PDFPage.drawSvgPath`](https://pdf-lib.js.org/docs/api/classes/pdfpage#drawsvgpath). So the above example becomes: |
| 50 | + |
| 51 | + ```js |
| 52 | + page.drawText('Creating PDFs in JavaScript is awesome!', { |
| 53 | + x: 50, |
| 54 | + y: 450, |
| 55 | + size: 15, |
| 56 | + font: timesRomanFont, |
| 57 | + color: rgb(0, 0.53, 0.71), |
| 58 | + }); |
| 59 | + ``` |
| 60 | + |
| 61 | + Please see the [Usage Examples](#usage-examples) for more in depth examples of drawing content on a page in the new versions of `pdf-lib`. You may also find the [Complete Examples](#complete-examples) to be a useful reference. |
| 62 | + |
| 63 | +- Change _`getMaybe`_ function calls to **`get`** calls. If a property doesn't exist, then `undefined` will be returned. Note, however, that PDF name strings with need to be wrapped in `PDFName.of(...)`. For example, to look up the AcroForm object you'll need to change _`pdfDoc.catalog.getMaybe('AcroForm')`_ to **`pdfDoc.catalog.get(PDFName.of('AcroForm'))`**. |
| 64 | + |
| 65 | + ```js |
| 66 | + const acroForm = await pdfDoc.context.lookup( |
| 67 | + pdfDoc.catalog.get(PDFName.of('AcroForm')), |
| 68 | + ); |
| 69 | + ``` |
| 70 | + |
| 71 | + > v0.x.x converted the strings passed to `get` and `getMaybe` to `PDFName` objects, but v1.0.0 does not do this conversion for you. So you must always pass actual `PDFName` objects instead of strings. |
| 72 | +
|
| 73 | +- To find the AcroForm field references now becomes: |
| 74 | + ```js |
| 75 | + const acroFieldRefs = await pdfDoc.context.lookup( |
| 76 | + acroForm.get(PDFName.of('Fields')), |
| 77 | + ); |
| 78 | + ``` |
| 79 | +- To add a new page replace _`pdfDoc.createPage([width, height])`_ with **`pdfDoc.addPage([width, height])`** |
| 80 | + ```js |
| 81 | + const page = pdfDoc.addPage([500, 750]); |
| 82 | + ``` |
| 83 | + or simply: |
| 84 | + ```js |
| 85 | + const page = pdfDoc.addPage(); |
| 86 | + ``` |
| 87 | + |
| 88 | +* To get the size of the page: |
| 89 | + |
| 90 | + ```js |
| 91 | + const { width, height } = page.getSize(); |
| 92 | + page.getWidth(); |
| 93 | + page.getHeight(); |
| 94 | + ``` |
| 95 | + |
| 96 | +* To add images replace _`pdfDoc.embedPNG`_ with **`pdfDoc.embedPng`** and _`pdfDoc.embedJPG`_ with **`pdfDoc.embedJpg`** |
| 97 | + |
| 98 | +* The `pdfDoc.embedPng` and `pdfDoc.embedJpg` methods now return `PDFImage` objects which have the width and height of the image as properties. You can also scale down the width and height by a constant factor using the `PDFImage.scale` method: |
| 99 | + ```js |
| 100 | + const aBigImage = await pdfDoc.embedPng(aBigImageBytes); |
| 101 | + const { width, height } = aBigImage.scale(0.25); |
| 102 | + ``` |
| 103 | + So, `const [image, dims] = pdfDoc.embedJPG(mediaBuffer)` becomes: |
| 104 | + ```js |
| 105 | + const image = await pdfDoc.embedJpg(mediaBuffer); |
| 106 | + // image.width, image.height can be used instead of the dims object. |
| 107 | + ``` |
| 108 | +* To save the PDF replace _`PDFDocumentWriter.saveToBytes(pdfDoc)`_ with **`pdfDoc.save()`** |
| 109 | + |
| 110 | + ```js |
| 111 | + const pdfDocBytes = await pdfDoc.save(); |
| 112 | + ``` |
| 113 | + |
| 114 | +* To display the saved PDF now becomes: |
| 115 | + |
| 116 | + ```js |
| 117 | + const pdfUrl = URL.createObjectURL( |
| 118 | + new Blob([await pdfDoc.save()], { type: 'application/pdf' }), |
| 119 | + ); |
| 120 | + window.open(pdfUrl, '_blank'); |
| 121 | + ``` |
| 122 | + |
| 123 | + (note: `URL.revokeObjectURL` should be called later to free up memory) |
| 124 | + |
| 125 | +* To get the PDF page count: |
| 126 | + |
| 127 | + ```js |
| 128 | + pdfDoc.getPages().length; |
| 129 | + ``` |
| 130 | + |
| 131 | +* To copy pages from one document to another you must now call **`destPdf.copyPages(srcPdf, srcPageIndexesArray)`** to copy pages. You can see an example of this in the [Copy Pages](#copy-pages) usage example. Admittedly, this API is slightly less ergonomic than what exists in v0.x.x, but it has two key benefits: |
| 132 | + |
| 133 | + 1. It avoids making PDFDocument.addPage and PDFDocument.insertPage async. |
| 134 | + When copying multiple pages from the source document, the resulting merged document should have a smaller file size. This is because the page copying API that exists in v0.x.x was intended for copying just one or two pages. |
| 135 | + |
| 136 | + 2. When copying large numbers of pages, it could result in redundant objects being created. This new page copying API should eliminate that. |
| 137 | + |
| 138 | + ```js |
| 139 | + async function mergePdfs(pdfsToMerge: string[]) { |
| 140 | + const mergedPdf = await PDFDocument.create(); |
| 141 | + for (const pdfCopyDoc of pdfsToMerge) { |
| 142 | + const pdfBytes = fs.readFileSync(pdfCopyDoc); |
| 143 | + const pdf = await PDFDocument.load(pdfBytes); |
| 144 | + const copiedPages = await mergedPdf.copyPages(pdf, pdf.getPageIndices()); |
| 145 | + copiedPages.forEach((page) => { |
| 146 | + mergedPdf.addPage(page); |
| 147 | + }); |
| 148 | + } |
| 149 | + const mergedPdfFile = await mergedPdf.save(); |
| 150 | + return mergedPdfFile; |
| 151 | + } |
| 152 | + ``` |
| 153 | + |
| 154 | +* If required, you can retrieve the CropBox or MediaBox of a page like so: |
| 155 | + ```js |
| 156 | + const cropBox = page.node.CropBox() || page.node.MediaBox(); |
| 157 | + ``` |
0 commit comments