Releases: py-pdf/pypdf
Releases · py-pdf/pypdf
Version 2.12.0, 2022-12-10
What's Changed
Version 2.12.0, 2022-12-10
New Features (ENH)
- Add support to extract gray scale images (#1460) by @joeywang4
- Make PdfReader.get_object accept integer arguments (#1459) by @pubpub-zz
- Add 'threads' property to PdfWriter (#1458) by @pubpub-zz
- Add 'open_destination' property to PdfWriter (#1431) by @pubpub-zz
Bug Fixes (BUG)
- Scale PDF annotations (#1479) by @joshhendo
Robustness (ROB)
- Padding issue with AES encryption (#1469)
- Accept empty object as null objects (#1477) by @pubpub-zz
Documentation (DOC)
- Add module documentation the PaperSize class (#1447) by @MagnumBarrage
Maintenance (MAINT)
- Use 'page_number' instead of 'pagenum' (#1365)
- Add List of pages to PageRangeSpec (#1456) by @pubpub-zz
Testing (TST)
- Cleanup temporary files (#1454) by @pubpub-zz
- Mark test_tounicode_is_identity as external (#1449) by @heirecka
- Use Ubuntu 20.04 for running CI test suite (#1452) by @MasterOdin
New Contributors
- @heirecka made their first contribution in #1449
- @MagnumBarrage made their first contribution in #1447
- @joeywang4 made their first contribution in #1460
- @joshhendo made their first contribution in #1479
Full Changelog: 2.11.2...2.12.0
Version 2.11.2, 2022-11-20
What's Changed
New Features (ENH)
- Add remove_from_tree (#1432) by @pubpub-zz
- Add AnnotationBuilder.rectangle (#1388)
Bug Fixes (BUG)
- JavaScript executed twice (#1439) by @pubpub-zz
- ToUnicode stores /Identity-H instead of stream (#1433) by @pubpub-zz
- Declare Pillow as optional dependency (#1392)
Developer Experience (DEV)
- Link 'Full Changelog' automatically
- Modify read_string_from_stream to a benchmark (#1415)
- Improve error reporting of read_object (#1412) by @pubpub-zz
- Test Python 3.11 (#1404)
- Extend Flake8 ignore list (#1410)
- Use correct pytest markers (#1407)
- Move project configuration to pyproject.toml (#1382) by @singingwolfboy
Documentation (DOC)
- Fix typos in installation.md by @amyreyespdx in #1419
- Typos in PDF format documentation by @pavlidvg in #1438
New Contributors
- @singingwolfboy made their first contribution in #1391
- @amyreyespdx made their first contribution in #1419
- @pavlidvg made their first contribution in #1438
Full Changelog: 2.11.1...2.11.2
Version 2.11.1, 2022-10-09
What's Changed
Bug Fixes (BUG)
- td matrix (#1373) by @srogmann
- Cope with cmap from #1322 (#1372) by @pubpub-zz
Robustness (ROB)
- Cope with str returned from get_data in cmap (#1380) by @pubpub-zz
Documentation (DOC)
- Remove watermark PageObject declaration as it is already present inside for-loop (#1384) by @cs2sandeep
New Contributors
- @cs2sandeep made their first contribution in #1384
Full Changelog: 2.11.0...2.11.1
Version 2.11.0, 2022-09-25
What's Changed
New Features (ENH):
- Addition of optional visitor-functions in extract_text() (#1252) by @srogmann
- Add PageObject.images attribute (#1330) by @MartinThoma
- Add metadata.creation_date and modification_date (#1364) by @MartinThoma
Bug Fixes (BUG):
- Lookup index in _xobj_to_image can be ByteStringObject (#1366)
- 'IndexError: index out of range' when using extract_text (#1361)
- Errors in transfer_rotation_to_content() (#1356) by @pubpub-zz
Robustness (ROB):
- Ensure update_page_form_field_values does not fail if no fields (#1346) by @pubpub-zz
Testing (TST):
- read_string_from_stream performance (#1355) by ### @mergezalot
New Contributors
Full Changelog: 2.10.9...2.11.0
2.10.9
What's Changed
New Features (ENH)
- Add rotation property and transfer_rotate_to_content (#1348) by @pubpub-zz
Performance Improvements (PI)
- Avoid string concatenation with large embedded base64-encoded images (#1350) by @mergezalot
Bug Fixes (BUG)
- Format floats using their intrinsic decimal precision (#1267) by @programmarchy
Robustness (ROB)
- Fix merge_page for pages without resources (#1349) by @pubpub-zz
New Contributors
- @mergezalot made their first contribution in #1350
- @programmarchy made their first contribution in #1267
Full Changelog: 2.10.8...2.10.9
Version 2.10.8, 2022-09-14
What's Changed
- ROB: Improve NameObject reading/writing by @pubpub-zz in #1345
- ENH: Add PageObject.user_unit property by @MartinThoma in #1336
Full Changelog: 2.10.7...2.10.8
Version 2.10.7, 2022-09-11
What's Changed
Bug Fixes (BUG)
- Fix Error in transformations (#1341) by @pubpub-zz
- Decode #23 in NameObject (#1342) by @pubpub-zz
Testing (TST)
New Contributors
Full Changelog: 2.10.6...2.10.7
2.10.6
What's Changed
Two robustness issues were fixed by @pubpub-zz - thank you 🙏
The infinite loop issue might also be a security concern, depending on how you use PyPDF2.
Robustness (ROB):
- Fix infinite loop due to Invalid object (#1331)
- Fix image extraction issue with superfluous whitespaces (#1327)
Full Changelog: 2.10.5...2.10.6
Version 1.28.6, 2022-09-08
This is a bugfix for the old 1.x branch of PyPDF2 that still supports Python 2. Please try to update to the latest PyPDF2 > 2.0.0
version to get way better text extraction, support for modern encryption, and much more.
What's Changed
New Contributors
Full Changelog: 1.28.5...1.28.6
Version 2.10.5, 2022-09-04
What's Changed
New Features (ENH)
- Process XRefStm (#1297)
- Auto-detect RTL for text extraction (#1309) by @pubpub-zz
Bug Fixes (BUG)
- Avoid scaling cropbox twice (#1314) by @yegorLitvinov
Robustness (ROB)
- Fix offset correction in revised PDF (#1318) by @pubpub-zz
- Crop data of /U and /O in encryption dictionary to 48 bytes (#1317) by @exiledkingcc
- MultiLine bfrange in cmap (#1299) by @pubpub-zz
- Cope with 2 digit codes in bfchar (#1310) by @pubpub-zz
- Accept '/annn' charset as ASCII code (#1316) by @pubpub-zz
- Log errors during Float / NumberObject initialization (#1315) by @pubpub-zz
- Cope with corrupted entries in xref table (#1300) by @pubpub-zz
Documentation (DOC)
- Migration guide (PyPDF2 1.x ➔ 2.x) (#1324)
- Creating a coverage report (#1319)
- Fix AnnotationBuilder.free_text example (#1311)
- Fix usage of page.scale by replacing it with page.scale_by (#1313) by @yegorLitvinov
Developer Experience (DEV)
- Only run coverage for PyPDF2
Maintenance (MAINT)
- PdfReaderProtocol (#1303)
- Throw PdfReadError if Trailer can't be read (#1298) by @ediamondscience
- Remove catching OverflowException (#1302)
Testing (TST)
- Catch Exception for sample-files repo (#1307)
New Contributors
- @ediamondscience made their first contribution in #1298
- @yegorLitvinov made their first contribution in #1313
- @markdlevy made their first contribution in #1311
Full Changelog: 2.10.4...2.10.5