Page MenuHomeSolus

Update python-beautifulsoup4 to 4.10.0
ClosedPublic

Authored by Staudey on Jan 23 2022, 12:41 AM.
Tags
None
Referenced Files
F10939795: D12745.id31058.diff
Fri, Jul 14, 12:36 AM
F10938411: D12745.id31045.diff
Thu, Jul 13, 2:07 PM
F10888956: D12745.id31045.diff
Jun 27 2023, 11:02 AM
F10888062: D12745.id31058.diff
Jun 26 2023, 8:37 PM
F10883989: D12745.diff
Jun 24 2023, 5:40 PM
Subscribers

Details

Summary

Update python-beautifulsoup4 to 4.10.0

Changes:

  • The behavior of methods like .get_text() and .strings now differs depending on the type of tag
  • NavigableString and its subclasses now implement the get_text() method, as well as the properties .strings and .stripped_strings
  • The 'html5' formatter now treats attributes whose values are the empty string as HTML boolean attributes
  • The 'replace_with()' method now takes a variable number of arguments, and can be used to replace a single element with a sequence of elements
  • Corrected output when the namespace prefix associated with a namespaced attribute is the empty string, as opposed to None
  • Performance improvement when processing tags that speeds up overall tree construction by 2%
  • Corrected the use of special string container classes in cases when a single tag may contain strings with different containers
  • The html.parser tree builder can now handle named entities found in the HTML5 spec in much the same way that the html5lib tree builder does
  • Added a second way to pass specify encodings to UnicodeDammit and EncodingDetector, based on the order of precedence defined in the HTML5 spec
  • Improve the warning issued when a directory name (as opposed to the name of a regular file) is passed as markup into the BeautifulSoup constructor

Detailed changelog available here

Test Plan

Ran the examples from the Beautiful Soup Documentation

Diff Detail

Repository
R2558 python-beautifulsoup4
Lint
Lint Not Applicable
Unit
Tests Not Applicable