Released
- Improve type hints
Released
- Feat: Add parse_fragment() and create_tag()
- Add missing typing for Node.insert_child()
- Add Node.parser to access the HTMLParser to which the node belongs
Released
- Add Node.insert_child method to lexbor and modest backends
Released
- Add Python 3.13 wheels
- Update lexbor
Released
- *Breaking change*: lexbor backend now includes the root node when querying CSS selectors. Same as Modest backend.
- Fix css_matches and any_css_matches methods for Modest backend on some compilers
Released
- Fixup for 0.3.19 release
- Fix tag order for lexbor backend
Released
- Increase maximum HTML size to 2.4GB
Released
- Fix memory leak when using CSS selectors, lexbor backend
Released
- Update lexbor
- Add Python 3.12 wheels
Released
- Make HTML nodes hashable
- Pin Cython version
Released
- Improve typing. Thanks to @nesb1
Released
- Fix memory leak for lexbor backend
Released
- Update lexbor
Released
- Update lexbor
- Add Python 3.11 wheels
Released
- Fix out-of-bounds bug for
merge_text_nodes
method.
Released
This release does not contain any changes. Due to a typo in the version number (#70), we need to make a new release.
Released
- Remove trailing separator when using
text(deep=True, separator='x')
. - Add a new
merge_text_nodes
method for Modest backend.
Released
- Fix incorrect text handling when using
text(deep=True)
on a text node.
Released
- Fix return type of HTMLParser.tags
Released
- Improve text handling
- Add binary builds for Python 3.10 and ARM on MacOS and Linux
Released
- Add type annotations
Released
- Fix
HTMLParser.html
Released
- Use document for the
HTMLParser.html
,LexborHTMLParser.html
root properties
Released
- Fix
selector
method for lexbor - Improve text extraction for lexbor
Released
- Fix
setup.py
for Windows
Released
- Added
lexbor
backend - Fix cloning for Modest backend
Released
- Added advanced Selector (the
select
method) - Improved speed of
strip_tags
- Added
clone
method for theHtmlParser
object - Exposed
detect_encoding
,decode_errors
,use_meta_tags
,raw_html
attributes forHtmlParser
- Added
sget
method to theattrs
property
Released
- Don't throw exception when encoding text as UTF-8 bytes fails (#40).
- Fix Node.attrs.items() causes (#39).
Released
- Build wheels Apple Silicon
Released
- Fix strip argument is ignored for the root node (#35).
- Fix CSS parser hangs on a bad CSS selector (#36).
Released
- Fix root node property (#32 ). The root property now points to the html tag.
Released
- Fix README for PyPI
Released
- Add wheels for Python 3.9
Released
- Add raw_value attribute for Node objects (#22 )
- Improve node modification operations
Released
- Fix dependency on the source Node when inserting to or modifying destination Node
Released
- Allow to pass Node instances to replace_with, insert_before and insert_after methods
- Added insert_before and insert_after methods
Released
- Set maximum input size to 80MB
- Update modest
Released
- Rebuild PyPi wheels to support Python 3.8 and manylinux2010
Released
- Fix node comparison
Released
- Add optional include_text parameter for the iter and traverse methods
Released