README.md: 20 additions & 6 deletions
@@ -1,16 +1,27 @@
# Htmlunit-NekoHtml Parser
+[Maven Central](https://central.sonatype.com/artifact/org.htmlunit/neko-htmlunit)
The **Htmlunit-NekoHtml** Parser is an HTML scanner and tag balancer that enables application programmers to parse HTML documents
and access the information using standard XML interfaces.
The parser can scan HTML files and "fix up" many common mistakes that human (and computer) authors make in writing HTML documents.
NekoHTML adds missing parent elements; automatically closes elements with optional end tags; and can handle mismatched inline element tags.
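
The fix-ups described above can be illustrated with a toy tag balancer. This is a minimal sketch using only the JDK, not NekoHTML's actual implementation: the class name `TagBalancerSketch`, the `CLOSES` rule table, and the attribute-free tokenizer are all assumptions made for illustration, and real HTML parsing rules are far richer.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class TagBalancerSketch {

    // Toy rule table (an assumption for this sketch): the start tag on the left
    // implicitly closes a currently open element listed on the right.
    private static final Map<String, Set<String>> CLOSES = new HashMap<>();
    static {
        CLOSES.put("li", new HashSet<>(Arrays.asList("li")));
        CLOSES.put("p", new HashSet<>(Arrays.asList("p")));
        CLOSES.put("td", new HashSet<>(Arrays.asList("td", "th")));
        CLOSES.put("tr", new HashSet<>(Arrays.asList("tr", "td", "th")));
    }

    // Matches an attribute-free start or end tag, or a run of text.
    private static final Pattern TOKEN =
            Pattern.compile("<(/?)([A-Za-z][A-Za-z0-9]*)>|[^<]+");

    static String balance(String html) {
        StringBuilder out = new StringBuilder();
        Deque<String> open = new ArrayDeque<>(); // stack of open element names
        Matcher m = TOKEN.matcher(html);
        while (m.find()) {
            if (m.group(2) == null) {            // text run: copy through unchanged
                out.append(m.group());
                continue;
            }
            String name = m.group(2).toLowerCase(Locale.ROOT);
            if (m.group(1).isEmpty()) {          // start tag
                Set<String> closes = CLOSES.get(name);
                // Close open elements whose end tag is implied by this start tag.
                while (closes != null && !open.isEmpty() && closes.contains(open.peek())) {
                    out.append("</").append(open.pop()).append('>');
                }
                open.push(name);
                out.append('<').append(name).append('>');
            } else if (open.contains(name)) {    // end tag with a matching open element
                String popped;
                do {                             // close everything nested inside it first
                    popped = open.pop();
                    out.append("</").append(popped).append('>');
                } while (!popped.equals(name));
            }
            // An end tag with no matching open element is silently dropped.
        }
        while (!open.isEmpty()) {                // close whatever is still open at EOF
            out.append("</").append(open.pop()).append('>');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // prints <ul><li>one</li><li>two</li></ul>
        System.out.println(balance("<ul><li>one<li>two</ul>"));
    }
}
```

The stack of open elements is what lets the sketch both insert implied end tags (`<li>` closing a previous `<li>`) and repair mismatched inline tags such as `<b><i>x</b></i>`, which it rewrites to `<b><i>x</i></b>`. Attributes, comments, and missing parent elements are deliberately out of scope here.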
-The **Htmlunit-NekoHtml** Parser has no external dependencies at all, requires Java 8 and works also on Android.
-The **Htmlunit-NekoHtml** Parser is used by Htmlunit.
+## Key Features

-[Maven Central](https://central.sonatype.com/artifact/org.htmlunit/neko-htmlunit)
+✅ **No External Dependencies** - Pure Java implementation
+✅ **Java 8+ Compatible** - Works with Java 8, 11, 17, 21 and beyond
+✅ **Android Support** - Runs on Android platforms
+✅ **Error Tolerant** - Handles malformed HTML gracefully
+✅ **Standards Compliant** - Follows HTML parsing specifications
[CVE-2022-29546](https://nvd.nist.gov/vuln/detail/CVE-2022-29546): Fixed in versions 2.61.0+
The Htmlunit-NekoHtml Parser is affected by a denial-of-service vulnerability in versions 2.60.0 and earlier. Specially crafted input involving processing instructions can cause excessive heap memory consumption during parsing.