summaryrefslogtreecommitdiff
path: root/src
Commit message (Collapse)AuthorAgeFilesLines
* Sync tokeniser tests with html5lib.John Mark Bell2009-03-101-12/+19
| | | | | | | | Sync tokeniser implementation with the spec. Fix handling of \0 in the tag open state. The unicodeCharacters test is disabled, as json-c doesn't like it. svn path=/trunk/hubbub/; revision=6755
* Fix bug 2673645John Mark Bell2009-03-091-2/+5
| | | | | | <form><option>foo</option></form> => abort. svn path=/trunk/hubbub/; revision=6746
* Actually associate textareas with formsJohn Mark Bell2009-02-201-7/+10
| | | | svn path=/trunk/hubbub/; revision=6578
* Convert UTF-16 meta charsets to UTF-8, given we'll have auto-detected them ↵John Mark Bell2009-01-231-26/+31
| | | | | | as UTF-8 in the chardet stage. (Credit: Andi Sidwell) svn path=/trunk/hubbub/; revision=6180
* And another one. Now warning free. Huzzah.John Mark Bell2009-01-081-0/+1
| | | | svn path=/trunk/hubbub/; revision=6000
* Add documentation from a couple of missing parametersJohn Mark Bell2009-01-081-2/+4
| | | | svn path=/trunk/hubbub/; revision=5999
* And some moreJohn Mark Bell2009-01-082-3/+3
| | | | svn path=/trunk/hubbub/; revision=5998
* Squash more doxygen warnings. Don't use html tags in comments, folks.John Mark Bell2009-01-081-25/+23
| | | | svn path=/trunk/hubbub/; revision=5997
* Make doxygen produce API documentation. I guess it helps if you enable the ↵John Mark Bell2009-01-082-2/+2
| | | | | | | | right options. Fix a couple more doxygen warnings. svn path=/trunk/hubbub/; revision=5996
* Use doxygen to create API documentation.John Mark Bell2009-01-087-31/+53
| | | | | | Add a bunch of extra commentary to stop doxygen warning. svn path=/trunk/hubbub/; revision=5994
* Ok, let's actually compile the code before committing it.John Mark Bell2009-01-061-0/+1
| | | | svn path=/trunk/hubbub/; revision=5971
* Some kind of recovery from not supporting an auto-detected charset. We fall ↵John Mark Bell2009-01-062-2/+33
| | | | | | | | | | | back to Windows-1252 and hope for the best. Note that this only occurs when autodetecting. If the client has specified a charset, they get told about it immediately and get to decide what to do about it. If a meta charset is encountered after a successful autodetection, the client is informed in the usual way. As this requires the client to throw out the parser and start afresh, specifying the meta charset as the one to use, there's no problem as, again, the client will be informed immediately in that case. All of this ensures that charset autodetection is pretty well transparent as far as the client application is concerned. svn path=/trunk/hubbub/; revision=5970
* Fix potential read beyond available input data when processing \r in some ↵John Mark Bell2009-01-061-5/+5
| | | | | | | | | | states. What happened was that, given \rabc, we would advance past the \r, then read at current_offset + len (len == 1). I.E. read 'b' instead of 'a'. If the data in the inputstream's internal buffer happened to end immediately after the \r, then we'd read past the end of the buffer thanks to a bug in lpu_inputstream_peek which was fixed in r5965. In any case, we'd still be looking at the wrong character when looking for CRLF pairs. All regression tests now pass again. svn path=/trunk/hubbub/; revision=5967
* Port to changed lpu API.John Mark Bell2009-01-063-458/+637
| | | | | | | Drop HUBBUB_OOD and just use HUBBUB_NEEDDATA, instead. Currently aborts in bogus comment handling if it encounters a \r at the end of the inputstream's utf-8 buffer. svn path=/trunk/hubbub/; revision=5966
* Fix build breakageJohn Mark Bell2008-11-301-1/+3
| | | | svn path=/trunk/hubbub/; revision=5851
* lotsa C89, please check.François Revel2008-11-301-48/+91
| | | | svn path=/trunk/hubbub/; revision=5846
* Fix wrong prototype.François Revel2008-11-301-1/+1
| | | | svn path=/trunk/hubbub/; revision=5845
* C89François Revel2008-11-301-2/+5
| | | | svn path=/trunk/hubbub/; revision=5844
* C89François Revel2008-11-307-46/+58
| | | | svn path=/trunk/hubbub/; revision=5843
* Fix commentaryJohn Mark Bell2008-11-091-2/+3
| | | | svn path=/trunk/hubbub/; revision=5668
* Convert PARSERUTILS_BADENCODING into HUBBUB_BADENCODINGJohn Mark Bell2008-11-092-0/+5
| | | | svn path=/trunk/hubbub/; revision=5667
* Return errors from parser constructor/destructor. This changes the public API.John Mark Bell2008-11-091-27/+35
| | | | svn path=/trunk/hubbub/; revision=5666
* Return errors from treebuilder constructor/destructorJohn Mark Bell2008-11-093-22/+32
| | | | svn path=/trunk/hubbub/; revision=5665
* Return errors from tokeniser constructor/destructorJohn Mark Bell2008-11-093-19/+30
| | | | svn path=/trunk/hubbub/; revision=5664
* Return errors from dictionary constructor/destructor.John Mark Bell2008-11-094-22/+32
| | | | | | Fix commentary copied from libcss svn path=/trunk/hubbub/; revision=5663
* Port hubbub to new lpu APIJohn Mark Bell2008-11-082-5/+7
| | | | svn path=/trunk/hubbub/; revision=5656
* Fixup dubious charsetsJohn Mark Bell2008-10-144-3/+26
| | | | svn path=/trunk/hubbub/; revision=5575
* Report errors from libparserutils better.Andrew Sidwell2008-09-243-3/+49
| | | | svn path=/trunk/hubbub/; revision=5431
* Ensure we unref collect node if it still exists when we come to destroy the ↵John Mark Bell2008-09-181-0/+6
| | | | | | | | | treebuilder (this can occur if EOF is encountered when in generic_rcdata/script_collect modes). Note that the spec has since changed here, so we'll need to resync, anyway. This is simply to ensure the testsuite passes once more. svn path=/trunk/hubbub/; revision=5374
* Lose redundant todosJohn Mark Bell2008-09-082-5/+0
| | | | svn path=/trunk/hubbub/; revision=5290
* Squash memory leakJohn Mark Bell2008-09-081-0/+2
| | | | svn path=/trunk/hubbub/; revision=5285
* Fix build failures. Quite how these were missed before, I don't know.John Mark Bell2008-09-073-0/+11
| | | | svn path=/trunk/hubbub/; revision=5277
* Native RO buildsystem. Beware of the NOP.John Mark Bell2008-09-071-2/+2
| | | | svn path=/trunk/hubbub/; revision=5275
* Fixes for handling of CR followed immediately by multibyte sequences.John Mark Bell2008-09-061-59/+94
| | | | | | | Pedantic whitespace changes. More paranoia surrounding entity handling. svn path=/trunk/hubbub/; revision=5266
* Fix for spurious attributes on implied tbody and colgroup by takkaria.Michael Drake2008-09-061-1/+5
| | | | svn path=/trunk/hubbub/; revision=5262
* Fix segfault caused by trampling the length of the current character when ↵John Mark Bell2008-08-181-2/+8
| | | | | | | | testing whether the 4 most recently read characters in the data state are <!--. Add a couple of assertions for paranoia. svn path=/trunk/hubbub/; revision=5146
* Refactor the generic rcdata mode so it's more like the script collection ↵Andrew Sidwell2008-08-154-39/+32
| | | | | | mode, and then remove an unused variable in the treebuilder state. svn path=/trunk/hubbub/; revision=5132
* Get make test running again.Andrew Sidwell2008-08-151-1/+2
| | | | svn path=/trunk/hubbub/; revision=5125
* Do what r5107 for system ID for public IDs.Andrew Sidwell2008-08-131-14/+4
| | | | svn path=/trunk/hubbub/; revision=5108
* Another COLLECT() -> COLLECT_MS() fix.Andrew Sidwell2008-08-131-14/+4
| | | | svn path=/trunk/hubbub/; revision=5107
* Add page which crashed, and fix the bug that caused it to do so.Andrew Sidwell2008-08-131-4/+2
| | | | svn path=/trunk/hubbub/; revision=5106
* Remove the CHAR() macro, which lets make test run again.Andrew Sidwell2008-08-131-80/+74
| | | | svn path=/trunk/hubbub/; revision=5104
* Optimise COLLECT_MS() macro.Andrew Sidwell2008-08-131-5/+3
| | | | svn path=/trunk/hubbub/; revision=5099
* Fix segfault in elimination of duplicate attributes.John Mark Bell2008-08-131-7/+8
| | | | svn path=/trunk/hubbub/; revision=5098
* Optimise comment states slightly, taking advantage of the fact that buffers ↵Andrew Sidwell2008-08-131-20/+1
| | | | | | store their own length and when emitting the comment, the buffer contains the whole comment and nothing else. svn path=/trunk/hubbub/; revision=5095
* Fix tokeniser so make test passes, with possible perf hit.Andrew Sidwell2008-08-131-18/+43
| | | | svn path=/trunk/hubbub/; revision=5093
* Use COLLECT_MS() macro rather than COLLECT() in attribute values.Andrew Sidwell2008-08-131-4/+4
| | | | svn path=/trunk/hubbub/; revision=5086
* Fix script collection now that pointers can change between calls.Andrew Sidwell2008-08-131-34/+30
| | | | svn path=/trunk/hubbub/; revision=5082
* Sanity checking for string dataJohn Mark Bell2008-08-131-0/+39
| | | | svn path=/trunk/hubbub/; revision=5080
* Remember to clear the self-closing flag when emitting a tag token.Andrew Sidwell2008-08-111-0/+3
| | | | svn path=/trunk/hubbub/; revision=5030