XML Tokenizer

XML Tokenizer is a low-memory high performance non-namespace parser library for parsing simple XML 1.0. This is an alternative option to the standard library's xml when speed is your main concern and you are willing to sacrifice certain features, such as handling the namespace, in favor of speed (discussion). This may not cover all XML files, but it can cover typical XML files.

Motivation

Go provides a standard library for XML parsing, however, I've found it to be slow for my use case. I work with a lot of GPX files in my personal project to retrieve my workouts data; GPX is an XML-based file format. When parsing my 14MB GPX file containing 208km ride using the standard library's xml, it takes roughly 600ms which is super slow and it needs 2.8mil alloc!. I need an alternative library for parsing XML that's faster than standard library's xml, suitable for typical XML parsing tasks and no code should be made unsafe.

Usage

Please see USAGE.md.

Benchmark

goos: darwin; goarch: amd64; pkg: xmltokenizer
cpu: Intel(R) Core(TM) i5-5257U CPU @ 2.70GHz
Benchmark/stdlib.xml:"ride_sembalun.gpx"-4    2  605913816 ns/op  110562568 B/op  2806823 allocs/op
Benchmark/xmltokenizer:"ride_sembalun.gpx"-4  8  141616068 ns/op   17143609 B/op       85 allocs/op

Approx. 4 times faster!

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.github		.github
docs		docs
internal		internal
testdata		testdata
CONTRIBUTING.md		CONTRIBUTING.md
LICENCE		LICENCE
README.md		README.md
benchmark_test.go		benchmark_test.go
codecov.yml		codecov.yml
go.mod		go.mod
go.sum		go.sum
token.go		token.go
token_test.go		token_test.go
tokenizer.go		tokenizer.go
tokenizer_internal_test.go		tokenizer_internal_test.go
tokenizer_test.go		tokenizer_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XML Tokenizer

Motivation

Usage

Benchmark

About

Releases

Packages

Languages

License

dolmen-go/muktihari-xmltokenizer.fork

Folders and files

Latest commit

History

Repository files navigation

XML Tokenizer

Motivation

Usage

Benchmark

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages