33
GitHub - stanford-futuredata/sparser: An implementation of raw filtering....
source link: https://github.com/stanford-futuredata/sparser
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
README.md
sparser
This code base implements Sparser, raw filtering for faster analytics over raw data. Sparser can parse JSON, Avro, and Parquet data up to 22x faster than the state of the art. For more details, check out our paper published at VLDB 2018.
See the demo-repl
directory for a brief example. To run it:
- Build
json/rapidjson
(see the instructions there on how to do that) make
in thedemo-repl
directory.
Sparser itself is just a header file and only depends on standard C libraries available on most systems.
Recommend
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK