|
| 1 | +# Good First Projects |
| 2 | + |
| 3 | +If any of these sound interesting, join #dev on |
| 4 | +[Discord](https://discord.multiprocess.io) and say hi! |
| 5 | + |
| 6 | +The first thing you'll be asked to do is go through one or two of the |
| 7 | +[tutorials on DataStation](https://datastation.multiprocess.io/docs/), |
| 8 | +and [try out dsq](https://github.com/multiprocessio/dsq). |
| 9 | + |
| 10 | +You'll need to have this little bit of experience using DataStation |
| 11 | +and dsq for these tasks to make sense. |
| 12 | + |
| 13 | +## Easy |
| 14 | + |
| 15 | +* Add a new supported file type |
| 16 | + * Example: Messagepack, BSON, CBOR, UBJSON, XML, Yaml, Protobuf, Avro? HDF5? |
| 17 | +* Test out INT96 support in Parquet, add conversion to timestamp if necessary |
| 18 | +* Build dsq, fakegen for more/every os/arch |
| 19 | +* Preparation for optimized internal representation of data |
| 20 | + * Do read/write benchmarks among MessagePack/BSON/Protobuf/Avro |
| 21 | + * Make sure there’s a library for every language |
| 22 | + * Figure out how to embed the library inside DataStation |
| 23 | +* More databases |
| 24 | + * IBM DB2, Neo4j, Apache Presto/Trino, Meilisearch, Apache Hive, Apache Druid, Apache Pinot |
| 25 | +* Add a new supported log format |
| 26 | + * Example: logfmt |
| 27 | +* Fakegen help text |
| 28 | + |
| 29 | +## Medium |
| 30 | + |
| 31 | +* Add caching to dsq |
| 32 | +* Autocomplete support in the UI |
| 33 | +* HTTP Range support for faster downloads |
| 34 | +* Add support for FTP |
| 35 | +* New usql cli (run queries against all DataStation supported databases in a single CLI) |
| 36 | +* ODBC/JDBC support |
| 37 | +* Research/benchmarks and associated blog posts |
| 38 | + * SQLite vs Go port of SQLite |
| 39 | + * SQLite vs DuckDB |
| 40 | + * JDBC vs native protocols |
| 41 | + * Regex vs lexer |
| 42 | + * Copying into SQLite vs vtable |
| 43 | + * Evaluate SQL parser libraries in JavaScript |
| 44 | + * Fastest way to launch a virtual machine (gvisor, firecracker, etc) |
0 commit comments