Rise of micro tools

The last decade was a time when “big data” tools seemed to come out every day. Benchmarks were released boasting high throughput and low latency numbers. Little attention was paid to the cost of running these systems or the expertise required to operate them. Running them was tricky, and there were no tools to manage them effectively. Our Spark cluster would go down with no warning. Every morning we would come to the office to find that the Spark cluster was dead and no jobs had run the previous night. SaaS vendors and hyperscalers came to the rescue: don’t worry about operating and maintenance costs, but don’t you dare leave us. :) Often these services come with vendor lock-in.

Now we are seeing some good tools being developed to handle “big data” right from your good old laptop. Here we are going to take a look at some of these tools.

DuckDB: SQLite’s OLAP cousin

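DuckDB has everything that made SQLite the most used database in the world: it runs in-process, needs no server, and stores a database in a single file. The difference is that it is built for analytical (OLAP) queries.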

DuckDB GitHub stars over the past years

GGML: Run LLMs locally

After the SSD paper and model were released, the YOLO model was released and became an instant hit among computer vision enthusiasts. While SSD could only run on top of large deep learning frameworks (TF, PyTorch), Darknet YOLO came with no dependencies, could be compiled with a plain C compiler, and could run on a CPU-only machine.

The same thing happened when the Whisper and LLaMA model weights were released by OpenAI and Meta, respectively. A little-known developer from Bulgaria, Georgi Gerganov, released llama.cpp and whisper.cpp. Since then, they have written wrappers around many other models, such as BioGPT and YOLO. Both build on GGML, his low-level C/C++ tensor library, which also serves as a foundation for Ollama.

Kùzu: SQLite’s graph database cousin

Kùzu is developed and maintained by Semih Salihoğlu, a professor at the University of Waterloo. He has now incorporated a company, KuzuDB Inc., with his students, and their team is powering its development. It uses Cypher (the same query language used by Neo4j) to query the database. The database is embeddable and runs inside the application process. See the video below to get an overview of what Kùzu has to offer.