BTW, here is a fun exercise that takes this idea to the extreme: who can build a custom file format that gets the best ClickHouse performance (on DataFusion)?
https://github.com/apache/datafusion/issues/13448
Disclaimer: I am on the PMC of Apache DataFusion, so I am totally a fanboy.