We recently released ploomber, a workflow management tool to accelerate DS/ML pipeline development. Check it out!

Leveraging parquet's metadata to self-document data files

Eduardo Blancas.