Skip to content

Cirrus Minor Posts

pandas on spark apply_batch/transform_batch broken? (tl;dr; No – but it isn’t well documented)

Using pypark’s pandas integration via apply_batch and transform_batch is very powerful but lacking documentation can cause hard to trace bugs – hopefully my experience (below)…