artemis.tools.tdigesttool

The purpose of this tool will be to add a distribution generator tool to the artemis toolset Used for validation purpose

Module Contents

class artemis.tools.tdigesttool.TDigestToolOptions
class artemis.tools.tdigesttool.TDigestTool(name, **kwargs)

Bases: artemis.core.tool.ToolBase

initialize(self)
execute(self, record_batch)

This tool will read in a py arrow record batch or a read from a csv file and run a t-digest analysis in a distributed fashion

Parameters
  • file_name (the name of the file that will) –

  • read from and digest created, the file (be) –

  • be assumed to be in the 'examples/ (will) –

Returns

  • None

  • Although this returns none for now, it will be possible

  • to return a pyarrow record bactch or simmilar data structure

  • as it goes to a record batch and then writes directly to a csv file