data_scout.connectors.csv

Module Contents

Classes

CSV

Read data from a CSV file.

class data_scout.connectors.csv.CSV(arguments)

Bases: data_scout.connectors.connector.Connector

Read data from a CSV file.

TMP_SINK = False
MAX_SIZE = 2000000
MAX_ROWS = 200
fields
__call__(self, sample: bool = False, sampling_technique: str = 'top', column_types: bool = False)List[dict]

This class is called when the data needs to be loaded.

Arguments:
type sample

boolean: Whether to take a sample or not

type sampling_technique

str: Which sampling technique to use (top, stratisfied, random)

Returns:

dict – The row, including the extra output column