tfaip.data
DataBase
Module that defines DataBase
- tfaip.data.data.dict_to_input_layers(d: Dict[str, tensorflow.python.framework.tensor_spec.TensorSpec]) Dict[str, keras.engine.input_layer.Input]
- tfaip.data.data.validate_specs(func)
- class tfaip.data.data.DataBase(params: tfaip.data.data.TDP, **kwargs)
Bases:
Generic[tfaip.data.data.TDP],abc.ABCDataBase class to provide training and validation data.
Override _input_layer_specs, and _output_layer_specs in a custom implementation
- classmethod params_cls() Type[tfaip.data.data.TDP]
- classmethod default_params() tfaip.data.data.TDP
- classmethod data_pipeline_cls() Type[tfaip.data.pipeline.datapipeline.DataPipeline]
DataPipeline for this Data.
Overwrite this and return a custom DataPipeline if you need to modify the tf.data.Dataset (see TFDatasetGenerator).
- __init__(params: tfaip.data.data.TDP, **kwargs)
- preload(progress_bar=True)
- print_params()
- property params: tfaip.data.data.TDP
- padding_values() Dict[str, Union[numpy.ndarray, int, numpy.int8, numpy.int16, numpy.int32, numpy.int64, float, numpy.float16, numpy.float32, numpy.float64, bool]]
- element_length_fn() Callable[[Dict[str, Union[tensorflow.python.framework.ops.Tensor, tensorflow.python.keras.engine.keras_tensor.KerasTensor, keras.engine.keras_tensor.KerasTensor]]], Union[tensorflow.python.framework.ops.Tensor, tensorflow.python.keras.engine.keras_tensor.KerasTensor, keras.engine.keras_tensor.KerasTensor]]
Element length for bucked_by_sequence_length
- create_pipeline(pipeline_params: tfaip.data.databaseparams.DataPipelineParams, params: tfaip.data.databaseparams.DataGeneratorParams) tfaip.data.pipeline.datapipeline.DataPipeline
- get_or_create_pipeline(pipeline_params: tfaip.data.databaseparams.DataPipelineParams, params: Optional[tfaip.data.databaseparams.DataGeneratorParams]) tfaip.data.pipeline.datapipeline.DataPipeline
- pipeline_by_mode(mode: tfaip.data.pipeline.definitions.PipelineMode) tfaip.data.pipeline.datapipeline.DataPipeline
- create_input_layers() Dict[str, keras.engine.input_layer.Input]
- create_target_as_input_layers() Dict[str, keras.engine.input_layer.Input]
- create_meta_as_input_layers() Dict[str, keras.engine.input_layer.Input]
- input_layer_specs(**kwargs)
- target_layer_specs(**kwargs)
- meta_layer_specs(**kwargs)
- dataset_input_layer_specs() Dict[str, tensorflow.python.framework.tensor_spec.TensorSpec]
tf.data.Dataset generator: Input specs.
Usually, these should not be overwritten, only if using a custom TFDatasetGenerator (and thus a custom DataPipeline).
- dataset_target_layer_specs() Dict[str, tensorflow.python.framework.tensor_spec.TensorSpec]
tf.data.Dataset generator: Targets specs
Usually, these should not be overwritten, only if using a custom TFDatasetGenerator (and thus a custom DataPipeline).
- dataset_meta_layer_specs() Dict[str, tensorflow.python.framework.tensor_spec.TensorSpec]
tf.data.Dataset generator: Meta specs
Usually, these should not be overwritten, only if using a custom TFDatasetGenerator (and thus a custom DataPipeline).
- dump_resources(root_path: str, data_params_dict: dict)
- input_tensor_key_for_regularization() Optional[str]
input gradient regularization helper method !!! the inputs must be stored in a dict under this key !!!
DataBaseParams
Definition of DataBaseParams, DataPipelineParams, and DataGeneratorParams
- class tfaip.data.databaseparams.DataGeneratorParams
Bases:
abc.ABCParameter class that defines how to construct a DataGenerator.
- static cls() Type[DataGenerator]
- create(mode: tfaip.data.pipeline.definitions.PipelineMode) DataGenerator
- __init__() None
- classmethod from_dict(kvs: Optional[Union[dict, list, str, int, float, bool]], *, infer_missing=False) dataclasses_json.api.A
- classmethod from_json(s: Union[str, bytes, bytearray], *, parse_float=None, parse_int=None, parse_constant=None, infer_missing=False, **kw) dataclasses_json.api.A
- classmethod schema(*, infer_missing: bool = False, only=None, exclude=(), many: bool = False, context=None, load_only=(), dump_only=(), partial: bool = False, unknown=None) dataclasses_json.mm.SchemaF[dataclasses_json.mm.A]
- to_dict(encode_json=False, include_cls=True) Dict[str, Optional[Union[dict, list, str, int, float, bool]]]
- to_json(*, skipkeys: bool = False, ensure_ascii: bool = True, check_circular: bool = True, allow_nan: bool = True, indent: Optional[Union[int, str]] = None, separators: Optional[Tuple[str, str]] = None, default: Optional[Callable] = None, sort_keys: bool = False, **kw) str
- class tfaip.data.databaseparams.DataPipelineParams(batch_size: int = 16, limit: int = -1, prefetch: int = -1, num_processes: int = 4, batch_drop_remainder: bool = False, shuffle_buffer_size: int = -1, mode: tfaip.data.pipeline.definitions.PipelineMode = PipelineMode.TRAINING, bucket_boundaries: List[int] = <factory>, bucket_batch_sizes: Optional[List[int]] = None, use_shared_memory: bool = False)
Bases:
objectParameter class that defines the general parameters, e.g. batch size, prefetching, number of processes, … of a certain (e.g. train or val) pipeline.
- batch_size: int = 16
- limit: int = -1
- prefetch: int = -1
- num_processes: int = 4
- batch_drop_remainder: bool = False
- shuffle_buffer_size: int = -1
- mode: tfaip.data.pipeline.definitions.PipelineMode = 'training'
- bucket_boundaries: List[int]
- bucket_batch_sizes: Optional[List[int]] = None
- __init__(batch_size: int = 16, limit: int = -1, prefetch: int = -1, num_processes: int = 4, batch_drop_remainder: bool = False, shuffle_buffer_size: int = -1, mode: tfaip.data.pipeline.definitions.PipelineMode = PipelineMode.TRAINING, bucket_boundaries: List[int] = <factory>, bucket_batch_sizes: Optional[List[int]] = None, use_shared_memory: bool = False) None
- classmethod from_dict(kvs: Optional[Union[dict, list, str, int, float, bool]], *, infer_missing=False) dataclasses_json.api.A
- classmethod from_json(s: Union[str, bytes, bytearray], *, parse_float=None, parse_int=None, parse_constant=None, infer_missing=False, **kw) dataclasses_json.api.A
- classmethod schema(*, infer_missing: bool = False, only=None, exclude=(), many: bool = False, context=None, load_only=(), dump_only=(), partial: bool = False, unknown=None) dataclasses_json.mm.SchemaF[dataclasses_json.mm.A]
- to_dict(encode_json=False, include_cls=True) Dict[str, Optional[Union[dict, list, str, int, float, bool]]]
- to_json(*, skipkeys: bool = False, ensure_ascii: bool = True, check_circular: bool = True, allow_nan: bool = True, indent: Optional[Union[int, str]] = None, separators: Optional[Tuple[str, str]] = None, default: Optional[Callable] = None, sort_keys: bool = False, **kw) str
- class tfaip.data.databaseparams.DataBaseParams(pre_proc: tfaip.data.pipeline.processor.params.DataProcessorPipelineParams = <factory>, post_proc: tfaip.data.pipeline.processor.params.DataProcessorPipelineParams = <factory>, resource_base_path: str = '/home/docs/checkouts/readthedocs.org/user_builds/tfaip/checkouts/latest/docs/source')
Bases:
abc.ABCParameters that define the overall setup of the data pipelines (pre_proc and post_proc)
Parameters of this class will be shared among all DataProcessors.
- abstract static cls() Type[DataBase]
- create() DataBase
- pre_proc: tfaip.data.pipeline.processor.params.DataProcessorPipelineParams
- post_proc: tfaip.data.pipeline.processor.params.DataProcessorPipelineParams
- resource_base_path: str = '/home/docs/checkouts/readthedocs.org/user_builds/tfaip/checkouts/latest/docs/source'
- __init__(pre_proc: tfaip.data.pipeline.processor.params.DataProcessorPipelineParams = <factory>, post_proc: tfaip.data.pipeline.processor.params.DataProcessorPipelineParams = <factory>, resource_base_path: str = '/home/docs/checkouts/readthedocs.org/user_builds/tfaip/checkouts/latest/docs/source') None
- classmethod from_dict(kvs: Optional[Union[dict, list, str, int, float, bool]], *, infer_missing=False) dataclasses_json.api.A
- classmethod from_json(s: Union[str, bytes, bytearray], *, parse_float=None, parse_int=None, parse_constant=None, infer_missing=False, **kw) dataclasses_json.api.A
- classmethod schema(*, infer_missing: bool = False, only=None, exclude=(), many: bool = False, context=None, load_only=(), dump_only=(), partial: bool = False, unknown=None) dataclasses_json.mm.SchemaF[dataclasses_json.mm.A]
- to_dict(encode_json=False, include_cls=True) Dict[str, Optional[Union[dict, list, str, int, float, bool]]]
- to_json(*, skipkeys: bool = False, ensure_ascii: bool = True, check_circular: bool = True, allow_nan: bool = True, indent: Optional[Union[int, str]] = None, separators: Optional[Tuple[str, str]] = None, default: Optional[Callable] = None, sort_keys: bool = False, **kw) str