Datasets #
Note
There are two main classes related to datasets handling in Dataiku’s Python APIs:
-
dataiku.core.dataset.Datasetin thedataikupackage, which deals primarily with reading and writing data. It has the most flexibility when it comes to reading and writing -
dataikuapi.dss.dataset.DSSDatasetin thedataikuapipackage which is mostly used for creating datasets, managing their settings, building flows, creating ML models, and performing a wider range of operations on datasets.
For more details on the two packages, please see Getting started
For starting code samples, please see Python Recipes .
Detailed samples about interacting with datasets can be found in:
Reference documentation for the classes supporting interaction with datasets can be found in Datasets