id_translation.dio.integration.pandas
Integration for Pandas types.
Module Attributes
PandasT | Supported pandas types.
Valid as_category string values.
Classes
PandasIO | Optional IO implementation for pandas types.
- class PandasIO(*, level=-1, missing_as_nan=None, as_category=False)
Bases: DataStructureIO[PandasT, NameType, SourceType, IdType]
Optional IO implementation for pandas types.
- Parameters:
level – Column level to use as names when translating a DataFrame with MultiIndex columns. See pandas.MultiIndex.get_level_values() for details. Ignored otherwise.
missing_as_nan – If set, unknown IDs will be NaN. Grouping operations will typically drop NaN values. If False, placeholders such as '<Failed: id=-1>' will be used instead. Default is True if as_category=True, False otherwise.
as_category – Set dtype='category' in the result. See Categorical translation for details.
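The effect of level and missing_as_nan can be illustrated with plain pandas. The sketch below does not call PandasIO; the column names and the "translated" values are made up for illustration, and only the placeholder string is taken from the parameter description above.

```python
import numpy as np
import pandas as pd

# DataFrame with two column levels; the innermost level holds the names.
columns = pd.MultiIndex.from_tuples([("left", "animal_id"), ("right", "owner_id")])
df = pd.DataFrame([[1, 10], [2, 20]], columns=columns)

# level=-1 refers to the last (innermost) column level.
names = list(df.columns.get_level_values(-1))

# Made-up translation results in which one ID was unknown.
with_nan = pd.Series(["Tom", np.nan, "Tom"])
with_placeholder = pd.Series(["Tom", "<Failed: id=-1>", "Tom"])

# groupby drops NaN keys by default, while a placeholder forms its own group.
nan_groups = with_nan.groupby(with_nan).size().to_dict()
placeholder_groups = with_placeholder.groupby(with_placeholder).size().to_dict()
```

With NaN the unknown ID silently disappears from the grouped result, which is why the placeholder form may be preferable when failures should stay visible.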
Categorical translation
Setting as_category=True converts the resultant translations to a categorical data type. The returned pandas.CategoricalDtype will be ordered, with the categories set to all real translations. If missing_as_nan=False, the categories may also include placeholders.
Certain fetchers, such as the MemoryFetcher(return_all=True), will return more IDs than requested. In this case the categories may also include values not present in the input data. This may also happen if data was prepared with Translator.go_offline(), or if multiple columns were mapped to the same source.
- extract(translatable, names)
Extract IDs from translatable.
- Parameters:
translatable – Data to extract IDs from.
names – List of names in translatable to extract IDs for.
- Returns:
A dict {name: ids}.
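As a rough picture of the {name: ids} shape, the following sketch uses a hypothetical helper, extract_ids, written with plain pandas. It is not the library's implementation; dropping nulls and de-duplicating IDs are assumptions made for the example.

```python
import pandas as pd


def extract_ids(df: pd.DataFrame, names: list[str]) -> dict[str, list]:
    """Hypothetical helper: collect the unique, non-null IDs of each named column."""
    return {name: df[name].dropna().unique().tolist() for name in names}


df = pd.DataFrame(
    {"animal_id": [1, 2, 2], "owner_id": [10, 10, 30], "weight": [4.2, 3.1, 5.0]}
)

# Only the requested names are included; "weight" is ignored.
ids = extract_ids(df, ["animal_id", "owner_id"])
```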
- classmethod handles_type(arg)
Return True if the implementation handles data for the type of arg.
- insert(translatable, names, tmap, copy)
Insert translations into translatable.
- Parameters:
translatable – Data to translate. Modified iff copy=False.
names – Names in translatable to translate.
tmap – Translations for IDs in translatable.
copy – If False, modify contents of the original translatable. Otherwise, return a copy.
- Returns:
A copy of translatable if copy=True, None otherwise.
- Raises:
NotInplaceTranslatableError – If copy=False for a type which is not translatable in-place.
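The copy semantics can be sketched with a hypothetical helper, insert_translations, built on plain pandas. This is an illustration under assumptions rather than the library's code: tmap is a plain dict of dicts standing in for the real translation map, and the translated strings are made up.

```python
import pandas as pd


def insert_translations(df, names, tmap, copy):
    """Hypothetical helper mirroring the documented copy semantics."""
    target = df.copy() if copy else df
    for name in names:
        target[name] = target[name].map(tmap[name])
    # A copy is returned only when copy=True; in-place calls return None.
    return target if copy else None


tmap = {"animal_id": {1: "1:Tom", 2: "2:Jerry"}}
original = pd.DataFrame({"animal_id": [1, 2]})

# copy=True: the original is left alone and a translated copy is returned.
translated = insert_translations(original, ["animal_id"], tmap, copy=True)
untouched = original["animal_id"].tolist()

# copy=False: the original is modified and nothing is returned.
result = insert_translations(original, ["animal_id"], tmap, copy=False)
```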
- names(translatable)
Extract names from translatable.
- Parameters:
translatable – Data to extract names from.
- Returns:
A list of names to translate. Returns None if names cannot be extracted.
- priority = 1999
Determines order in which IOs are considered (higher = earlier).
Set priority < 0 to disable.
- class PandasT
Supported pandas types.
alias of TypeVar('PandasT', ~pandas.DataFrame, ~pandas.Series, ~pandas.Index, ~pandas.MultiIndex)
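The categorical translation behaviour described for PandasIO can be approximated with plain pandas. The sketch below is an assumption-laden illustration, not the library's output: the translations dict stands in for everything a fetcher returned (including an ID that is absent from the data), and unknown IDs are left as NaN as with missing_as_nan=True.

```python
import pandas as pd

# Stand-in for all real translations known for a source, including an ID (3)
# that does not occur in the data below.
translations = {1: "1:Tom", 2: "2:Jerry", 3: "3:Spike"}
dtype = pd.CategoricalDtype(categories=list(translations.values()), ordered=True)

ids = pd.Series([2, 1, 2, -1])  # -1 has no translation

# Unknown IDs map to NaN; NaN is never one of the categories.
translated = ids.map(translations).astype(dtype)

# Counts cover every category, including the one not present in the data.
counts = translated.value_counts().to_dict()
```

Because the categories are fixed up front, aggregations over the result can report zero-count categories instead of omitting them.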