lsdb.loaders.dataframe

lsdb.loaders.dataframe#

Submodules#

Package Contents#

Functions#

from_dataframe(→ lsdb.catalog.Catalog)

Load a catalog from a Pandas Dataframe in CSV format.

from_dataframe(dataframe: pandas.DataFrame, lowest_order: int = 0, highest_order: int = 5, partition_size: int | None = None, threshold: int | None = None, margin_order: int | None = -1, margin_threshold: float = 5.0, use_pyarrow_types: bool = True, **kwargs) lsdb.catalog.Catalog[source]#

Load a catalog from a Pandas Dataframe in CSV format.

Parameters:
  • dataframe (pd.Dataframe) – The catalog Pandas Dataframe

  • lowest_order (int) – The lowest partition order

  • highest_order (int) – The highest partition order

  • partition_size (int) – The desired partition size, in number of rows

  • threshold (int) – The maximum number of data points per pixel

  • margin_order (int) – The order at which to generate the margin cache

  • margin_threshold (float) – The size of the margin cache boundary, in arcseconds

  • use_pyarrow_types (bool) – If True, the data is backed by pyarrow, otherwise we keep the original data types. Defaults to True.

  • **kwargs – Arguments to pass to the creation of the catalog info

Returns:

Catalog object loaded from the given parameters