Skip to content

Latest commit

 

History

History
204 lines (148 loc) · 4.56 KB

CHANGELOG.rst

File metadata and controls

204 lines (148 loc) · 4.56 KB

Change Log

0.1.24

  • Replace default parquet engine, deprecate Fastparquet, start using as default pyarrow
  • Remove "chunksize" from collection.py as it's not used by dask nor pyarrow.
  • Solve issue #69
  • Fix collection.write by passing overwrite parameter.
  • rename metadata.json to pystore_metadata.json, to avoid conflicts with pyarrow

0.1.23

  • Fixed deprecate 'in' operator to be compatible with pandas 1.2.0 onwards (PR #58)
  • Add argument to append() to control duplicates (PR #57)

0.1.22

  • Uses PYSTORE_PATH environment variable, if it exists, as the path when not calling store.set_path() (defaults to ~/pystore)

0.1.21

  • Updated PyPi install script (lib is the same as 0.1.20)

0.1.20

  • Fix: Resetting config._CLIENT to None

0.1.19

  • Fixed: Exposed set/get_partition_size and set/get_clients

0.1.18

  • Added support for dask.distributed via pystore.set_client(...)
  • Added store.item(...) for accessing single collection item directly (pull request #44)
  • Added store.set_partition_size(...) and store.get_partition_size(). Default is ~99MB.

0.1.17

  • Updated PyPi install script (lib is the same as 0.1.16)

0.1.16

  • Fixed npartition=None issues on .append()

0.1.15

  • Fixed append issues
  • Raising an error when trying to read invalid item
  • Fixed path issued (removed unnecessary os.path.join calls)

0.1.14

  • Auto-detection and handling of nano-second based data

0.1.13

  • collection.reload_items defaults to False
  • Default npartitions and chunksize are better optimized (~99MB/partition)
  • collection.apply() repartitions the dataframe based on new data size (~99MB/partition)
  • Option to specify the default engine for the store by specifying engine="fastparquet" or engine="pyarrow" (dafaults to fastparquet)
  • Solving fastparquet/numba issues when using Dask >= 2.2.0 by importing numba in __init__.py

0.1.12

  • Added reload_items (default True) to collection.write and collection.delete to explicitly re-read the collection's items' directory

0.1.11

  • Reversed list_snapshots() behaviour
  • Added collection.threaded_write(...) method
  • collection.items being updated using items.add() and an async/threaded directory read

0.1.10

  • Switched from dtype_str to str(dtype) (Pandas 0.25+ compatibility)
  • Implemented collection.items and collection.snapshots as @property to reduce initialization overhead
  • collection.items and collection.snapshots are now of type set()
  • Option to specify both npartitions and chunksize in collection.append()

0.1.9

  • Fixed issues #13 and #15

0.1.8

  • Added pystore.read-csv() to quickly load csv as dask dataframe, ready for storage

0.1.7

  • Using os.path.expanduser("~") to determine user's home directory
  • collection.write(...) accepts Dask dataframes

0.1.6

  • Misc improvements

0.1.5

  • Added support for Python 2.7

0.1.4

  • Added support for Python 3.7

0.1.3

  • Fixed support for nanosecond-level data

0.1.2

  • epochdate defaults to True when storing ns data
  • Switched to dtype_str instead of str(dtype)

0.1.1

  • Infer datetime format when converting to Pandas

0.1.0

  • Increased version to fix setup
  • Bugfixes

0.0.12

  • Switched path parsing to pathlib.Path to help with cross-platform compatibility
  • Minor code refactoring

0.0.11

  • Adding an index name when one is not available

0.0.10

  • Added pystore.delete_store(NAME), pystore.delete_stores(), and pystore.get_path()
  • Added Jupyter notebook example to Github repo
  • Minor code refactoring

0.0.9

  • Allowing _ and . in snapshot name

0.0.8

  • Changed license to Apache License, Version 2.0
  • Moduled seperated into files
  • Code refactoring

0.0.7

  • Added support for snapshots
  • collection.list_items() supports querying based on metadata
  • Some code refactoring

  • Exposing more methods
  • Path setting moved to pystore.set_path()
  • Store.collection() auto-creates collection
  • Updated readme to reflect changes
  • Minor code refactoring

0.0.5

  • Not converting datetimte to epoch by defaults (use epochdate=True to enable)
  • Using "snappy" compression by default
  • Metadata's "_updated" is now a YYYY-MM-DD HH:MM:SS.MS string

0.0.4

  • Can pass columns and filters to Item object
  • Faster append
  • Store.path is now public

0.0.3

  • Updated license version

0.0.2

  • Switched readme/changelog files from .md to .rst.

0.0.1

  • Initial release