Skip to content

Latest commit

 

History

History
711 lines (571 loc) · 40.7 KB

CHANGELOG.md

File metadata and controls

711 lines (571 loc) · 40.7 KB

vaex 4.15.0

vaex-core 4.15.0

  • Fixes

    • allow single columns or expressions in materialize #2249
    • arrow data used in selection would ignore null values or fail #2196
    • build expressions / filters with an arrow string scalar #2244
    • selection-dropna did not work with non-identifier expresssion #2208
    • use vaex.settings for thread counts #2231
    • raise an informative exception when extract can not run #2232
  • Performance

    • Import and version checking improvements #2226

vaex-hdf5 0.14.0

  • Features
    • store arrow arrays in hdf5 using null bitmasks #2245

vaex 4.14.0

vaex-core 4.14.0

  • Features

    • Add support for the arrow export CSV backend in export_many #2220
    • Auto detect type when Lazy CSV reading #2224
  • Fixes

    • convert options for lazy from_csv_arrow #2218
    • support arrow datasets containing multiple fragments without row groups #2221
    • Interchange protocol fixes and updates #2150

vaex 4.13.0

vaex-core 4.13.0

  • Features

    • Out of core (lazy) and normal CSV support using Apache Arrow CSV reader #1028
    • Support compressed csv reading for pandas and arrow #1028
    • Expose vaex.DataFrame and vaex.Expression for typing #2186
  • Fixes

    • Negative indexing of a filtered dataframe #2163
    • nunique aggregator for numeric with selection gave wrong results #2199
    • vaex.from_arrow_dataset had no working implementation #2195

vaex-hdf5 0.13.0

Requires vaex-core 4.13.0 for refactor of dataset

vaex-viz 0.5.4

  • Fixes
    • correctly place the colorbar for matplotlib 3.6.0 #2215

vaex-astro 0.9.2

  • Fixes

vaex 4.12.0

vaex-core 4.12.0

  • Features
    • get_column_names accepts a dtypes argument #2160
  • Fixes
    • df.extract() was not thread safe #2182
    • uuid4 function was not always restored properly #2181
    • groupby could overflow due to wrong downcasting #2137
    • support unique with selection=True #2164
    • value_counts for strings was sometimes off #2147
    • better arrow support for interchanging categorical columns #2135

vaex-viz 0.5.3

  • Fixes
    • Improve selection behaviour for histogram and update docstrings #2143

vaex 4.11.1

vaex-core 4.11.1

  • Fix
    • Slicing of empty dataframe #2129
    • Dataframe interchange protocol: Better support for numpy backed categoricals #2133
    • Dataframe interchange protocol: Column.null_count returns NA count to comply with spec #2132
    • Export csv with 'protected' column name #2117

vaex 4.11.0

vaex-core 4.11.0

  • Fix
    • df.func.where relies more on pyarrow 5's if_else #2096
    • correct $VAEX_PATH_HOME -> $VAEX_PATH #2101
    • Various join fixed when the missing values were present #808
    • Various join fixed when the missing values were present #808
    • string join on large_list with large_strings. #2112
    • Working arm wheel for osx (#) #2124
  • Performance
  • Feature
    • Add how to dropna, dropinf etc #2104

vaex-hdf5 0.12.3

vaex 4.10

vaex-core 4.10.0

  • Fix
    • Join issue with missing values or nans #2077
  • Feature
    • vaex.agg.describe aggregator, and descripe on groupby #2004
    • df.struct.flatten to turn arrow structs into multiple columns #2072
    • Reuse hashmap for isin (df fingerprint is stable, better for cache) #2089
  • Performance
    • Value_counts uses a task to get caching support #2085.

vaex-ml 0.18.0

  • Features
    • Enable selections in metrics #2073

vaex 4.9.2

vaex-core 4.9.2

  • Fix
    • Write to cached filesystem when metadata argument is needed #1993
    • Multi-d sparse groupby would fail for arrow data (e.g. list agg) #2031
    • Exporting arrow with large_string would result in schema conflict #2030
    • expression engine did not roundtrip dict correctly, missing ", " #2039
    • Changed deprecated numpy.float to numpy.float64 #2023
    • Replace pylab with pyplot #2047
    • isin should accept empty array or non-existing values #2064
    • Ordinal_encode with values which extra entries gave wrong results #2059
    • Combining filters with arrow arrays failed converting (gave TypeError) #2038
    • Wrong order of casting and subtracting offset cause overflow #2065

vaex-hdf5 0.12.2

  • Fix
    • Do not keep a reference to numpy arrays on closing an hdf5 file 2066

vaex 4.9.1

vaex-core 4.9.1

  • Fix
    • When aggregation leads to arrow data, non-dense binners failed (e.g. vaex.agg.list) #2017
    • Filtering by boolean column would miss the column as dependency #2016

vaex 4.9.0

vaex-core 4.9.0

  • Features
    • Progress bar for percentile_approx and median_approx #1889
    • Better casting of strings to datetime #1920
    • We better support numpy scalars now, and more arrow time units. #1921
    • Allow sorting by strings, multiple columns and multiple directions #1963
    • Support JSON in df.export #1974
    • New/better aggregators
      • first/last use different type 'sort column. #1848
      • Skew and kurtosis #1946
      • List aggregator #1987
    • Pre-sort by the grouping columns in df.groupby (better performance) #1990
  • Performance
    • No copy of hashmap and GIL release #1893 #1961
    • Store strings in hashmap in arrow array, making map.key_array() faster #1976
  • Fix
    • Respect row_limit when the groupby is dense #1894
    • Fingerprint collision possible if filter uses virtual column #1949
    • Apply with filtered data could give wrong dtypes #1936
    • Strings array growing failed when first string was zero length #1956
    • Use less processes for when using multiprocessing. #1979
    • Support chunked arrays and empty chunks in value counts. #1958 #1975
    • Allow renaming of function, to make join use with functions without name collisions. #1966
    • Join would fail if the rhs had no columns besides the join one #2010
    • hdf5 export fails for concat df with missing columns #1493
    • Allow col as column name #1992

vaex 4.8.0

vaex-core 4.8.0

  • Features
    • Multiple example datasets provided in vaex.datasets #1317
    • We do not use asyncio for the default sync execute path #1783
    • Executor works with asyncio with multiple tasks [#1784]#1784)
    • Auto execute context manager makes vaex behave normal with await #1785
    • Support exporting arrow and parquet to file like objects #1790
    • Put lock files in $VAEX_HOME/lock #1797
    • Show progress when converting the included datasets #1798
    • Limit and limit_raise for unique and nunique #1801
    • Lazy ordinal encode #1813
    • Configure logging using settings system#1811
    • Export to JSON #1789
    • Progress bar can be configured using settings system #1815
    • fillna and fillmissing should upcast integers when needed #1869
  • Performance
    • Moved mutex use to the C++ layer to avoid GIL issues #1847
    • Many refactors to improve performance #1863 #1869
  • Fix
    • Support empty parquet and arrow files #1791
    • Keep virtual column order when renaming/dropping to not break state transfer #1788
    • blake3 compatibility issues #1818 db527a6
    • Avoid frozendict 2.2.0 which can segfault on Python 3.6#1856
    • Use label instead of expression for non-ident column names in binby #1842
  • Development

vaex-hdf5 0.12.0

  • Features
    • Support storing Arrow Dictionary encoded/categoricals in hdf5 #1814

vaex-ml 0.17.0

Requires vaex-core 4.8.0 for the vaex.datasets.iris()

vaex-server 0.8.1

Made compatible with Python 3.6

vaex 4.7.0

vaex-core 4.7.0

  • Features
    • Allow casting integers to timedelta64 type #1741
    • When a single task can fail, other can continue #1762
    • Improved rich progress bar support #1771
    • vaex.from_records to build a dataframe from a list of dicts #1767
    • Settings in Vaex can be configured in a uniform way #1743
    • Unique for datetime64 and timedelta64 expressions #1016
    • Copy argument for binby, similar to groupby 4e7fd8e
  • Performance
    • Improve performance for filtered dataframes 1685
  • Fixes
    • S3: endpoint override combined with globbing #1739
    • Support having filtered and unfiltered tasks in 1 pass over the data #1761
    • Continue next tasks even when old ones are cancelled #1769
    • Handle empty arrow files #1773
    • Evaluate and evaluate_iterator did not work for empty filtered dataframes #1775

vaex-hdf5 0.11.1

  • Features
    • do not track times to have deterministic output (useful for lineage/hash output) #1772

vaex-ml 0.16

Requires vaex-core 4.7 for uniform settings

vaex-server 0.8

Requires vaex-core 4.7 for uniform settings

vaex-jupyter 0.7

Requires vaex-core 4.7 for uniform settings

  • Features
    • Editor widget for settings #1743

vaex-viz 0.5.1

  • Fixes
  • Histogram method on expression to propagate kwargs #1757

vaex 4.6.0

vaex-core 4.6.0

  • Features
    • OSX Metal support for jitting expressions #584
    • Improved progress support, including Rich progress bars #1738
    • Control number of columns and rows being printed #1672
    • Groupby with regular bins (similar to binby) #1589
    • Groupby with a limited number of values, and 'OTHERS' #1641
    • New aggregators: vaex.agg.any and vaex.agg.all #1630
    • Better API for correlation and mutual information #536
    • Materialize datasets columns for better performance of non-memory mapping files (e.g. parquet) #1625
    • Avoid using nest_asyncio #1546
    • Multi level cache support (e.g. memory and disk) #1580
    • Do not mutate dataframe when comparing dates. #1584
  • Performance
    • Fingerprint for tasks are more stable when the dataframe changes, but not the task description, for more cache hits. #1627
    • Faster conversion between Arrow and NumPy #1625
    • Cache sparse-finding/combining of high-d groupby #1588
    • Allow (lazy) math and computations with aggregators #1612
    • Less passes over the data when multiple dataframes use the same dataset #1594
    • Share evaluation of expressions of selections #1594
    • Delay support for groupby #1594
  • Fixes
    • Missing values in groupby were not well supported #1637
    • Groupby over boolean #1632
    • Negative periods for diff and shift #1608
    • Arrow timestamp promotion during concatenation #1551

vaex-server 0.7

Requires vaex-core 4.6

vaex-ml 0.15

Requires vaex-core 4.6

  • Performance
    • Dot product with many columns does not use expressions, but dedicated function #1671

vaex 4.5.0

vaex-core 4.5.1

  • Features
    • Filelocks for multi process convert=True cooperation #1573
  • Performance
    • Fingerprint speedups #1574
    • Expression.nunique asked unique for Python list (slow) #1576
    • Groupby was slow for particular data (with low bits 0) #1571
    • Blob encoding is faster due to blake3 #1575
    • Stop iterating over dataset when exception occurs when computing #1577

vaex-core 4.5.0

  • Features
    • Protect file creation parts with lock files #1541
    • Expression.str.extract to extract parts of strings using regex to a struct #1423
  • Performance
    • We now cache Expression.nunique() #1565
    • Hashmaps memory is shared among threads (does not scale linear with number of threads), and avoids a merge phase #1525
    • Hashmaps serialize efficiently #1525
    • Avoid falling back to eval to get the dtype #1514

vaex-hdf5 0.10.0

  • Features
    • Write higher dimensional arrays to hdf5 files #1563

vaex-ml 0.14.0

  • Features

vaex-astro 0.9

Requires vaex 4.5.0 due to private API change.

vaex-server 0.6.1

  • Fixes
    • Missing imports (now checked in CI) #1516

vaex-contrib 0.1.0

  • Features
    • Import from and export to Google BigQuery #1470

vaex 4.4.0

vaex-core 4.4.0

  • Performance
    • Reuse filter data when slicing a dataframe #1287
    • Faster astype('str') #1411
    • Task refactor, which allows for more caching #1433
  • Features
  • Fixes
    • df.concat did not copy functions #1287
    • Dropping columns when name was invalid identifier #1434
    • Old dates wrapped due to negative ints and casting to unsigned #1504
    • Timestamp to NumPy datetime64 would ignore units #1513
    • Thread unsafety could trigger issues in Plotly dash #1512

vaex-server 0.6.0

  • Complete refactor, now using FastAPI by default #1300

vaex-ml 0.13.0

  • Tensorflow/keras support #1510

vaex-hdf5 0.9.0

  • Features
    • Support writing/reading from custom hdf5 groups #1439
    • Support laying out an empty hdf5 file for writing #1439
  • Fixes
    • File order close issue on Windows #1479

vaex 4.3.0

vaex-core 4.3.0

  • Performance
    • Reuse filter data when slicing a dataframe #1287
  • Features
    • Cache task results, with support for Redis and diskcache #1393
    • df.func.stack for stacking columns into Nd arrays #1287
    • Sliding windows / shift / diff / sum #1287
    • Embed join/groupby/shift in dataset (opt in via df._future(), will be default in vaex v5) #1287
    • df.fingerprint() - a cross runtime unique key for caching #1287
    • limit rows in groupby using early stop #1391
    • Compare date columns to string values formatted in ISO 8601 format 621a341b54f9b4112f24e2ffd86612753df19fef
  • Fixes
    • df.concat did not copy functions #1287
    • Filters with column name equals to function names a159777e2dc13ec762914c51c8b5550efec5f845

vaex 4.2.0

vaex-core 4.2.0

  • Performance
    • Perform groupby in a sparse way for less memory usage/performance (up to 250x faster) #1381
  • Features
  • Fixes
    • Proper use of logging framework #1384
    • Aggregating with 'count' would ignore custom names #1345
    • Join supports datetime column

vaex-ml 0.12.0

  • Features

vaex-server 0.4.1

  • Features
    • SSL support 5dc29edd5b15eb4e1fe9c6981c67edd477481484

vaex 4.1.0 (2021-Mar-18)

vaex-core 4.1.0

  • Features
    • groupby datetime support #1265
  • Fixes
    • Improved fsspec support #1268
  • Performance
    • df.extract() uses mask instead of indices 398b682fe9042b3336120e9013e15bbd638620ed

vaex 4.0.0 (2021-Mar-9)

  • Breaking changes:
    • Arrow is now a core dependency, vaex-arrow is deprecated. All methods that return string, will return Arrow arrays #517
    • Opening an .arrow file will expose the arrays as Apache Arrow arrays, not numpy arrays. #984
    • Columns (e.g. df.column['x']) may now return a ColumnProxy, instead of the original data, slice it [:] to get the underlying data (or call .to_numpy()/to_arrow() or try converting it with np.array(..) or pa.array(..)). #993
    • All plot methods went into the df.viz accessor #923

vaex-arrow (DEPRECATED)

This is now part of vaex-core.

vaex-astro 0.8.0

  • Requirement changed to vaex-core >=4,<5

vaex-core 4.0.0

  • Fixes
    • Repeated dropna/dropnan/dropmissing could report cached length. #874
    • Trimming concatenated columns. #860
    • percentile_approx works for 0 and 100 percentile. #818
    • Expression containing kwarg=True were treated as invalid. #861
    • Unicode column names fully supported #974
  • Features
    • Datetime floor method #843
    • dropinf (similar to dropna) #821
    • Support for streaming from Google Cloud Storage. #898
    • IPython autocomplete support (e.g. df['hom' (tab)) #961
    • Out of core Parquet support using Arrow Dataset scanning #993
  • Refactor
    • Use arrow.compute for several string functions/kernels. #885
    • Separate DataFrame and Dataset. #865
  • Performance
    • concat (vaex.concat or df.concat) is about 100x faster. #994

vaex-distributed (DEPRECATED)

This is now part of vaex-enterprise (was a proof of content, never functional).

vaex-graphql 0.2.0

  • Requirement changed to vaex-core >=4,<5

vaex-hdf5 0.7.0

  • Requirement changed vaex-core >=4,<5

vaex-jupyter 0.6.0

  • Requirement changed to vaex-core >=4,<5

vaex-ml 0.11.0

  • Features
    • Batch training for CatBoost. #819
    • Support for predict_proba and predict_log_proba for sklearn classifiers. #927

vaex-server 0.4.0

  • Requirement changed to vaex-core >=4,<5

vaex-viz 0.5.0

  • Requirement changed to vaex-core >=4,<5

vaex 3.1.0

vaex-jupyter 0.5.2 (2020-6-12)

  • Features
    • Normalize histogram and change selection mode. #826

vaex-ml 0.11.0-dev0

* Features
  * Autogenerate the fast (or functional) API [#512](https://github.com/vaexio/vaex/pull/512)

vaex-core 2.0.3 (2020-6-10)

  • Performance
    • isin uses hashmaps, leading to a 2x-4x performance increase for primitives, 200x for strings in some cases #822

vaex-jupyter 0.5.1 (2020-6-4)

  • Features
    • Selection toggle list. #797

vaex-server 0.3.1 (2020-6-4)

  • Fixes
    • Remote dataframe was still using dtype, not data_type. #797

vaex-ml 0.10.0 (2020-6-4)

  • Features
    • Implementation of GroupbyTransformer #479

vaex-arrow 0.6.1 (2020-6-4)

  • Fixes
    • Various fixes for aliased columns (column names with invalid identifiers) #768

vaex-hdf5 0.6.1 (2020-6-4)

  • Fixes
    • Masked arrays supported in hdf5 files on s3 #781
    • Various fixes for aliased columns (column names with invalid identifiers) #768

vaex-core 2.0.2 (2020-6-4)

  • Fixes
    • Masked arrays supported in hdf5 files on s3 #781
    • Expression.map always uses masked arrays to be state transferrable (a new dataset might have missing values) #479
    • Support importing Pandas dataframes with version 0.23 #794
    • Various fixes for aliased columns (column names with invalid identifiers) #768 #793

vaex-core 2.0.1 (2020-5-28)

  • Fixes
    • Join could in rare cases point to row 0, when there were values in the left, not present in the right #765
    • Tabulate 0.8.7 escaped html, undo this to print dataframes nicely.

vaex 3.0.0 (2020-5-24)

  • Breaking changes:
    • Python 2 is not supported anymore
    • Variables don't have access to pi and e anymore
    • df.rename_column is now df.rename (and also renames variables)
    • DataFrame uses a normal dict instead of OrderedDict, requiring Python >= 3.6
    • Default limits (e.g. for plots) is minmax, so we don't miss outliers
    • df.get_column_names() returns the aliased names (invalid identifiers), pass alias=False to get the internal column name
    • Default value of virtual is True in method df.export, df.to_dict, df.to_items, df.to_arrays.
    • df.dtype is a property, to get data types for expressions, use df.data_type(), df.expr.dtype is still behaving the same
    • df.categorize takes min_value and max_value, and no longer needs the check argument, also the labels do not have to be strings.
    • vaex.open/from_csv etc does not copy the pandas index by default #756
    • df.categorize takes an inplace argument, similar to most methods, and returns the dataframe affected.

vaex-core 2.0.0 (2020-5-24)

  • Performance

    • Printing out of dataframes done in 1 evaluate call, making remote dataframe printing faster. #571
    • Joining is faster and uses less memory (2x speedup measured) #586
    • Faster typechecks when adding columns of dtype=object (as often happens when coming from pandas) #612
    • Groupby 2x to 4x faster #730
  • Refactor

    • Task system is refactored, with task execution on CPU being default, and makes (de)serialization easier. #571
    • Serialization/encoding of data structures is more flexible, allowing binary blobs and json over the wire. #571
    • Execution and tasks support async await #654
  • Fixes

    • Renaming columns fixes #571
    • Joining with virtual columns but different data, and name collision fixes #570
    • Variables are treated similarly as columns, and respected in join #573
    • Arguments to lazy function which are numpy arrays gets put in the variables #573
    • Executor does not block after failed/interrupted tasks. #571
    • Default limits (e.g. for plots) is minmax, so we don't miss outliers #581
    • Do no fail printing out dataframe with 0 rows #582
    • Give proper NameError when using non-existing column names #299
    • Several fixes for concatenated dataframes. #590
    • dropna/nan/missing only dropped rows when all column values were missing, if no columns were specified. #600
    • Flaky test for RobustScaler skipped for p36 #614
    • Copying/printing sparse matrices #615
    • Sparse columns names with invalid identifiers are not rewritten. #617
    • Column names with invalid identifiers which are rewritten are shown when printing the dataframe. #617
    • Column name rewriting for invalid identifiers also works on virtual columns. #617
    • Fix the links to the example datasets. #609
    • Expression.isin supports dtype=object #669
    • Fix colum_count, now only counts hidden columns if explicitly specified #593
    • df.values respects masked arrays #640
    • Rewriting a virtual column and doing a state transfer does not lead to ValueError: list.remove(x): x not in list #592
    • df.<stat>(limits=...) will now respect the selection #651
    • Using automatic names for aggregators led to many underscores in name #687
    • Support Python3.8 #559
  • Features

    • New lazy numpy wrappers: np.digitize and np.searchsorted #573
    • df.to_arrow_table/to_pandas_df/to_items/df.to_dict/df.to_arrays now take a chunk_size argument for chunked iterators #589 (#699)
    • Filtered datasets can be concatenated. #590
    • DataFrames/Executors are thread safe (meaning you can schedule/compute from any thread), which makes it work out of the box for Dash and Flask #670
    • df.count/mean/std etc can output in xarray.DataArray array type, makes plotting easier #671
    • Column names can have unicode, and we use str.isidentifier to test, also dont accidently hide columns. #617
    • Percentile approx can take a sequence of percentages #527
    • Polygon testing, useful in combinations with geo/geojson data #685
    • Added dt.quarter property and dt.strftime method to expression (by Juho Lauri) #682

vaex-server 0.3.0 (2020-5-24)

  • Refactored server, can return multiple binary blobs, execute multiple tasks, cancel tasks, encoding/serialization is more flexible (like returning masked arrays). #571

vaex-viz 0.4.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3

vaex-graphql 0.1.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3

vaex-astro 0.7.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3

vaex-hdf5 0.6.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3

vaex-ml 0.9.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3

vaex-arrow 0.5.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3
  • Fixes
    • Booleans were negated, and didn't respect offsets.

vaex-jupyter 0.5.0 (2020-5-24)

  • Requirement of vaex-core >=2,<3
  • Breaking changes
    • vaex-jupyter is refactored #654

vaex 2.6.0 (2020-1-21)

vaex-core 1.5.0

  • Features
    • df.evaluate_iterator for efficient parallel chunked evaluation #515
    • Widget progress bar has time estimation #545
  • Fixes
    • Slicing arrow string arrays with masked arrays is respected/working #530]

vaex-ml 0.8.0

  • Performance
    • IncrementalPredictor uses parallel chunked support (2x speedup possible) #515
  • Fix
    • IncrementalPredictor: epochs now iterate over the whole DataFrame instead on a batch level #523
    • Rename vaex.ml.sklearn.SKLearnPredictor to vaex.ml.sklearn.Predictor #524
    • IncrementalPredictor can be used with sklearn.linear_model.SGDClassifier 539
  • Features
    • CycleTransformer #532
    • BayesianTargetEncoder #533
    • WeightOfEvidenceEncoder #534
    • Improve the consistency of the vaex.ml API for model transformers #552

vaex 2.5.0 (2019-12-16)

vaex-core 1.4.0

  • Performance
    • Dataframes are always true (implements __bool__) to avoid calling __len__ #496
  • Fixes
    • Do not duplicate column when joining DataFrames on a column with the same name #480
    • Better error messages/stack traces, and work better with debugger. #488
    • Accept numpy scalars in expressions. #462
    • Expression.astype can create datetime64 columns out of (arrow) strings arrays. #440
    • Invalid mask access triggered when memory-mapped read only for strings. #459
  • Features
    • Expressions are not evaluated for filtered data #483 #496 and selections 505
    • Filtering (using df.filter) allows more flexible (and growing/expanding!) filter. #489
    • Filtering and selections allow for booleans (True or False) to auto 'broadcast', to allow 'conditional filtering'. #489

vaex-ml 0.7.0

  • Features
    • IncrementalPredictor for scikit-learn models that support the .partial_fit method #497
  • Fixes
    • Adding unique function names to dataframes to enable adding a predictor twice #492

vaex-arrow 0.4.2

  * Compatibility with vaex-core 1.4.0

vaex 2.4.0 (2019-11-26)

vaex-core 1.3.0

  • Performance
    • Parallel df.evaluate #474
    • Avoid calling df.get_column_names (1000x for 1 billion rows per column use) #473
    • Slicing e.g df[1:-1] goes much faster for filtered dataframes #471
    • Dataframe copying and expression rewriting was slow #470
    • Double indices columns were not using index cache since empty dict is falsy #439
  • Features
    • multi-key sorting of a DataFrame #463
    • vaex expression to pandas.Series support #456
    • Dask array support #449 #476 example
    • isin() method for expressions #441 docs
    • Existing expressions are rewritten, to make them behave like arrays #450

vaex-hdf5 0.5.6

  • requires vaex-core >=1.3,<2 for parallel evaluate

vaex-jupyter 0.4.1

  • Fixes:
    • bqplot 0.12 revealed a bug/inconsistency with heatmap #465

vaex-arrow 0.4.1

  • Fixes
    • Support for Apache Arrow >= 0.15

vaex-ml 0.6.2

  • Fixes
    • Docstrings and minor improvements

vaex-graphql 0.0.1 (2019-10-15)

  • initial release 0.1

vaex 2.3.0 (2019-10-15)

vaex-core 1.2.0

  • feature: auto upcasting for sum #435
  • fix: selection/filtering fix when using masked values #431
  • fix: masked string array fixes #434
  • fix: memory usage fix for joins #439

vaex-arrow 0.4.1

  • fix: support for Apache Arrow >= 0.15