Modin 0.23.1.post0
anmyachev released this 15 Nov 19:54
The main purpose of this release is to port as many fixes as possible to the latest version, which supports Python 3.8.
Key Features and Updates Since 0.23.1
- Stability and Bugfixes
- FIX-#0000: Pin `unidist<=0.4.1`
- FIX-#4347: `read_excel`: defaults to pandas for unsupported types of `io` (#6462)
- FIX-#4507: Do not call `ray.get()` inside of the kernel executing call queues (#6633)
- FIX-#4687: Change `Column.null_count` to return a built-in `int` instead of NumPy scalar (#6526)
- FIX-#5164: Fix `unwrap_partitions` for virtual partitions when `axis=None` (#6560)
- FIX-#5536: Remove branch disabling `__getattribute__` for experimental mode (#6529)
- FIX-#6465: Fix `groupby.apply()` for UDFs that change the output's shape (#6506)
- FIX-#6479: HDK CalciteBuilder: Do not call `is_bool_dtype()` for categorical (#6480)
- FIX-#6509: Fix `reshuffling` in case of a string key (#6510)
- FIX-#6514: `test_sort_cols_str` from `test_dataframe.py` crashed on HDK 0.7.0 and python 3.9 (#6515)
- FIX-#6516: HDK: `test_dataframe.py` is crashed if Calcite is disabled (#6517)
- FIX-#6518: Fix interchange protocol for string columns (#6523)
- FIX-#6519: Consider `botocore` as an optional dependency (#6521)
- FIX-#6532: Fix `read_excel` so that it doesn't use `rich_text` param for old `openpyxl` (#6534)
- FIX-#6535: Pin `s3fs<2023.9.0` (#6536)
- FIX-#6537: Unpin `s3fs<2023.9.0` (#6544)
- FIX-#6541: Fix `ValueError: buffer source array is read-only` for `iloc` (#6538)
- FIX-#6553: Fix `read_csv` with `iterator=True` (#6554)
- FIX-#6572: Execute simple queries row-wise in pandas backend (#6575)
- FIX-#6594: Fix usage of Modin objects inside UDFs for `apply` (#6673)
- FIX-#6600: Fix usage of list of UDF functions in `Series.groupby.agg` (#6613)
- FIX-#6601: `sort_values` shouldn't affect source dataframe/series (#6603)
- FIX-#6602: Refactor `join` to avoid `distributing a dict object` warning (#6612)
- FIX-#6607: Fix incorrect cache after `.sort_values()` (#6608)
- FIX-#6628: Allow groupby diff for dates (#6631)
- FIX-#6632: Return Series instead of Dataframe for `groupby.apply` in case of experimental groupby (#6649)
- FIX-#6635: HDK: `read_csv`: treat object dtype as string (#6636)
- FIX-#6637: Fix `skiprows` parameter usage for `read_excel` (#6638)
- FIX-#6642: Fix `modin.numpy.array.sum` on HDK (#6643)
- FIX-#6647: Added init file to make `modin/experimental/sql/hdk/query.py` part of modin package (#6646)
- FIX-#6651: Make sure `Series.between` works correctly (#6656)
- FIX-#6680: Specify `navigation_with_keys=True` to fix docs build (#6681)
Contributors
@AndreyPavlenko
@Egor-Krivov
@Garra1980
@RehanSD
@anmyachev
@dchigarev
@vnlitvinov