BUG: Performance issue with fillna() after merging DataFrames #61180
Labels
Missing-data
np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
Needs Info
Clarification about behavior needed to assess issue
Performance
Memory or execution speed performance
Pandas version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.
Reproducible Example
Issue Description
Bug Description
When using
fillna()
after merging DataFrames, unexpected behavior and performance issues occur.Reproducible Code Example
Expected Behavior
Expected Behavior
The
fillna()
operation should efficiently fill missing values after merging, without unexpected behavior or significant performance degradation.Actual Behavior
The
fillna()
operation may exhibit unexpected behavior or poor performance, especially with larger datasets.Additional Context
This issue becomes more apparent when working with larger datasets and complex merge operations. Improving the performance and reliability of
fillna()
after merging would greatly benefit data processing workflows.Environment
Installed Versions
INSTALLED VERSIONS
commit : None
python : 3.13.2.final.0
python-bits : 64
OS : Linux
OS-release : 5.10.102.1-microsoft-standard-WSL2
Version : #1 SMP Wed Mar 2 00:30:59 UTC 2022
machine : x86_64
processor : x86_64
byteorder : little
LC_ALL : None
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8
pandas : 3.0.0
numpy : 1.26.3
pytz : 2024.1
dateutil : 2.8.2
pip : 24.0
setuptools : 69.0.2
Cython : 3.0.8
pytest : 8.0.0
hypothesis : 6.98.3
sphinx : 7.2.6
blosc : None
feather : None
xlsxwriter : 3.1.9
lxml.etree : 5.1.0
html5lib : 1.1
pymysql : None
psycopg2 : None
jinja2 : 3.1.3
IPython : 8.21.0
pandas_datareader: None
[other dependencies ...]
The text was updated successfully, but these errors were encountered: