Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replacing pandas-profiling (deprecated) with ydata-profiling #183

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

gAldeia
Copy link

@gAldeia gAldeia commented Sep 10, 2024

The previously known pandas profiling is now part of a bigger project and is decoupling from the idea that it is intended to be used only with data frames.

The package's name has changed, and the last version of pandas-profiling was released over a year ago.

The GitHub workflow for profiling new datasets is not working as it should due to deprecated dependencies.

I am trying to submit new datasets, and have been facing some issues related to pandas-profilng. It seems that one of its dependencies has migrated (https://docs.pydantic.dev/2.0/migration/#basesettings-has-moved-to-pydantic-settings, also see the GitHub actions error when running python -m pmlb.profiling). After looking at the PyPI docs (https://pypi.org/project/pandas-profiling/), I found that we should just replace the package, and it should be fine.

Please let me know if there is any changes that I should make.

Thank you for the attention and for reviewing this PR!

the previously known pandas-profiling is now part of a bigger
project and is decoupling from the idea that is intended to be
used only with dataframes.

The name of the package has changed, and the last version of
`pandas-profiling` was released more than a year ago.

The github workflow for profiling new datasets is not working
as it should, due to deprecated dependences.
@trangdata
Copy link
Collaborator

Thanks for this @gAldeia! I'm struggling to resolve the action error related to reticulate at the moment. Any insight would be much appreciated! 🙏🏽

@gAldeia
Copy link
Author

gAldeia commented Sep 11, 2024

Hi @trangdata! I actually saw your PR a few moments after creating it. Sorry for the duplicate. I decided to keep mine to remind us to change the docs, regardless of which PR will be merged.

In fact, I spent the last two days trying to debug this reticulate error.
I managed to reproduce it locally. What seems to be happening here is that reticulate is using a Python version other than 3.8 (the one used on the GitHub actions). Sometimes, mine uses Python 3.10, and sometimes 3.12. I am working on having reticulate work with Python 3.8, but I think either the reticulate::install_miniconda or the pip that comes with it is messing up with versions, forcing Python to be other than 3.8, but I am not sure.
I am running it in WSL. I tried installing Ubuntu dev packages to work with Tiff, but none actually worked. I also tried some explicit installation of Pillow, but I also had no success.

I will keep working on it, and if I figure out how to solve it I will let you know!

@trangdata
Copy link
Collaborator

Thanks so much @gAldeia. Have you got any further in your investigation of this issue? 🙏🏽

@gAldeia
Copy link
Author

gAldeia commented Nov 4, 2024

@trangdata Unfortunatelly not :(
I created a branch in my fork to test some stuff, but none worked yet.
If you want to see my attempts: https://github.com/gAldeia/pmlb/commits/libtiff_fix/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants