Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reference 'count' is ambiguous, could be count, count. #1402

Closed
3 tasks done
waghts95 opened this issue Jul 31, 2023 · 1 comment
Closed
3 tasks done

Reference 'count' is ambiguous, could be count, count. #1402

waghts95 opened this issue Jul 31, 2023 · 1 comment

Comments

@waghts95
Copy link

waghts95 commented Jul 31, 2023

Current Behaviour

Pyspark.sql.utils.AnalysisException : Reference 'count' is ambiguous, could be count, count.

Expected Behaviour

Code should save html file.

Data Description

could not provide data.
It is confidential.

Code that reproduces the bug

report.to_file("/tmp/report.html")

pandas-profiling version

4.3.2

Dependencies

pandas==1.3.5
numpy==1.21.6

OS

Amazon linux 2

Checklist

  • There is not yet another bug report for this issue in the issue tracker
  • The problem is reproducible from this bug report. This guide can help to craft a minimal bug report.
  • The issue has not been resolved by the entries listed under Common Issues.
@waghts95
Copy link
Author

waghts95 commented Aug 3, 2023

Column names should be like count, percent, sum as Y-data profiling might use these names. So if such names are used, ambiguity is created.
My issue got resolved when I rename column name from 'count' to 'patient_count'.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants