Skip to content

fix: Fix error that occurs when special characters are removed from column names #78

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 18 commits into
base: main
Choose a base branch
from

Conversation

tashiro-akira
Copy link
Contributor

@tashiro-akira tashiro-akira commented Jun 28, 2024

@AkiraUra @kimusaku
Add action to eliminate duplicate column names when special characters are removed.

The results of the review were reflected.

・Review Comments
  Thank you for the PR. The code of rename_dict is duplicated from the preprocess one.
  Could you consider a way of removing the duplication?
  
  ・Corrected content
    The processing to modify the column name is now a common plug-in.

・Review Comments
  The current method fails when the renamed names are the same as original names.
  For example, there are original columns Age , Age{} and Age1.
  In the case, Age -> Age1, Age{} -> Age0, so there are two Age1 columns.
  Could you consider the case?
  
  ・Corrected content
    Fix column renaming to add a random number after the column name.
    Add handling to rename a column if it is re-duplicated after renaming the column.

@tashiro-akira tashiro-akira requested a review from a team as a code owner June 28, 2024 02:22
@tashiro-akira tashiro-akira requested review from kimusaku and AkiraUra and removed request for a team June 28, 2024 02:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant