Size of set of random data #43

Open
uwegeercken opened this issue Jan 20, 2021 · 0 comments

Problem description

Sometimes when random data is generated (e.g. 1000 rows), it is desirable to have some duplicates in the data set. For example, when random data for "workplace" is generated, the user may want duplicate values in the "company" field across the 1000 rows. Today the size of the sample set that values are drawn from is unknown.

The generators should allow setting the size of the sample set that generated values are drawn from. If, for example, 1000 rows are generated from a sample set of 100 values, one would expect duplicates, since there are more rows than distinct values.
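To illustrate the expected behaviour, here is a minimal sketch in plain Python. It does not use this project's API: `sample_with_pool`, `pool_size` and the `companies` list are hypothetical names chosen for the example. The idea is to first pick a fixed pool of distinct values and then draw every row from that pool.

```python
import random


def sample_with_pool(values, pool_size, rows, seed=None):
    """Draw `rows` values from a pool of `pool_size` distinct values.

    Hypothetical helper for illustration only, not part of the project.
    """
    rng = random.Random(seed)
    # Pick the limited pool of distinct values once.
    pool = rng.sample(values, pool_size)
    # Draw each row from the pool, with replacement.
    return [rng.choice(pool) for _ in range(rows)]


# Made-up source data: 5000 distinct company names.
companies = [f"Company {i}" for i in range(5000)]

generated = sample_with_pool(companies, pool_size=100, rows=1000, seed=42)
distinct_count = len(set(generated))
```

With 1000 rows and a pool of 100, `distinct_count` can never exceed 100, so the result necessarily contains duplicates.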

Preferred solution

Allow a limit to be defined for each generated field, e.g. as a list of dicts, each specifying the name/key of the field and the value of its limit.
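One possible shape for such a configuration is sketched below. The dict keys `"field"` and `"limit"`, the function `generate_rows` and the sample data are all assumptions made for this example; the project may prefer different names. Fields without an entry in the list keep their full set of values.

```python
import random

# Hypothetical configuration: one dict per field that should be limited.
limits = [
    {"field": "company", "limit": 100},
    {"field": "city", "limit": 20},
]


def generate_rows(sources, limits, rows, seed=None):
    """Generate `rows` dicts, restricting each configured field to `limit` distinct values."""
    rng = random.Random(seed)
    limit_by_field = {entry["field"]: entry["limit"] for entry in limits}

    pools = {}
    for field, values in sources.items():
        values = list(values)
        limit = limit_by_field.get(field)
        if limit is None:
            # No limit configured: draw from all available values.
            pools[field] = values
        else:
            # Never ask for more distinct values than exist.
            pools[field] = rng.sample(values, min(limit, len(values)))

    return [
        {field: rng.choice(pool) for field, pool in pools.items()}
        for _ in range(rows)
    ]


# Made-up source data for illustration.
sources = {
    "company": [f"Company {i}" for i in range(5000)],
    "city": [f"City {i}" for i in range(500)],
    "employee_id": list(range(100000)),
}

data = generate_rows(sources, limits, rows=1000, seed=1)
```

Here `company` takes at most 100 distinct values and `city` at most 20 across the 1000 rows, while `employee_id` remains unrestricted.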

Alternatives

A less flexible alternative would be to set a single value per generator, so that all fields it generates share the same limit.
