Documentation revamp #232

luizhsuperti · 2025-03-23T11:42:30Z

Proposal for Documentation Reorganization

linked issue
I was thinking about reorganizing the documentation to improve clarity and usability. Here’s the proposed structure:

Home (README)

Quickstart (Installation + Package Overview: Splitter, Perturbator, and Analysis classes)
Documentation (APIs, details on classes and methods)
Usage Examples (Notebooks with practical examples)
Contribution (Guidelines for contributing)
In my fork,
👉 [I revamped the README to be more appealing to a broader audience
I also modified the Quickstart section to align with this idea

I’d love your feedback on whether the Quickstart section information is accurate—I’m still new to the package, so there might be some errors in definitions or concepts. (The Python notebooks currently in the docs are safe and should still be included in some form.)

For inspiration, I looked at the documentation structures of:

Ambrosia
pysurvival

Why This Change?

Makes it easier for new users to navigate and understand the package.
Provides a clearer structure for future contributors.
The examples section can be refined over time for better clarity and accuracy.
In the future, we could also add a "Stats 101" section for foundational concepts.
Let me know what you think!

luizhsuperti · 2025-03-23T11:43:43Z

Probably is worth to add Experiment analysis scorecards

david26694

Hey @luizhsuperti , thanks for the PR! I've had a quick read, let me know what you think

david26694 · 2025-03-24T07:28:49Z

README.md

-power_normal = npw.power_analysis(df, average_effect=0.1)
-power_line_normal = npw.power_line(df, average_effects=[0.1, 0.2, 0.3])
+### 📌 **Experiment Design & Planning**  
+- **Power analysis** and **Minimal Detectable Effect (MDE)** estimation  


I'd add something on simulation-based power analysis

david26694 · 2025-03-24T07:28:58Z

README.md

+- **Power analysis** and **Minimal Detectable Effect (MDE)** estimation  
+- Support for **complex experimental designs**, including:  
+  - 🏢 **Cluster randomization**  
+  - 🔄 **Switchback experiments**  


I'd add variance reduction

david26694 · 2025-03-24T07:29:28Z

README.md

+  - 🏢 **Cluster randomization**  
+  - 🔄 **Switchback experiments**  
+
+### 🛠 **Data Preprocessing**  


I'd remove this section, I don't think the pandas integration is very relevant nor there are tools for data preparation in the lib

david26694 · 2025-03-24T07:29:49Z

README.md


+### 📊 **Comprehensive Experiment Analysis**  
+##### **✅ Metrics**  


I'd drop the metrics one for now since it looks like we have a bug (see last issue)

david26694 · 2025-03-24T07:30:41Z

README.md

+- 📌 **Generalized Estimating Equations (GEE)**  
+- 📌 **Mixed Linear Models** for robust inference  
+- 📌 **Ordinary Least Squares (OLS)** and **Clustered OLS** with covariates  
+- 📌 **T-tests** with variance reduction techniques (**CUPED, CUPAC**)  


I'd merge this and the one above, and not mention t-tests since mostly its OLS with covariates, cuped, cupac

david26694 · 2025-03-24T07:31:43Z

README.md

+📦 **Installation:**  
+```sh
+pip install cluster-experiments
+=======
 # MDE calculation
 mde = npw.mde(df, power=0.8)


for the MDE example, I have to asks: needs to be reproducible (so dataframe needs to be created), and show the methods power_analysis, mde, power_line and mde_line. wdyt?

david26694 · 2025-03-24T07:32:16Z

docs/quickstart.md

+```
+
+!!! info "Python Version Support"
+    **Cluster Experiments** requires **Python 3.9 or higher**. Make sure your environment meets this requirement before proceeding with the installation.


it's 3.8 I think

david26694 · 2025-03-24T07:33:24Z

README.md

-## Quick Start
-
-### Power Analysis Example
+**`cluster experiments`** is a comprehensive Python library for end-to-end A/B testing workflows, designed for seamless integration with Pandas in production environments.  


designed for seamless integration with Pandas in production environments.
I'd remove any production mention, I don't think it's fair to call this production. "seamless integration" sounds generated by an LLM, do you have a more natural equivalent?

david26694 · 2025-03-24T07:34:49Z

docs/quickstart.md

+Designing and analyzing experiments can feel overwhelming at times. After formulating a testable hypothesis,
+you're faced with a series of routine tasks. From collecting and transforming raw data to measuring the statistical significance of your experiment results and constructing confidence intervals,
+it can quickly become a repetitive and error-prone process.
+*Cluster Experiments* is here to change that. Built on top of well-known packages like `pandas`, `numpy`, `scipy` and `statsmodels`,  it automates the core steps of an experiment, streamlining your workflow, saving you time and effort, while maintaining statistical rigor.


I'd make the paragraph shorter and stress what it automates: being MDE/power calculation and inference scorecards

given the next examples, I think it's worth mentioning that you're describing the simulaiton-based power analysis, and there are other pipelines like power analysis based on normal approximation and scorecard generation

I like the explanation style, maybe you could write a similar thing for NormalPowerAnalysis and AnalysisPlan

david26694 · 2025-03-24T07:42:49Z

docs/quickstart.md

+```python
+from cluster_experiments import TTestClusteredAnalysis
+
+analysis = TTestClusteredAnalysis(


let's use ClusteredOLS, I think this analysis method is a bit weird

david26694 · 2025-03-26T14:12:06Z

Hey @luizhsuperti, I was playing with this and found an issue, all examples are under switchback, whenever you can have a look please :)

luizhsuperti and others added 5 commits February 8, 2025 17:03

Update .gitignore

5c6558c

Revamp documentation

b6babcc

Add API subfolder

6363d80

Merge branch 'david26694:main' into pilot-branch

aa04ca8

Merge branch 'main' into documentation-revamp

a9673c0

david26694 requested changes Mar 24, 2025

View reviewed changes

david26694 linked an issue Mar 26, 2025 that may be closed by this pull request

Reorg docs #225

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Documentation revamp #232

Documentation revamp #232

luizhsuperti commented Mar 23, 2025 •

edited

Loading

luizhsuperti commented Mar 23, 2025

david26694 left a comment

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 Mar 24, 2025

david26694 commented Mar 26, 2025


		### 📊 Comprehensive Experiment Analysis
		##### ✅ Metrics

Documentation revamp #232

Are you sure you want to change the base?

Documentation revamp #232

Conversation

luizhsuperti commented Mar 23, 2025 • edited Loading

Proposal for Documentation Reorganization

Home (README)

Why This Change?

luizhsuperti commented Mar 23, 2025

david26694 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

david26694 commented Mar 26, 2025

luizhsuperti commented Mar 23, 2025 •

edited

Loading