Differential Privacy-Driven Federated Learning for Large Language Models in HMI Systems

The GLUE fine-tuning code uses the Hugging Face Transformers library and is adapted from https://github.com/huggingface/transformers/blob/main/examples/pytorch/text-classification/run_glue.py
The federated learning environment uses the Flower AI framework:
https://flowerai.net/docs/framework/index.html

Install dependencies

pip install -r requirements.txt

Experiments

To reproduce the experiments in the paper, run:

./script.sh

Noise Calculation

We used the project at https://github.com/star-ailab/FSRDP to find the appropriate noise standard deviation for each privacy accountant. To compute the noise standard deviation for the different accountants, run:

python ./noise_calculation/get_noise.py

target_epsilons and dataset_size_list are configurable in get_noise.py.
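
To illustrate what "finding the noise standard deviation for a target epsilon" involves, here is a minimal sketch. It is not the FSRDP accountant and not the code in get_noise.py: it assumes a plain Gaussian mechanism with sensitivity 1, no subsampling amplification, and the standard Rényi DP composition bound. The function names, the step count, and the delta value are hypothetical and chosen only for illustration.

```python
import math


def rdp_epsilon(sigma, steps, delta, orders=range(2, 257)):
    """Upper bound on epsilon for `steps` compositions of a Gaussian mechanism.

    Assumes sensitivity 1 and no subsampling. Each step is
    (alpha, alpha / (2 * sigma**2))-RDP; RDP composes additively and is
    converted to (epsilon, delta)-DP with the log(1/delta) / (alpha - 1) term.
    """
    return min(
        alpha * steps / (2 * sigma ** 2) + math.log(1 / delta) / (alpha - 1)
        for alpha in orders
    )


def find_sigma(target_epsilon, steps, delta, lo=0.01, hi=1000.0, tol=1e-4):
    """Bisect for a small sigma whose epsilon bound is at most target_epsilon.

    Assumes the bound at `hi` already satisfies the target, which holds for
    the illustrative values used below.
    """
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if rdp_epsilon(mid, steps, delta) > target_epsilon:
            lo = mid  # too little noise: epsilon is still above the target
        else:
            hi = mid  # enough noise: try a smaller sigma
    return hi


if __name__ == "__main__":
    # Illustrative values only; the real settings live in get_noise.py.
    for eps in (6.0, 10.0):
        print(eps, find_sigma(eps, steps=100, delta=1e-5))
```

A tighter accountant yields a smaller sigma for the same epsilon, which is why the choice of accountant matters for model utility.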

Single Experiment

python federated.py \
  --model_name_or_path google-bert/bert-base-cased \
  --max_seq_length 128 \
  --task_name SST2 \
  --partition_policy Linear \
  --per_device_train_batch_size 550 \
  --learning_rate 2e-5 \
  --output_dir /tmp/SST2/

model_name_or_path is the base model.
task_name is the dataset, which can be SST2, QNLI, or QQP.
partition_policy can be Iid, Linear, Square, or Exp.
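
The partition policies control how unevenly the training data is split across clients. The sketch below shows one plausible reading of the four names, assuming that client i (counting from 1) receives a share proportional to 1, i, i squared, or exp(i). This mapping and the helper partition_sizes are assumptions for illustration, not taken from federated.py.

```python
import math

# Hypothetical size functions keyed by the policy names listed above.
SIZE_FUNCTIONS = {
    "Iid": lambda i: 1.0,
    "Linear": lambda i: float(i),
    "Square": lambda i: float(i) ** 2,
    "Exp": lambda i: math.exp(i),
}


def partition_sizes(policy, num_samples, num_clients):
    """Split num_samples across clients in proportion to the policy's size function."""
    weights = [SIZE_FUNCTIONS[policy](i) for i in range(1, num_clients + 1)]
    total = sum(weights)
    sizes = [int(num_samples * w / total) for w in weights]
    # Samples lost to rounding down go to the last client.
    sizes[-1] += num_samples - sum(sizes)
    return sizes


if __name__ == "__main__":
    for policy in SIZE_FUNCTIONS:
        print(policy, partition_sizes(policy, 1000, 4))
```

Under this reading, Iid gives every client the same amount of data, while Exp concentrates most samples on the last few clients.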

Citation

Please cite our paper if you find this repository helpful.

@misc{ahmadi2025interactiveframeworkimplementingprivacypreserving,
      title={An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models}, 
      author={Kasra Ahmadi and Rouzbeh Behnia and Reza Ebrahimi and Mehran Mozaffari Kermani and Jeremiah Birrell and Jason Pacheco and Attila A Yavuz},
      year={2025},
      eprint={2502.08008},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2502.08008}, 
}

