This directory conducts federated instruction tuning with a pretrained SmolLM2-360M-Instruct model on a Finance dataset. We use Flower Datasets to download, partition, and preprocess the dataset. Flower's Simulation Engine is used to simulate the LLM fine-tuning process in a federated way, which allows users to perform the training on a single GPU.
The fine-tuning results have been submitted as a PEFT adapter and can be accessed here:
https://huggingface.co/ethicalabs/FlowerTune-SmolLM2-360M-Instruct-Finance-PEFT
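As a quick, illustrative sketch (not part of this repository), the published adapter can be attached to the base model with 🤗Transformers and PEFT along these lines; the prompt and generation settings below are only examples:

```python
# Sketch: load the published PEFT adapter on top of the base model.
# Assumes `transformers` and `peft` are installed; generation settings are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "HuggingFaceTB/SmolLM2-360M-Instruct"
adapter_id = "ethicalabs/FlowerTune-SmolLM2-360M-Instruct-Finance-PEFT"

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base_model, adapter_id)  # attach the fine-tuned adapter

prompt = "What does EBITDA stand for?"  # example prompt only
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```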
This experiment performs federated LLM fine-tuning with DoRA using the 🤗PEFT library. The clients' models are aggregated with the FedAvg strategy.
For the HuggingFaceTB/SmolLM2-360M-Instruct model I adopted the following fine-tuning methodology (a sketch of how these settings map onto the PEFT API follows the list):

- Precision: `bf16` for model weights.
- Quantization: `4-bit` quantization for reduced memory usage.
- Optimizer: `paged_adamw_8bit`
- DoRA Configuration:
  - Rank (r): `32`
  - Alpha: `64`
  - Target Modules: `down_proj`, `up_proj`, `gate_proj`
- Training Configuration:
  - Batch size: `16`
  - Maximum number of steps: `8`
  - Total number of rounds: `24`
  - Fraction fit per round: `0.1`
- Learning Rate Scheduler:
  - Cosine Annealing over rounds, where:
    - Maximum LR: `2e-4`
    - Minimum LR: `6e-6`
  - Constant learning rate scheduler over steps
- Strategy: FedAvg
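The listed hyperparameters might translate to the 🤗PEFT API and a per-round cosine schedule roughly as sketched below; the helper function name and exact arguments are illustrative and may differ from this project's actual code:

```python
# Sketch of the DoRA adapter config and per-round cosine LR schedule described above.
# Values come from the list above; helper names here are illustrative, not this repo's code.
import math

from peft import LoraConfig

dora_config = LoraConfig(
    r=32,
    lora_alpha=64,
    target_modules=["down_proj", "up_proj", "gate_proj"],
    use_dora=True,          # weight-decomposed LoRA (DoRA)
    task_type="CAUSAL_LM",
)


def cosine_lr_for_round(server_round: int, num_rounds: int = 24,
                        lr_max: float = 2e-4, lr_min: float = 6e-6) -> float:
    """Cosine-annealed learning rate across rounds; held constant within a round."""
    progress = (server_round - 1) / max(num_rounds - 1, 1)
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * progress))
```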
[Figure: training loss plot from the experiment]
- FiQA: n/a %
- FPB: n/a %
- TFNS: n/a %
- Average: n/a %
The evaluation was conducted on an ...
n/a MB
Project dependencies are defined in `pyproject.toml`. Install them in an activated Python environment with:

```bash
pip install -e .
pip install flash-attn --no-build-isolation  # Install FlashAttention-2
```
The dataset is divided into 50 partitions in an IID fashion, and a partition is assigned to each ClientApp. We randomly sample a fraction (0.1) of the total nodes to participate in each round, for a total of 12 rounds.
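For reference, IID partitioning into 50 shards with Flower Datasets generally looks like the sketch below; the dataset identifier is a placeholder, not necessarily the one used in this project:

```python
# Sketch: IID partitioning of a Hugging Face dataset into 50 client shards with Flower Datasets.
# "your-org/finance-dataset" is a placeholder identifier, not this project's actual dataset.
from flwr_datasets import FederatedDataset
from flwr_datasets.partitioner import IidPartitioner

partitioner = IidPartitioner(num_partitions=50)
fds = FederatedDataset(
    dataset="your-org/finance-dataset",  # placeholder Hugging Face dataset id
    partitioners={"train": partitioner},
)

partition = fds.load_partition(partition_id=0)  # each ClientApp trains on one partition
print(partition)
```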
All settings are defined in `pyproject.toml`.
> [!IMPORTANT]
> Please note that `[tool.flwr.app.config.static]` and `options.num-supernodes` under `[tool.flwr.federations.local-simulation]` are not allowed to be modified for fair competition if you plan to participate in the LLM leaderboard.
Run the challenge with the default config values. The configs are defined in the `[tool.flwr.app.config]` entry of `pyproject.toml`, and are loaded automatically.

```bash
flwr run
```
Please check `flowertune-eval-finance`.
The global PEFT model checkpoints are saved every 5 rounds after aggregation on the server side by default. This interval can be specified with `train.save-every-round` under the `[tool.flwr.app.config]` entry in `pyproject.toml`.
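For illustration only, the server-side saving behaviour described above can be implemented by hooking into the strategy's aggregation step, roughly as in the sketch below; the class name, key-ordering assumption, and save path are assumptions rather than this project's actual code:

```python
# Sketch: a FedAvg variant that writes the aggregated PEFT adapter to disk every N rounds.
# Class name, save path, and the key-ordering assumption are illustrative, not this repo's code.
from collections import OrderedDict

import torch
from flwr.common import parameters_to_ndarrays
from flwr.server.strategy import FedAvg
from peft import get_peft_model_state_dict, set_peft_model_state_dict


class SavingFedAvg(FedAvg):
    """FedAvg that saves the aggregated PEFT adapter every `save_every_round` rounds."""

    def __init__(self, peft_model, save_every_round: int = 5, **kwargs):
        super().__init__(**kwargs)
        self.peft_model = peft_model
        self.save_every_round = save_every_round

    def aggregate_fit(self, server_round, results, failures):
        parameters, metrics = super().aggregate_fit(server_round, results, failures)
        if parameters is not None and server_round % self.save_every_round == 0:
            # Map the aggregated arrays back onto the adapter's state dict
            # (assumes clients send adapter weights in this same key order).
            keys = get_peft_model_state_dict(self.peft_model).keys()
            ndarrays = parameters_to_ndarrays(parameters)
            state_dict = OrderedDict(
                {k: torch.tensor(v) for k, v in zip(keys, ndarrays)}
            )
            set_peft_model_state_dict(self.peft_model, state_dict)
            self.peft_model.save_pretrained(f"peft_round_{server_round}")
        return parameters, metrics
```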