LLM Prompting Framework

A Python framework for enhancing smaller language models (such as Llama 3.2 3B) with advanced prompting techniques, including chain-of-thought reasoning, reflection mechanisms, and structured reasoning that reduces hallucination.

Features

  • Chain of Thought (CoT): Guides the model through step-by-step reasoning
  • Reflection Mechanism: Enables the model to review and improve its initial responses
  • Structured Reasoning: Helps prevent hallucination by encouraging explicit reasoning and uncertainty acknowledgment
  • Benchmarking Tools: Evaluate and compare different prompting strategies

Installation

  1. Clone the repository:
git clone <repository-url>
cd llm_prompting_framework
  2. Install dependencies:
pip install -r requirements.txt
  3. Make sure you have Ollama installed and the Llama model downloaded:
ollama pull llama3.2:3b
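
You can confirm the model is available locally:
ollama list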

Usage

Basic Usage

import asyncio
from src.prompt_enhancer import PromptEnhancer

async def main():
    # Point the enhancer at the locally served Ollama model
    enhancer = PromptEnhancer(model_name="llama3.2:3b")
    # Each flag toggles one prompting strategy (see Prompting Strategies below)
    response = await enhancer.get_enhanced_response(
        "Your question here",
        use_cot=True,
        use_reflection=True,
        use_reasoning=True
    )
    print(response)

asyncio.run(main())

Running Benchmarks

import asyncio
from benchmarks.prompt_benchmark import PromptBenchmark

async def run_benchmark():
    benchmark = PromptBenchmark(model_name="llama3.2:3b")
    results = await benchmark.run_benchmark()
    benchmark.plot_results(results)  # chart comparing strategies
    benchmark.save_results(results)  # persist raw results

asyncio.run(run_benchmark())
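
To score strategies on your own questions rather than the built-in suite, a small harness can be built directly on PromptEnhancer. The sketch below is illustrative and not part of the framework: the question set, the containment-based scoring, and the strategy combinations are all assumptions.

import asyncio
from src.prompt_enhancer import PromptEnhancer

# Hypothetical mini question set; replace with your own (question, expected) pairs
QUESTIONS = [
    ("What is 17 * 24?", "408"),
    ("A train travels 60 km in 45 minutes. What is its speed in km/h?", "80"),
]

# Strategy combinations to compare: plain vs. CoT vs. CoT + reflection
STRATEGIES = {
    "baseline": dict(use_cot=False, use_reflection=False, use_reasoning=False),
    "cot": dict(use_cot=True, use_reflection=False, use_reasoning=False),
    "cot+reflection": dict(use_cot=True, use_reflection=True, use_reasoning=False),
}

async def compare():
    enhancer = PromptEnhancer(model_name="llama3.2:3b")
    for name, flags in STRATEGIES.items():
        correct = 0
        for question, expected in QUESTIONS:
            answer = await enhancer.get_enhanced_response(question, **flags)
            # Crude containment check; a real benchmark needs stricter answer parsing
            if expected in answer:
                correct += 1
        print(f"{name}: {correct}/{len(QUESTIONS)} correct")

asyncio.run(compare())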

Prompting Strategies

  1. Chain of Thought

    • Breaks down complex questions into steps
    • Shows intermediate reasoning
    • Improves answer accuracy
  2. Reflection Mechanism

    • Reviews initial responses
    • Identifies potential weaknesses
    • Suggests improvements
    • Provides revised answers
  3. Structured Reasoning

    • Explicitly states assumptions
    • Indicates confidence levels
    • Notes limitations and uncertainties
    • Reduces hallucination (see the illustrative prompt templates below)
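
The actual templates live in src/prompt_enhancer.py; the wrappers below are a minimal sketch of how each strategy can be phrased, and the function names are illustrative, not the framework's API.

def cot_prompt(question: str) -> str:
    # Chain of thought: ask for explicit intermediate steps
    return ("Answer the question step by step, showing your reasoning "
            f"before giving the final answer.\n\nQuestion: {question}")

def reflection_prompt(question: str, draft: str) -> str:
    # Reflection: a second pass that critiques and revises the first answer
    return (f"Question: {question}\n\nDraft answer: {draft}\n\n"
            "Review the draft for mistakes or weak reasoning, then give a revised answer.")

def structured_prompt(question: str) -> str:
    # Structured reasoning: force assumptions, confidence, and limitations into the output
    return (f"Question: {question}\n\n"
            "Respond with these sections:\n"
            "Assumptions:\nReasoning:\nConfidence (low/medium/high):\n"
            "Limitations:\nFinal answer:")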

Benchmarking

Primary Conclusions

  • Small models are limited in their reasoning ability: llama3.2:3b could not handle math or multi-step reasoning reliably, even with enhanced prompts.
  • llama3.1:8b can do math and basic reasoning, but it is still far from perfect.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT License
