diff --git a/README.md b/README.md
index 31e16d6..2e84493 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,6 @@
 # Domain Adapted Language Modeling Toolkit
 
-## Manifesto
+## Overview
 
 A great rift has emerged between general LLMs and the vector stores that are providing them with contextual information. The unification of these systems is an important step in grounding AI systems in efficient, factual domains, where they are utilized not only for their generality, but for their specificity and uniqueness. To this end, we are excited to open source the Arcee Domain Adapted Language Model (DALM) toolkit for developers to build on top of our Arcee open source Domain Pretrained (DPT) LLMs. We believe that our efforts will help as we begin next phase of language modeling, where organizations deeply tailor AI to operate according to their unique intellectual property and worldview.
 
diff --git a/dalm/datasets/qa_gen/question_answer_generation.py b/dalm/datasets/qa_gen/question_answer_generation.py
index fbce902..8201e3c 100644
--- a/dalm/datasets/qa_gen/question_answer_generation.py
+++ b/dalm/datasets/qa_gen/question_answer_generation.py
@@ -19,7 +19,7 @@
 
 
 def parse_args() -> argparse.Namespace:
-    parser = argparse.ArgumentParser(description="Generate question answer pairs from the dataset of passages")
+    parser = argparse.ArgumentParser(description="Generate question answer pairs from the dataset of passages ")
     parser.add_argument(
         "--dataset_path",
         type=str,