Minimal Working Example
In this post, we walk you through a minimal working example using the DSPy library.
We make use of the GSM8K dataset and OpenAI's GPT-3.5 model (the `gpt-3.5-turbo-instruct` variant in the code below) to simulate prompting tasks within DSPy.
Setup
Before we jump into the example, let's ensure our environment is properly configured. We'll start by importing the necessary modules and configuring our language model:
```python
import dspy
from dspy.datasets.gsm8k import GSM8K, gsm8k_metric

# Set up the LM.
turbo = dspy.OpenAI(model='gpt-3.5-turbo-instruct', max_tokens=250)
dspy.settings.configure(lm=turbo)

# Load math questions from the GSM8K dataset.
gsm8k = GSM8K()
gsm8k_trainset, gsm8k_devset = gsm8k.train[:10], gsm8k.dev[:10]
```
Let's take a look at what `gsm8k_trainset` and `gsm8k_devset` are:

```python
print(gsm8k_trainset)
```
The `gsm8k_trainset` and `gsm8k_devset` datasets contain lists of `dspy.Example` objects, with each example having `question` and `answer` fields.
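To see the structure of a single example, you can index into the training set; a quick sketch (the printed text will be whatever example happens to be first in your split):

```python
# Inspect one example; each GSM8K example exposes `question` and `answer`.
example = gsm8k_trainset[0]
print(example.question)
print(example.answer)
```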
Define the Module
With our environment set up, let's define a custom program that utilizes the `ChainOfThought` module to perform step-by-step reasoning and generate answers:
```python
class CoT(dspy.Module):
    def __init__(self):
        super().__init__()
        self.prog = dspy.ChainOfThought("question -> answer")

    def forward(self, question):
        return self.prog(question=question)
```
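Before compiling anything, you can already run the module zero-shot as a sanity check. This isn't part of the walkthrough proper, and the question below is made up for illustration:

```python
# Run the uncompiled module once to confirm the pipeline works end to end.
raw_cot = CoT()
pred = raw_cot(question="A pen costs $2 and a notebook costs $5. How much do 3 pens and 2 notebooks cost?")
print(pred.answer)
```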
Compile and Evaluate the Model
With our simple program in place, let's move on to compiling it with the `BootstrapFewShot` teleprompter:
```python
from dspy.teleprompt import BootstrapFewShot

# Set up the optimizer: we want to "bootstrap" (i.e., self-generate) 4-shot examples of our CoT program.
config = dict(max_bootstrapped_demos=4, max_labeled_demos=4)

# Optimize! Use the `gsm8k_metric` here. In general, the metric is going to tell the optimizer how well it's doing.
teleprompter = BootstrapFewShot(metric=gsm8k_metric, **config)
optimized_cot = teleprompter.compile(CoT(), trainset=gsm8k_trainset)
```
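Compiling can take a little while, so you may want to persist the result. A minimal sketch, assuming your DSPy version exposes `save` on modules (the filename is purely illustrative):

```python
# Save the compiled program (its bootstrapped demos and prompt state)
# so it can be reloaded later without recompiling.
optimized_cot.save("optimized_cot.json")
```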
Note that `BootstrapFewShot` is not an optimizing teleprompter, i.e., it simply creates and validates examples for steps of the pipeline (in this case, chain-of-thought reasoning) but does not optimize the metric. Other teleprompters like `BootstrapFewShotWithRandomSearch` and `MIPRO` will apply direct optimization.
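If you do want that direct optimization, one option is to swap in `BootstrapFewShotWithRandomSearch`, which bootstraps several candidate demo sets and keeps the program that scores best on the metric. A sketch under the assumption that your DSPy version's constructor accepts the arguments below (check the signature, `num_candidate_programs` in particular):

```python
from dspy.teleprompt import BootstrapFewShotWithRandomSearch

# Same bootstrapping idea, but it samples several candidate programs
# and keeps the one that scores best on the metric.
rs_teleprompter = BootstrapFewShotWithRandomSearch(
    metric=gsm8k_metric,
    max_bootstrapped_demos=4,
    max_labeled_demos=4,
    num_candidate_programs=8,  # assumption: number of candidate programs to try
)
searched_cot = rs_teleprompter.compile(CoT(), trainset=gsm8k_trainset)
```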
Evaluate
Now that we have a compiled (optimized) DSPy program, let's evaluate its performance on the dev dataset.
```python
from dspy.evaluate import Evaluate

# Set up the evaluator, which can be used multiple times.
evaluate = Evaluate(devset=gsm8k_devset, metric=gsm8k_metric, num_threads=4, display_progress=True, display_table=0)

# Evaluate our `optimized_cot` program.
evaluate(optimized_cot)
```
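The call also returns an aggregate score, so you can capture it to compare runs. A small follow-up, assuming the return value is the metric's success rate over the dev set:

```python
# Capture the overall score (the percentage of dev examples
# on which gsm8k_metric accepts the prediction).
score = evaluate(optimized_cot)
print(f"Dev accuracy: {score}")
```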
Inspect the Model's History
For a deeper understanding of the model's interactions, we can review the most recent generations by inspecting the model's history:
```python
turbo.inspect_history(n=1)
```
And there you have it! You've successfully created a working example using the DSPy library.
This example showcases how to set up your environment, define a custom module, compile a model, and rigorously evaluate its performance using the provided dataset and teleprompter configurations.
Feel free to adapt and expand upon this example to suit your specific use case while exploring the extensive capabilities of DSPy.
If you want to try what you just built, run `optimized_cot(question='Your Question Here')`.
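For example, with a made-up question (the `answer` field access follows from the `"question -> answer"` signature defined above):

```python
# Ask the compiled program a new question; the returned Prediction
# exposes `answer` per the module's signature.
pred = optimized_cot(question="If a train travels 60 miles per hour for 3 hours, how far does it travel?")
print(pred.answer)
```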