Summarizing Text with Map-Reduce in LLM-Chain
Having problems? Don't worry, reach out on Discord and we will help you out.
Map-reduce is a powerful technique for processing and aggregating data in parallel: each piece of the input is handled independently in a "map" step, and the partial results are then combined in a "reduce" step. In this tutorial, we'll explore how to use map-reduce in llm-chain to summarize text effectively, building a basic map-reduce chain for text summarization.
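Before bringing in any LLM calls, it may help to see the bare pattern in plain Rust. The following is a toy sketch with made-up chunk strings and a stand-in "summary" (just the first sentence of each chunk); it is not llm-chain code, only an illustration of a map step followed by a reduce step:

// Toy, LLM-free sketch of map-reduce: "map" turns each chunk of input into a
// partial result independently, "reduce" folds the partial results together.
fn main() {
    // Made-up input chunks, purely for illustration.
    let chunks = vec![
        "Rust is a systems programming language. It focuses on safety.",
        "llm-chain helps you build chains of LLM calls. It supports map-reduce.",
    ];

    // Map: handle every chunk on its own (this is the part that can run in parallel).
    let partial: Vec<&str> = chunks
        .iter()
        .map(|chunk| chunk.split('.').next().unwrap_or("").trim())
        .collect();

    // Reduce: fold the partial results into one combined output.
    let combined = partial.join("; ");
    println!("{combined}");
}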
To start, create a file named "article_to_summarize.md" in the same directory as your source file, and paste the content of a Wikipedia article into it.
Here's a Rust program that demonstrates how to create a map-reduce chain for summarizing text:
use llm_chain::chains::map_reduce::Chain;
use llm_chain::step::Step;
use llm_chain::{executor, parameters, prompt, Parameters};

#[tokio::main(flavor = "current_thread")]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Create a new ChatGPT executor with the default settings
    let exec = executor!()?;

    // Create the "map" step to summarize an article into bullet points
    let map_prompt = Step::for_prompt_template(prompt!(
        "You are a bot for summarizing wikipedia articles, you are terse and focus on accuracy",
        "Summarize this article into bullet points:\n{{text}}"
    ));

    // Create the "reduce" step to combine multiple summaries into one
    let reduce_prompt = Step::for_prompt_template(prompt!(
        "You are a diligent bot that summarizes text",
        "Please combine the articles below into one summary as bullet points:\n{{text}}"
    ));

    // Create a map-reduce chain with the map and reduce steps
    let chain = Chain::new(map_prompt, reduce_prompt);

    // Load the content of the article to be summarized
    let article = include_str!("article_to_summarize.md");

    // Create a vector with the Parameters object containing the text of the article
    let docs = vec![parameters!(article)];

    // Run the chain with the provided documents and an empty Parameters object for the "reduce" step
    let res = chain.run(docs, Parameters::new(), &exec).await.unwrap();

    // Print the result to the console
    println!("{}", res.to_immediate().await?.as_content());

    Ok(())
}
Let's break down the code and understand the different parts:
- Define the map and reduce prompts as Step objects:
  - The map_prompt summarizes a given article into bullet points.
  - The reduce_prompt combines multiple summaries into a single summary as bullet points.
- Create a new map-reduce Chain by providing the map_prompt and reduce_prompt.
- Load the article to be summarized and create a Parameters object with the text (a multi-document variant is sketched after this list).
- Execute the map-reduce Chain with the provided Parameters and store the result in res.
- Print the LLM response to the console.
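Because the map step works on each entry of the docs vector separately, summarizing several articles at once should just be a matter of adding more Parameters entries. A minimal sketch, assuming a hypothetical second file named "second_article.md" sits next to the first:

// Hypothetical variant: one Parameters entry per document, so the "map" step
// summarizes each article on its own before the "reduce" step merges the results.
// "second_article.md" is an assumed extra file, not part of the original example.
let article_one = include_str!("article_to_summarize.md");
let article_two = include_str!("second_article.md");
let docs = vec![parameters!(article_one), parameters!(article_two)];
let res = chain.run(docs, Parameters::new(), &exec).await.unwrap();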
This should be able to summarize any Wikipedia article you might find. Play around with the prompt templates to best fit your use case; one possible tweak is sketched below.
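For instance, keeping the same Step::for_prompt_template and prompt! calls, you could ask the map step for a short paragraph instead of bullet points (the wording below is just one made-up variation):

// Hypothetical variant of the "map" step: same API, different instructions.
let map_prompt = Step::for_prompt_template(prompt!(
    "You are a bot for summarizing wikipedia articles, you are terse and focus on accuracy",
    "Summarize this article in three short sentences:\n{{text}}"
));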
That's it, folks! Thanks for following along with the tutorial. You are now ready to use llm-chain
for something useful. Don't forget to stop by Discord and share what you are making.