The most surprising thing about cross-language prompts is that the model often performs better when you ask it to translate and then answer, rather than asking it to answer directly in the target language.

Let’s see this in action. Imagine we want to get a summary of a news article, but we want it in French, and the article is in English.

Here’s a direct prompt:

Summarize the following article in French:

[Insert English Article Text Here]

Now, consider a two-step prompt:

Translate the following article to French, and then provide a summary of the translated text:

[Insert English Article Text Here]

You might think the direct prompt is more efficient. However, many large language models (LLMs) have been trained on vast amounts of parallel corpora (text and its translation). Their translation capabilities are often exceptionally robust. By explicitly asking for translation first, you leverage this strength. The model first generates a high-quality French version of the article, and then summarizes that French text. This process can lead to a more nuanced and accurate summary in the target language because the model isn’t trying to perform both translation and summarization simultaneously, potentially losing fidelity in one or both tasks.

The mental model here is about task decomposition. LLMs are powerful, but complex, multi-step reasoning can sometimes lead to errors. By breaking down a complex request into simpler, well-defined sub-tasks that the model excels at, you can improve the overall outcome. In this case, the sub-tasks are translation and summarization.

Here’s how the internal process might look. When you give the two-step prompt:

  1. Translation Phase: The model identifies the source language (English) and the target language (French). It accesses its internal representations and knowledge gained from training on parallel text to produce a French rendition of the input. This is a highly optimized path for the model.
  2. Summarization Phase: The model then treats the generated French text as its new input. It applies its summarization capabilities to this French text, producing a concise summary in French.

The levers you control are:

  • Explicit Instruction: Clearly stating "Translate to X, then summarize" guides the model’s execution path.
  • Input Language: The source language of the original text.
  • Target Language: The desired output language.
  • Model Choice: Different models have varying strengths in translation and summarization. Some might handle direct cross-lingual summarization better than others.
  • Prompt Formatting: Using clear delimiters and instructions, as shown in the examples, helps the model parse the request.

The key is that the model’s ability to translate accurately is often superior to its ability to perform a complex cross-lingual inference task directly. By separating these, you’re essentially using two of its best skills in sequence.

One thing that often surprises developers is how much the order of operations within a prompt can matter, even if the final desired output appears to be the same. The model doesn’t just "understand" the end goal; it executes a sequence of internal operations. When you provide a prompt like "Summarize this English text in French," the model might attempt a single, complex inference path. However, when you say "Translate this English text to French, then summarize the French text," you are implicitly directing it down a path that involves two distinct, and often more robust, internal processes. This is akin to a human expert breaking down a complex problem into simpler steps to ensure accuracy.

The next concept you’ll likely explore is how to handle cultural nuance and idiomatic expressions effectively in cross-language prompts, especially when the model’s training data might not fully capture them.

Want structured learning?

Take the full Prompt-engineering course →