Rules of Workflows
A Workflow contains one or more steps. Each step is a self-contained, individually retriable component of a Workflow. Steps may emit (optional) state that allows a Workflow to persist and continue from that step, even if a Workflow fails due to a network or infrastructure issue.
This is a small guidebook on how to build more resilient and correct Workflows.
Because a step might be retried multiple times, your steps should (ideally) be idempotent. For context, idempotency is a logical property where the operation (in this case a step), can be applied multiple times without changing the result beyond the initial application.
As an example, let us assume you have a Workflow that charges your customers, and you really do not want to charge them twice by accident. Before charging them, you should check if they were already charged:
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    const customer_id = 123456;    // โ
 Good: Non-idempotent API/Binding calls are always done **after** checking if the operation is    // still needed.    await step.do(      `charge ${customer_id} for its monthly subscription`,      async () => {        // API call to check if customer was already charged        const subscription = await fetch(          `https://payment.processor/subscriptions/${customer_id}`,        ).then((res) => res.json());
        // return early if the customer was already charged, this can happen if the destination service dies        // in the middle of the request but still commits it, or if the Workflows Engine restarts.        if (subscription.charged) {          return;        }
        // non-idempotent call, this operation can fail and retry but still commit in the payment        // processor - which means that, on retry, it would mischarge the customer again if the above checks        // were not in place.        return await fetch(          `https://payment.processor/subscriptions/${customer_id}`,          {            method: "POST",            body: JSON.stringify({ amount: 10.0 }),          },        );      },    );  }}export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    const customer_id = 123456;    // โ
 Good: Non-idempotent API/Binding calls are always done **after** checking if the operation is    // still needed.    await step.do(      `charge ${customer_id} for its monthly subscription`,      async () => {        // API call to check if customer was already charged        const subscription = await fetch(          `https://payment.processor/subscriptions/${customer_id}`,        ).then((res) => res.json());
        // return early if the customer was already charged, this can happen if the destination service dies        // in the middle of the request but still commits it, or if the Workflows Engine restarts.        if (subscription.charged) {          return;        }
        // non-idempotent call, this operation can fail and retry but still commit in the payment        // processor - which means that, on retry, it would mischarge the customer again if the above checks        // were not in place.        return await fetch(          `https://payment.processor/subscriptions/${customer_id}`,          {            method: "POST",            body: JSON.stringify({ amount: 10.0 }),          },        );      },    );  }}Steps should be as self-contained as possible. This allows your own logic to be more durable in case of failures in third-party APIs, network errors, and so on.
You can also think of it as a transaction, or a unit of work.
- โ Minimize the number of API/binding calls per step (unless you need multiple calls to prove idempotency).
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // โ
 Good: Unrelated API/Binding calls are self-contained, so that in case one of them fails    // it can retry them individually. It also has an extra advantage: you can control retry or    // timeout policies for each granular step - you might not to want to overload http.cat in    // case of it being down.    const httpCat = await step.do("get cutest cat from KV", async () => {      return await env.KV.get("cutest-http-cat");    });
    const image = await step.do("fetch cat image from http.cat", async () => {      return await fetch(`https://http.cat/${httpCat}`);    });  }}export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // โ
 Good: Unrelated API/Binding calls are self-contained, so that in case one of them fails    // it can retry them individually. It also has an extra advantage: you can control retry or    // timeout policies for each granular step - you might not to want to overload http.cat in    // case of it being down.    const httpCat = await step.do("get cutest cat from KV", async () => {      return await env.KV.get("cutest-http-cat");    });
    const image = await step.do("fetch cat image from http.cat", async () => {      return await fetch(`https://http.cat/${httpCat}`);    });  }}Otherwise, your entire Workflow might not be as durable as you might think, and you may encounter some undefined behaviour. You can avoid them by following the rules below:
- ๐ด Do not encapsulate your entire logic in one single step.
- ๐ด Do not call separate services in the same step (unless you need it to prove idempotency).
- ๐ด Do not make too many service calls in the same step (unless you need it to prove idempotency).
- ๐ด Do not do too much CPU-intensive work inside a single step - sometimes the engine may have to restart, and it will start over from the beginning of that step.
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // ๐ด Bad: you are calling two separate services from within the same step. This might cause    // some extra calls to the first service in case the second one fails, and in some cases, makes    // the step non-idempotent altogether    const image = await step.do("get cutest cat from KV", async () => {      const httpCat = await env.KV.get("cutest-http-cat");      return fetch(`https://http.cat/${httpCat}`);    });  }}export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // ๐ด Bad: you are calling two separate services from within the same step. This might cause    // some extra calls to the first service in case the second one fails, and in some cases, makes    // the step non-idempotent altogether    const image = await step.do("get cutest cat from KV", async () => {      const httpCat = await env.KV.get("cutest-http-cat");      return fetch(`https://http.cat/${httpCat}`);    });  }}Workflows may hibernate and lose all in-memory state. This will happen when engine detects that there is no pending work and can hibernate until it needs to wake-up (because of a sleep, retry, or event).
This means that you should not store state outside of a step:
function getRandomInt(min, max) {  const minCeiled = Math.ceil(min);  const maxFloored = Math.floor(max);  return Math.floor(Math.random() * (maxFloored - minCeiled) + minCeiled); // The maximum is exclusive and the minimum is inclusive}
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // ๐ด Bad: `imageList` will be not persisted across engine's lifetimes. Which means that after hibernation,    // `imageList` will be empty again, even though the following two steps have already ran.    const imageList = [];
    await step.do("get first cutest cat from KV", async () => {      const httpCat = await env.KV.get("cutest-http-cat-1");
      imageList.append(httpCat);    });
    await step.do("get second cutest cat from KV", async () => {      const httpCat = await env.KV.get("cutest-http-cat-2");
      imageList.append(httpCat);    });
    // A long sleep can (and probably will) hibernate the engine which means that the first engine lifetime ends here    await step.sleep("๐ค๐ค๐ค๐ค", "3 hours");
    // When this runs, it will be on the second engine lifetime - which means `imageList` will be empty.    await step.do(      "choose a random cat from the list and download it",      async () => {        const randomCat = imageList.at(getRandomInt(0, imageList.length));        // this will fail since `randomCat` is undefined because `imageList` is empty        return await fetch(`https://http.cat/${randomCat}`);      },    );  }}function getRandomInt(min, max) {  const minCeiled = Math.ceil(min);  const maxFloored = Math.floor(max);  return Math.floor(Math.random() * (maxFloored - minCeiled) + minCeiled); // The maximum is exclusive and the minimum is inclusive}
export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // ๐ด Bad: `imageList` will be not persisted across engine's lifetimes. Which means that after hibernation,    // `imageList` will be empty again, even though the following two steps have already ran.    const imageList: string[] = [];
    await step.do("get first cutest cat from KV", async () => {      const httpCat = await env.KV.get("cutest-http-cat-1");
      imageList.append(httpCat);    });
    await step.do("get second cutest cat from KV", async () => {      const httpCat = await env.KV.get("cutest-http-cat-2");
      imageList.append(httpCat);    });
    // A long sleep can (and probably will) hibernate the engine which means that the first engine lifetime ends here    await step.sleep("๐ค๐ค๐ค๐ค", "3 hours");
    // When this runs, it will be on the second engine lifetime - which means `imageList` will be empty.    await step.do(      "choose a random cat from the list and download it",      async () => {        const randomCat = imageList.at(getRandomInt(0, imageList.length));        // this will fail since `randomCat` is undefined because `imageList` is empty        return await fetch(`https://http.cat/${randomCat}`);      },    );  }}Instead, you should build top-level state exclusively comprised of step.do returns:
function getRandomInt(min, max) {  const minCeiled = Math.ceil(min);  const maxFloored = Math.floor(max);  return Math.floor(Math.random() * (maxFloored - minCeiled) + minCeiled); // The maximum is exclusive and the minimum is inclusive}
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // โ
 Good: imageList state is exclusively comprised of step returns - this means that in the event of    // multiple engine lifetimes, imageList will be built accordingly    const imageList = await Promise.all([      step.do("get first cutest cat from KV", async () => {        return await env.KV.get("cutest-http-cat-1");      }),
      step.do("get second cutest cat from KV", async () => {        return await env.KV.get("cutest-http-cat-2");      }),    ]);
    // A long sleep can (and probably will) hibernate the engine which means that the first engine lifetime ends here    await step.sleep("๐ค๐ค๐ค๐ค", "3 hours");
    // When this runs, it will be on the second engine lifetime - but this time, imageList will contain    // the two most cutest cats    await step.do(      "choose a random cat from the list and download it",      async () => {        const randomCat = imageList.at(getRandomInt(0, imageList.length));        // this will eventually succeed since `randomCat` is defined        return await fetch(`https://http.cat/${randomCat}`);      },    );  }}function getRandomInt(min, max) {  const minCeiled = Math.ceil(min);  const maxFloored = Math.floor(max);  return Math.floor(Math.random() * (maxFloored - minCeiled) + minCeiled); // The maximum is exclusive and the minimum is inclusive}
export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // โ
 Good: imageList state is exclusively comprised of step returns - this means that in the event of    // multiple engine lifetimes, imageList will be built accordingly    const imageList: string[] = await Promise.all([      step.do("get first cutest cat from KV", async () => {        return await env.KV.get("cutest-http-cat-1");      }),
      step.do("get second cutest cat from KV", async () => {        return await env.KV.get("cutest-http-cat-2");      }),    ]);
    // A long sleep can (and probably will) hibernate the engine which means that the first engine lifetime ends here    await step.sleep("๐ค๐ค๐ค๐ค", "3 hours");
    // When this runs, it will be on the second engine lifetime - but this time, imageList will contain    // the two most cutest cats    await step.do(      "choose a random cat from the list and download it",      async () => {        const randomCat = imageList.at(getRandomInt(0, imageList.length));        // this will eventually succeed since `randomCat` is defined        return await fetch(`https://http.cat/${randomCat}`);      },    );  }}The event passed to your Workflow's run method is immutable: changes you make to the event are not persisted across steps and/or Workflow restarts.
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // ๐ด Bad: Mutating the event    // This will not be persisted across steps and `event.payload` will    // take on its original value.    await step.do("bad step that mutates the incoming event", async () => {      let userData = await env.KV.get(event.payload.user);      event.payload = userData;    });
    // โ
 Good: persist data by returning it as state from your step    // Use that state in subsequent steps    let userData = await step.do("good step that returns state", async () => {      return await env.KV.get(event.payload.user);    });
    let someOtherData = await step.do(      "following step that uses that state",      async () => {        // Access to userData here        // Will always be the same if this step is retried      },    );  }}interface MyEvent {  user: string;  data: string;}
export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<MyEvent>, step: WorkflowStep) {    // ๐ด Bad: Mutating the event    // This will not be persisted across steps and `event.payload` will    // take on its original value.    await step.do("bad step that mutates the incoming event", async () => {        let userData = await env.KV.get(event.payload.user)        event.payload = userData    })
    // โ
 Good: persist data by returning it as state from your step    // Use that state in subsequent steps    let userData = await step.do("good step that returns state", async () => {      return await env.KV.get(event.payload.user)    })
    let someOtherData = await step.do("following step that uses that state", async () => {      // Access to userData here      // Will always be the same if this step is retried    })  }}Steps should be named deterministically (that is, not using the current date/time, randomness, etc). This ensures that their state is cached, and prevents the step from being rerun unnecessarily. Step names act as the "cache key" in your Workflow.
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // ๐ด Bad: Naming the step non-deterministically prevents it from being cached    // This will cause the step to be re-run if subsequent steps fail.    await step.do(`step #1 running at: ${Date.now()}`, async () => {      let userData = await env.KV.get(event.payload.user);      // Do not mutate event.payload      event.payload = userData;    });
    // โ
 Good: give steps a deterministic name.    // Return dynamic values in your state, or log them instead.    let state = await step.do("fetch user data from KV", async () => {      let userData = await env.KV.get(event.payload.user);      console.log(`fetched at ${Date.now}`);      return userData;    });
    // โ
 Good: steps that are dynamically named are constructed in a deterministic way.    // In this case, `catList` is a step output, which is stable, and `catList` is    // traversed in a deterministic fashion (no shuffles or random accesses) so,    // it's fine to dynamically name steps (e.g: create a step per list entry).    let catList = await step.do("get cat list from KV", async () => {      return await env.KV.get("cat-list");    });
    for (const cat of catList) {      await step.do(`get cat: ${cat}`, async () => {        return await env.KV.get(cat);      });    }  }}export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // ๐ด Bad: Naming the step non-deterministically prevents it from being cached    // This will cause the step to be re-run if subsequent steps fail.    await step.do(`step #1 running at: ${Date.now()}`, async () => {        let userData = await env.KV.get(event.payload.user)        // Do not mutate event.payload        event.payload = userData    })
    // โ
 Good: give steps a deterministic name.    // Return dynamic values in your state, or log them instead.    let state = await step.do("fetch user data from KV", async () => {        let userData = await env.KV.get(event.payload.user)        console.log(`fetched at ${Date.now}`)        return userData    })
    // โ
 Good: steps that are dynamically named are constructed in a deterministic way.    // In this case, `catList` is a step output, which is stable, and `catList` is    // traversed in a deterministic fashion (no shuffles or random accesses) so,    // it's fine to dynamically name steps (e.g: create a step per list entry).    let catList = await step.do("get cat list from KV", async () => {      return await env.KV.get("cat-list")    })
    for(const cat of catList) {      await step.do(`get cat: ${cat}`, async () => {        return await env.KV.get(cat)      })    }  }}Workflows allows the usage steps within the Promise.race() or Promise.any() methods as a way to achieve concurrent steps execution. However, some considerations must be taken.
Due to the nature of Workflows' instance lifecycle, and given that a step inside a Promise will run until it finishes, the step that is returned during the first passage may not be the actual cached step, as steps are cached by their names.
// helper sleep methodconst sleep = (ms) => new Promise((r) => setTimeout(r, ms));
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // ๐ด Bad: The `Promise.race` is not surrounded by a `step.do`, which may cause undeterministic caching behavior.    const race_return = await Promise.race([      step.do("Promise first race", async () => {        await sleep(1000);        return "first";      }),      step.do("Promise second race", async () => {        return "second";      }),    ]);
    await step.sleep("Sleep step", "2 hours");
    return await step.do("Another step", async () => {      // This step will return `first`, even though the `Promise.race` first returned `second`.      return race_return;    });  }}// helper sleep methodconst sleep = (ms: number) => new Promise((r) => setTimeout(r, ms));
export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // ๐ด Bad: The `Promise.race` is not surrounded by a `step.do`, which may cause undeterministic caching behavior.    const race_return = await Promise.race(      [        step.do(          'Promise first race',          async () => {            await sleep(1000);            return "first";          }        ),        step.do(          'Promise second race',          async () => {            return "second";          }        ),      ]    );
    await step.sleep("Sleep step", "2 hours");
    return await step.do(      'Another step',      async () => {        // This step will return `first`, even though the `Promise.race` first returned `second`.        return race_return;      },    );  }}To ensure consistency, we suggest to surround the Promise.race() or Promise.any() within a step.do(), as this will ensure caching consistency across multiple passages.
// helper sleep methodconst sleep = (ms) => new Promise((r) => setTimeout(r, ms));
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // โ
 Good: The `Promise.race` is surrounded by a `step.do`, ensuring deterministic caching behavior.    const race_return = await step.do("Promise step", async () => {      return await Promise.race([        step.do("Promise first race", async () => {          await sleep(1000);          return "first";        }),        step.do("Promise second race", async () => {          return "second";        }),      ]);    });
    await step.sleep("Sleep step", "2 hours");
    return await step.do("Another step", async () => {      // This step will return `second` because the `Promise.race` was surround by the `step.do` method.      return race_return;    });  }}// helper sleep methodconst sleep = (ms: number) => new Promise((r) => setTimeout(r, ms));
export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // โ
 Good: The `Promise.race` is surrounded by a `step.do`, ensuring deterministic caching behavior.    const race_return = await step.do(      'Promise step',      async () => {        return await Promise.race(          [            step.do(              'Promise first race',              async () => {                await sleep(1000);                return "first";              }            ),            step.do(              'Promise second race',              async () => {                return "second";              }            ),          ]        );      }    );
    await step.sleep("Sleep step", "2 hours");
    return await step.do(      'Another step',      async () => {        // This step will return `second` because the `Promise.race` was surround by the `step.do` method.        return race_return;      },    );  }}Workflow instance IDs are unique per Workflow. The ID is the unique identifier that associates logs, metrics, state and status of a run to a specific instance, even after completion. Allowing ID re-use would make it hard to understand if a Workflow instance ID referred to an instance that run yesterday, last week or today.
It would also present a problem if you wanted to run multiple different Workflow instances with different input parameters for the same user ID, as you would immediately need to determine a new ID mapping.
If you need to associate multiple instances with a specific user, merchant or other "customer" ID in your system, consider using a composite ID or using randomly generated IDs and storing the mapping in a database like D1.
// This is in the same file as your Workflow definitionexport default {  async fetch(req, env) {    // ๐ด Bad: Use an ID that isn't unique across future Workflow invocations    let userId = getUserId(req); // Returns the userId    let badInstance = await env.MY_WORKFLOW.create({      id: userId,      params: payload,    });
    // โ
 Good: use an ID that is unique    // e.g. a transaction ID, order ID, or task ID are good options    let instanceId = getTransactionId(); // e.g. assuming transaction IDs are unique    // or: compose a composite ID and store it in your database    // so that you can track all instances associated with a specific user or merchant.    instanceId = `${getUserId(req)}-${await crypto.randomUUID().slice(0, 6)}`;    let { result } = await addNewInstanceToDB(userId, instanceId);    let goodInstance = await env.MY_WORKFLOW.create({      id: instanceId,      params: payload,    });
    return Response.json({      id: goodInstance.id,      details: await goodInstance.status(),    });  },};// This is in the same file as your Workflow definitionexport default {  async fetch(req: Request, env: Env): Promise<Response> {    // ๐ด Bad: Use an ID that isn't unique across future Workflow invocations    let userId = getUserId(req) // Returns the userId    let badInstance = await env.MY_WORKFLOW.create({      id: userId,      params: payload    });
    // โ
 Good: use an ID that is unique    // e.g. a transaction ID, order ID, or task ID are good options    let instanceId = getTransactionId() // e.g. assuming transaction IDs are unique    // or: compose a composite ID and store it in your database    // so that you can track all instances associated with a specific user or merchant.    instanceId = `${getUserId(req)}-${await crypto.randomUUID().slice(0, 6)}`    let { result } = await addNewInstanceToDB(userId, instanceId)    let goodInstance = await env.MY_WORKFLOW.create({      id: instanceId,      params: payload    });
    return Response.json({      id: goodInstance.id,      details: await goodInstance.status(),    });  },};When calling step.do or step.sleep, use await to avoid introducing bugs and race conditions into your Workflow code.
If you don't call await step.do or await step.sleep, you create a dangling Promise. This occurs when a Promise is created but not properly awaited, leading to potential bugs and race conditions.
This happens when you do not use the await keyword or fail to chain .then() methods to handle the result of a Promise. For example, calling fetch(GITHUB_URL) without awaiting its response will cause subsequent code to execute immediately, regardless of whether the fetch completed. This can cause issues like premature logging, exceptions being swallowed (and not terminating the Workflow), and lost return values (state).
export class MyWorkflow extends WorkflowEntrypoint {  async run(event, step) {    // ๐ด Bad: The step isn't await'ed, and any state or errors is swallowed before it returns.    const issues = step.do(`fetch issues from GitHub`, async () => {      // The step will return before this call is done      let issues = await getIssues(event.payload.repoName);      return issues;    });
    // โ
 Good: The step is correctly await'ed.    const issues = await step.do(`fetch issues from GitHub`, async () => {      let issues = await getIssues(event.payload.repoName);      return issues;    });
    // Rest of your Workflow goes here!  }}export class MyWorkflow extends WorkflowEntrypoint {  async run(event: WorkflowEvent<Params>, step: WorkflowStep) {    // ๐ด Bad: The step isn't await'ed, and any state or errors is swallowed before it returns.    const issues = step.do(`fetch issues from GitHub`, async () => {        // The step will return before this call is done        let issues = await getIssues(event.payload.repoName)        return issues    })
    // โ
 Good: The step is correctly await'ed.    const issues = await step.do(`fetch issues from GitHub`, async () => {        let issues = await getIssues(event.payload.repoName)        return issues    })
    // Rest of your Workflow goes here!  }}When creating multiple Workflow instances, use the createBatch method to batch the invocations together. This allows you to create multiple Workflow instances in a single request, which will reduce the number of requests made to the Workflows API and increase the number of instances you can create per minute.
export default {  async fetch(req, env) {    let instances = [      { id: "user1", params: { name: "John" } },      { id: "user2", params: { name: "Jane" } },      { id: "user3", params: { name: "Alice" } },      { id: "user4", params: { name: "Bob" } },    ];
    // ๐ด Bad: Create them one by one, which is more likely to hit creation rate limits.    for (let instance of instances) {      await env.MY_WORKFLOW.create({        id: instance.id,        params: instance.params,      });    }
    // โ
 Good: Batch calls together    // This improves throughput.    let instances = await env.MY_WORKFLOW.createBatch(instances);    return Response.json({ instances });  },};export default {  async fetch(req: Request, env: Env): Promise<Response> {    let instances = [{id: "user1", params: {name: "John"}}, {id: "user2", params: {name: "Jane"}}, {id: "user3", params: {name: "Alice"}}, {id: "user4", params: {name: "Bob"}}];
    // ๐ด Bad: Create them one by one, which is more likely to hit creation rate limits.    for (let instance of instances) {      await env.MY_WORKFLOW.create({        id: instance.id,        params: instance.params      });    }
    // โ
 Good: Batch calls together    // This improves throughput.    let instances = await env.MY_WORKFLOW.createBatch(instances);    return Response.json({ instances })  },};Was this helpful?
- Resources
- API
- New to Cloudflare?
- Directory
- Sponsorships
- Open Source
- Support
- Help Center
- System Status
- Compliance
- GDPR
- Company
- cloudflare.com
- Our team
- Careers
- ยฉ 2025 Cloudflare, Inc.
- Privacy Policy
- Terms of Use
- Report Security Issues
- Trademark
-