View on Github Try in Colab Download notebook

Transferring Labels to a Twin Project¶

This example demonstrates how to transfer checklist labels from "Project A" and convert them into yes/no radio labels in "Project B."

Requirements¶

This notebook guides you through the required Workflow templates and Ontologies.

For this notebook, you need:

Two Encord Projects with Ontologies and Workflows shown in this example.
Both Projects must be linked to the same Datasets.

Ontologies¶

📖 Here is the documentation for creating Ontologies.

Ontology in Project A:
The Ontology in Project A contains checklist classifications.

Figure 1: Source project ontology (Project A).

Ontology in Project B:
Each completed task in Project A is converted into a "model-friendly version" in Project B, where radio classifications are used. Project B includes three classifications with the same names as in Project A, but each offers two radio options.
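
The conversion can be sketched in plain Python, independent of the Encord SDK. This is only an illustration of the mapping: the helper name, the titles (blurry, occluded, truncated), and the "yes"/"no" option names are hypothetical stand-ins, not values taken from the actual Ontologies.

```python
def checklist_to_radio(checked, classification_titles):
    """Map a set of checked checklist options to one yes/no answer per title."""
    return {
        title: ("yes" if title in checked else "no")
        for title in classification_titles
    }


# Hypothetical titles, for illustration only
titles = ["blurry", "occluded", "truncated"]
radio_answers = checklist_to_radio({"blurry", "truncated"}, titles)
print(radio_answers)
```

Every title in Project B receives an answer: checked options become "yes" and everything else becomes "no".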

Example Workflows¶

The following are examples of Workflows to be used. Create and save a Workflow template for each of the following Workflows.

📖 Here is the documentation for creating a Workflow with Encord.

Project A Workflow:

Project B Workflow:

With this configuration, all annotation work happens in Project A, while Project B mirrors the transformed labels.

Installation¶

Ensure that you have the encord-agents library installed:

In [ ]:

!python -m pip install encord-agents

Encord Authentication¶

Encord uses SSH keys for authentication. The following code cell sets the ENCORD_SSH_KEY environment variable, which must contain the raw content of your private SSH key file.

If you have not set up an SSH key, see our documentation.

💡 In Colab, you can set the key once in the secrets in the left sidebar and load it in new notebooks. If you are not running the code in a Colab notebook, you must set the environment variable directly:
os.environ["ENCORD_SSH_KEY"] = """paste-private-key-here"""

In [ ]:

import os

os.environ["ENCORD_SSH_KEY"] = "private_key_file_content"
# or you can set a path to a file
# os.environ["ENCORD_SSH_KEY_FILE"] = "/path/to/your/private/key"
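
Before running the agent, it can help to confirm that one of the two variables is actually set. The following is a minimal sketch using only the standard library; the helper name configured_key_sources is hypothetical and not part of encord-agents, and the check says nothing about whether the key itself is valid.

```python
import os


def configured_key_sources(env):
    """Return the names of the Encord SSH key variables that are set and non-empty."""
    names = ("ENCORD_SSH_KEY", "ENCORD_SSH_KEY_FILE")
    return [name for name in names if env.get(name)]


sources = configured_key_sources(os.environ)
if not sources:
    print("No SSH key configured: set ENCORD_SSH_KEY or ENCORD_SSH_KEY_FILE.")
else:
    print("SSH key configured via:", ", ".join(sources))
```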

[Alternative] Temporary Key¶

There's also the option of generating a temporary (fresh) ssh key pair via the code cell below. Please follow the instructions printed when executing the code.

In [ ]:

# ⚠️ Safe to skip if you have authenticated already
import os

from encord_agents.utils.colab import generate_public_private_key_pair_with_instructions

private_key_path, public_key_path = generate_public_private_key_pair_with_instructions()
os.environ["ENCORD_SSH_KEY_FILE"] = private_key_path.as_posix()

Define the Agent¶

An agent can perform this translation using the dep_twin_label_row dependency. For every label row from Project A, the agent automatically fetches the corresponding label row (and optionally the Workflow task) from Project B.

In the following code cell, we define the custom code for the translation.

Ensure that you replace the following placeholders in the code:

<project_hash_a>: The Project hash for Project A
<project_hash_b>: The Project hash for Project B
<transfer_agent_stage_uuid>: The task agent node uuid in Project A.
<labeling_completion_pathway_uuid>: The uuid (or name) of the pathway in Project A that leads to the complete state.
<twin_completion_pathway_uuid>: The uuid (or name) of the pathway in Project B that leads to the complete state.
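
A forgotten placeholder is easy to miss. As a rough pre-flight check, the sketch below flags values that are empty or still look like an angle-bracket placeholder. The settings dictionary and the helper unfilled_placeholders are hypothetical conveniences for this illustration, not part of encord-agents.

```python
import re

# Matches a value that is a single "<...>" placeholder
PLACEHOLDER = re.compile(r"^<[^<>]+>$")


def unfilled_placeholders(settings):
    """Return the keys whose values are empty or still look like `<placeholder>`."""
    return [
        key
        for key, value in settings.items()
        if not value or PLACEHOLDER.match(value)
    ]


settings = {
    "project_hash_a": "<project_hash_a>",
    "project_hash_b": "<project_hash_b>",
    "transfer_agent_stage_uuid": "<transfer_agent_stage_uuid>",
    "labeling_completion_pathway_uuid": "<labeling_completion_pathway_uuid>",
    "twin_completion_pathway_uuid": "<twin_completion_pathway_uuid>",
}
missing = unfilled_placeholders(settings)
if missing:
    print("Still to fill in:", ", ".join(missing))
```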

In [ ]:

from encord.objects.ontology_labels_impl import LabelRowV2
from encord.objects.options import Option
from encord.workflow.stages.agent import AgentTask
from typing_extensions import Annotated

from encord_agents.tasks import Depends, Runner
from encord_agents.tasks.dependencies import Twin, dep_twin_label_row

# 1. Setup the runner
runner = Runner(project_hash="<project_hash_a>")

# 2. Get the classification attribute used to query answers
checklist_classification = runner.project.ontology_structure.classifications[0]  # type: ignore
checklist_attribute = checklist_classification.attributes[0]


# 3. Define the agent
@runner.stage(stage="<transfer_agent_stage_uuid>")
def copy_labels(
    manually_annotated_lr: LabelRowV2,
    twin: Annotated[Twin, Depends(dep_twin_label_row(twin_project_hash="<project_hash_b>"))],
) -> str | None:
    # 4. Reading the checkboxes that have been set
    instance = manually_annotated_lr.get_classification_instances()[0]
    answers = instance.get_answer(attribute=checklist_attribute)
    if answers is None or isinstance(answers, (str, Option)):
        return None

    set_options = {o.title for o in answers}  # Use title to match

    # 5. Set answer on the sink labels
    for radio_clf in twin.label_row.ontology_structure.classifications:
        ins = radio_clf.create_instance()

        attr = radio_clf.attributes[0]
        if radio_clf.title in set_options:
            ins.set_answer(attr.options[0])
        else:
            ins.set_answer(attr.options[1])

        ins.set_for_frames(frames=0)
        twin.label_row.add_classification_instance(ins)

    # 6. Save labels and proceed tasks
    twin.label_row.save()
    if twin.task and isinstance(twin.task, AgentTask):
        twin.task.proceed(pathway_uuid="<twin_completion_pathway_uuid>")

    return "<labeling_completion_pathway_uuid>"

The code does six things, matching the numbered comments:

1. Instantiates a runner that executes the agent code against every task in the agent stage of the Project.
2. Reads the information needed for the label translation from the Ontology of Project A.
3. Links the implementation to the correct stage in Project A and declares a dep_twin_label_row dependency on the twin Project, so that the converted labels can be written to Project B.
4. Reads the manual annotations.
5. Converts and writes the labels to Project B.
6. Proceeds the two "sibling tasks" from Project A and B to the complete state.
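
Because step 5 matches by title, a classification in Project B whose title does not exactly match a checked option receives the second radio option, and a checked option with no matching classification is ignored. A small sketch of a title comparison you could run beforehand is shown here. The helper name and the example titles are hypothetical; in practice you would pass in the titles read from the two Ontologies.

```python
def compare_titles(checklist_option_titles, radio_classification_titles):
    """Report titles that appear on only one side of the title-based match."""
    options = set(checklist_option_titles)
    radios = set(radio_classification_titles)
    return {
        "options_without_classification": sorted(options - radios),
        "classifications_without_option": sorted(radios - options),
    }


# Hypothetical titles: note the capitalization mismatch on the last entry
report = compare_titles(
    ["blurry", "occluded", "Truncated"],
    ["blurry", "occluded", "truncated"],
)
print(report)
```

Two empty lists mean that every checklist option has a radio classification with the same title.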

Running the Agent¶

The runner object is callable, which means that you can call it directly to process the tasks in the agent stage.

In [ ]:

# Run the agent
runner()

Outcome¶

When the agent runs, tasks approved in Project A’s review stage move to the "Complete" stage in Project B, with the labels automatically converted and displayed.

💡 To run this as a command-line interface, save the code in an agent.py file and replace:
runner()
with:
if __name__ == "__main__":
    runner.run()
This lets you set parameters like the project hash from the command line:
python agent.py --project-hash "..."
