This guide explains how to build tools with TextLayer Core. TextLayer Core uses the Vaul toolkit to create and manage tool calls for AI systems. Tools can be implemented either as standalone functions or with a service implementation class for more complex functionality.

Overview

In TextLayer Core, tools are functions decorated with @tool_call and @observe that can be called by AI systems. There are two main approaches to implementing tools:
  1. Tools without a service: Simple functions that implement their functionality directly
  2. Tools with a service implementation: Functions that use a separate service class with multiple methods
Let’s look at how to implement each type of tool.

Tools Without a Service

Tools without a service are implemented as standalone functions decorated with @tool_call and @observe. These are suitable for simple functionality that doesn’t require complex business logic or external service interactions.

Step 1: Import the Necessary Decorators

from langfuse.decorators import observe
from vaul import tool_call
The @observe decorator from Langfuse tracks the function’s execution for monitoring and tracing, while the @tool_call decorator from Vaul enables the function to be called by AI systems.

Step 2: Create a Function with Type Hints and Docstrings

Define your function with proper type hints and comprehensive docstrings:
@tool_call
@observe
def think(thought: str) -> str:
    """Use the tool to think about something. It will not obtain new information or change the
    database, but just append the thought to the log. Use it when complex reasoning or some cache
    memory is needed.
    Args:
        thought: A thought to think about.
    """
    return thought
This simple “think” tool gives the model room to reason about its previous or upcoming actions. It’s useful for AI systems that need to self-correct or self-validate before committing to a step.

Step 3: Implement More Complex Functionality (Optional)

For tools that require more functionality but don’t need a separate service class, you can import libraries and implement the functionality directly in the tool function:
from typing import Any, Dict, Optional

@tool_call
@observe
def get_current_date_time(
    format: Optional[str] = None,
    timezone: Optional[str] = None,
) -> Dict[str, Any]:
    """
    Get the current date and time in a specified format and timezone.
    
    Args:
        format: Optional. The format string to format the date/time (e.g., "%Y-%m-%d %H:%M:%S").
                If None, defaults to ISO 8601 format.
        timezone: Optional. A valid timezone string from the IANA time zone database
                  (e.g., "America/New_York", "Asia/Tokyo"). Defaults to UTC.
    
    Returns:
        dict: A dictionary containing current_date_time and epoch.
    """
    from datetime import datetime
    import pytz
    
    if timezone:
        tz = pytz.timezone(timezone)
    else:
        tz = pytz.utc
    
    now = datetime.now(tz)
    
    return {
        "current_date_time": now.strftime(format) if format else now.isoformat(),
        "epoch": int(now.timestamp())
    }
This datetime tool handles timezone conversions and formatting but still doesn’t require a separate service class.
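If you would rather avoid the pytz dependency, the same logic can be sketched with the standard library’s zoneinfo module (Python 3.9+). The function name here is changed to avoid clashing with the tool above:

```python
from datetime import datetime, timezone
from typing import Any, Dict, Optional
from zoneinfo import ZoneInfo  # stdlib since Python 3.9; needs the tzdata package on some platforms


def current_date_time(fmt: Optional[str] = None, tz_name: Optional[str] = None) -> Dict[str, Any]:
    """Return the current date/time in the given format and IANA timezone (UTC by default)."""
    tz = ZoneInfo(tz_name) if tz_name else timezone.utc
    now = datetime.now(tz)
    return {
        "current_date_time": now.strftime(fmt) if fmt else now.isoformat(),
        "epoch": int(now.timestamp()),
    }
```

Either implementation works under @tool_call; what matters for the LLM is that the parameters and docstring stay descriptive.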

Tools With a Service Implementation

For more complex functionality, it’s often better to create a separate service class that handles the business logic, and then create a tool that uses this service. This approach provides better separation of concerns and allows the service to be reused across multiple tools.

Step 1: Create a Service Class

First, create a service class with the necessary methods for your functionality:
import sqlite3
from typing import Any, Dict, Optional
import pandas as pd


class SQLiteDatastore:
    """A datastore implementation for SQLite."""

    def __init__(self, database: Optional[str] = None) -> None:
        """
        Initialize the SQLiteDatastore.

        Args:
            database (str, optional): Path to the SQLite database file.
                                     If None, an in-memory database is used.
        """
        if database is None:
            database = ":memory:"
        self.connection = sqlite3.connect(database)
        self.connection.row_factory = sqlite3.Row

    def execute(self, query: str, parameters: Optional[Dict[str, Any]] = None) -> pd.DataFrame:
        """
        Execute a SQL query and return the result as a DataFrame.

        Args:
            query (str): The SQL query to execute.
            parameters (Dict[str, Any], optional): Parameters to include in the query.

        Returns:
            pd.DataFrame: The query result.
        """
        cursor = self.connection.cursor()
        if parameters:
            cursor.execute(query, parameters)
        else:
            cursor.execute(query)
        rows = cursor.fetchall()
        columns = [description[0] for description in cursor.description]
        return pd.DataFrame(rows, columns=columns)

    def get_columns(self, table_name: str) -> pd.DataFrame:
        """Retrieve column information for a specific table."""
        query = f"PRAGMA table_info('{table_name}')"
        return self.execute(query)

    def get_sample_data(self, table_name: str, limit: int = 5) -> pd.DataFrame:
        """Retrieve a sample of data from a specific table."""
        # Note: SQL identifiers can't be bound as parameters, so table_name is
        # interpolated directly and must come from a trusted source.
        query = f"""
        SELECT *
        FROM {table_name}
        ORDER BY RANDOM()
        LIMIT {limit}
        """
        return self.execute(query)
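To see the pattern in isolation before wiring it into a tool, here is a quick exercise against an in-memory database. This sketch returns plain dicts instead of a DataFrame so it runs with the standard library alone:

```python
import sqlite3
from typing import Any, Dict, List, Optional


def execute(conn: sqlite3.Connection, query: str,
            parameters: Optional[Dict[str, Any]] = None) -> List[Dict[str, Any]]:
    """Run a query and return rows as dicts (a pandas-free stand-in for the DataFrame version)."""
    cursor = conn.cursor()
    cursor.execute(query, parameters or {})
    return [dict(row) for row in cursor.fetchall()]


conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # makes rows addressable by column name
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users (name) VALUES ('Ada'), ('Grace')")

# Named parameters (:name) are bound safely by the sqlite3 driver
rows = execute(conn, "SELECT * FROM users WHERE name = :name", {"name": "Ada"})
```

Setting row_factory to sqlite3.Row is what lets both this sketch and the class above recover column names from the cursor.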

Step 2: Create a Tool That Uses the Service

Next, create a tool function that initializes and uses the service class:
from langfuse.decorators import observe
from vaul import tool_call

from app import logger
from app.services.db.datastore import SQLiteDatastore


@tool_call
@observe
def text_to_sql(query: str) -> str:
    """Executes a SQL query for SQLite and returns the result as a markdown table.
    Args:
        query (str): The SQL query to execute on the SQLite database.
    Returns:
        str: The result of the SQL query execution, formatted as a markdown table.
    """

    if not query:
        logger.error("No query provided")
        return ""

    logger.info(f"Executing SQL query: {query}")

    # Initialize the SQLite datastore
    datastore = SQLiteDatastore(database="data/data.db")

    # Execute the query
    result = datastore.execute(query)

    # Return the result
    return result.to_markdown(index=False, floatfmt=".2f") if result is not None else ""
This tool function initializes the SQLiteDatastore service, executes the query, and returns the result as a markdown table.
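Note that pandas’ to_markdown also requires the tabulate package at runtime. If that dependency is unwelcome, the markdown rendering step can be sketched by hand from plain row dicts:

```python
from typing import Any, Dict, List


def rows_to_markdown(rows: List[Dict[str, Any]]) -> str:
    """Render row dicts as a markdown table (a minimal stand-in for DataFrame.to_markdown)."""
    if not rows:
        return ""
    headers = list(rows[0].keys())
    lines = [
        "| " + " | ".join(headers) + " |",
        "| " + " | ".join("---" for _ in headers) + " |",
    ]
    for row in rows:
        lines.append("| " + " | ".join(str(row[h]) for h in headers) + " |")
    return "\n".join(lines)
```

Markdown tables are a deliberate choice for the return value: LLMs parse and reproduce them reliably, so the model can quote query results directly in its answer.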

Best Practices

When building tools for TextLayer Core, follow these best practices:

1. Use Proper Type Hints

Always use proper type hints for function parameters and return values. This helps with code completion, error checking, and documentation generation.
from typing import Any, Dict, Optional

def my_tool(param1: str, param2: Optional[int] = None) -> Dict[str, Any]:
    # Implementation
    ...

2. Write Comprehensive Docstrings

Include detailed docstrings that explain:
  • What the tool does
  • What parameters it accepts
  • What it returns
  • Any exceptions it might raise
  • Usage examples if helpful
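Taken together, a docstring covering all of these points might look like the following (lookup_user is a hypothetical tool used only for illustration):

```python
from typing import Any, Dict


def lookup_user(user_id: str) -> Dict[str, Any]:
    """Look up a user record by its identifier.

    Args:
        user_id: The unique identifier of the user to look up.

    Returns:
        dict: The user's profile fields, or an empty dict if no user matches.

    Raises:
        ValueError: If user_id is empty.

    Example:
        >>> lookup_user("u-123")
        {'id': 'u-123'}
    """
    if not user_id:
        raise ValueError("user_id must not be empty")
    return {"id": user_id}
```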

3. Error Handling

Always include proper error handling in your tools:
@tool_call
@observe
def my_tool(param: str) -> str:
    try:
        # Implementation
        return result
    except Exception as e:
        logger.error(f"Error in my_tool: {e}")
        return f"Error: {str(e)}"

4. Service Reusability

Design your service classes to be reusable across multiple tools. This promotes code reuse and maintainability.

5. Keep Tools Focused

Each tool should have a single responsibility. If a tool is becoming too complex, consider splitting it into multiple tools or moving complexity to the service layer.

Why Docstrings and Type Hints Matter for LLMs

When building tools for TextLayer Core, detailed docstrings and proper type hints are not just good programming practices—they are essential for LLM integration. Here’s why:

OpenAPI Schema Generation

The @tool_call decorator from Vaul automatically generates OpenAPI schemas from your function signatures and docstrings. These schemas are then passed to LLMs like GPT-4o to enable them to understand and use your tools.
# Under the hood, Vaul transforms your function into an OpenAPI schema like this:
{
    "name": "text_to_sql",
    "description": "Executes a SQL query for SQLite and returns the result as a markdown table.",
    "parameters": {
        "type": "object",
        "properties": {
            "query": {
                "type": "string",
                "description": "The SQL query to execute on the SQLite database."
            }
        },
        "required": ["query"]
    }
}

LLM Understanding

LLMs use these schemas to:
  1. Understand tool capabilities: The function name and docstring help the LLM understand what the tool does
  2. Determine when to use the tool: Detailed descriptions help the LLM decide when a tool is appropriate
  3. Format inputs correctly: Type hints ensure the LLM provides correctly formatted arguments
  4. Interpret outputs: Return type annotations help the LLM understand and process the tool’s response

Implementation Details

Vaul handles this schema generation through its Toolkit class, which:
  1. Extracts function signature information using Python’s type annotations
  2. Generates OpenAPI schemas from the function’s docstring and type hints
  3. Validates inputs using Pydantic to ensure data integrity
  4. Provides methods for interacting with AI responses
Without proper docstrings and type hints, LLMs may misunderstand your tools, use them incorrectly, or fail to use them at all.
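To make the idea concrete, a minimal schema generator can be built from a function’s signature and docstring with the standard library. This is an illustration of the technique, not Vaul’s actual implementation:

```python
import inspect
from typing import Any, Dict, get_type_hints

# Illustrative mapping from Python annotations to JSON Schema types
_TYPE_MAP = {str: "string", int: "integer", float: "number", bool: "boolean"}


def build_schema(func) -> Dict[str, Any]:
    """Derive a minimal OpenAPI-style tool schema from a function (illustrative only)."""
    hints = get_type_hints(func)
    sig = inspect.signature(func)
    properties: Dict[str, Any] = {}
    required = []
    for name, param in sig.parameters.items():
        # Fall back to "string" for unannotated or unmapped parameters
        properties[name] = {"type": _TYPE_MAP.get(hints.get(name), "string")}
        if param.default is inspect.Parameter.empty:
            required.append(name)
    return {
        "name": func.__name__,
        # First docstring line doubles as the tool description
        "description": (inspect.getdoc(func) or "").split("\n")[0],
        "parameters": {"type": "object", "properties": properties, "required": required},
    }
```

Run against a function like text_to_sql, this produces the same shape of schema shown above, which is why the first line of the docstring deserves particular care.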

Registering Tools

To make your tools available to AI systems, you need to register them with the Vaul toolkit:
from vaul import Toolkit
from app.services.llm.tools.db.text_to_sql import text_to_sql
from app.services.llm.tools.prompting.think import think
from app.services.llm.tools.datetime.get_current_date_time import get_current_date_time

# Create a toolkit
toolkit = Toolkit(name="TextLayer Tools")

# Add individual tools
toolkit.add_tool(text_to_sql)
toolkit.add_tool(think)
toolkit.add_tool(get_current_date_time)

# Or add multiple tools at once
toolkit.add_tools(text_to_sql, think, get_current_date_time)

# Get schemas for all tools in the toolkit (for use with LLM providers)
tool_schemas = toolkit.tool_schemas()
The toolkit provides methods for adding tools, generating schemas, and running tools by name. This centralized management makes it easier to maintain and extend your tool collection.
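Conceptually, running a tool by name comes down to a dispatch table. The following simplified sketch (not Vaul’s internals) shows the shape of that mechanism:

```python
from typing import Any, Callable, Dict


class SimpleToolkit:
    """A stripped-down registry illustrating the name-to-function dispatch a toolkit performs."""

    def __init__(self) -> None:
        self._tools: Dict[str, Callable[..., Any]] = {}

    def add_tool(self, func: Callable[..., Any]) -> None:
        # Register under the function's own name, mirroring how schemas are keyed
        self._tools[func.__name__] = func

    def run_tool(self, name: str, arguments: Dict[str, Any]) -> Any:
        if name not in self._tools:
            raise KeyError(f"Unknown tool: {name}")
        # Arguments arrive as a dict (e.g. parsed from an LLM tool call) and are splatted in
        return self._tools[name](**arguments)
```

When an LLM responds with a tool call, the toolkit looks up the named function, validates the arguments, and invokes it, which is why tool names and parameter names must match the registered schemas exactly.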