Tool-calling schema compliance check (repro steps + observed output)

by lesj0610 - opened Jan 11


Hello, I’m trying to verify whether this model supports schema-based tool calling (strict tool name + arguments matching a predefined schema), rather than generating API-usage explanations.

Repro steps (minimal):

Provide the model a single tool schema (OpenAI-style function calling):

{
  "name": "get_current_weather",
  "description": "Get the current weather for a given city",
  "parameters": {
    "type": "object",
    "properties": {
      "location": { "type": "string", "description": "City name with country code (e.g., Seoul,KR)" },
      "unit": { "type": "string", "enum": ["metric", "imperial"] }
    },
    "required": ["location"]
  }
}

Ask the model in natural Korean (no mention of tools/APIs/functions):

지금 서울의 현재 기온이 몇 도인가요?
(English: "What is the current temperature in Seoul right now?")

Expected output (example of a correct schema-compliant tool call):

{
  "name": "get_current_weather",
  "arguments": { "location": "Seoul,KR", "unit": "metric" }
}

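Observed output (model response translated from Korean; JSON payloads unchanged):

Assuming access to weather information is available, the JSON-format tool call for looking up the current temperature in Seoul is as follows. It is based on a common weather API request format (e.g., the OpenWeatherMap API).

{
  "method": "GET",
  "endpoint": "https://api.openweathermap.org/data/2.5/weather",
  "query_params": {
    "q": "Seoul,KR",
    "appid": "YOUR_API_KEY",
    "units": "metric",
    "lang": "kr"
  }
}

Explanation:
method: HTTP request method (GET)
endpoint: API endpoint that provides the weather data
query_params:
q: city and country code (Seoul,KR)
appid: your issued OpenWeatherMap API key (must be replaced with your own key for actual use)
units: metric is specified to use Celsius
lang: set lang=kr to receive the response in Korean (may be omitted depending on whether Korean is supported)

⚠️ Note: A valid appid is required for actual execution, and the JSON above is a tool-call form that describes the request structure. I cannot call the API directly, so it must be executed by an external system or backend based on this format.

Example response (partial, on success):
{
  "main": {
    "temp": 18.5,
    "feels_like": 17.8,
    "humidity": 65
  },
  "weather": [
    {
      "description": "흩어진 구름",
      "icon": "03d"
    }
  ],
  "name": "Seoul"
}

Based on this structure, the current temperature in Seoul (e.g., 18.5°C) can be retrieved from an actual weather API.

If needed, I can also provide a Python code example.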

From this result, the model appears to generate an API request example rather than emitting a schema-compliant tool call using the provided tool name and argument schema.
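For reference, the difference between the expected and observed outputs can be checked mechanically. Below is a minimal sketch in Python (standard library only). `validate_tool_call` is a hypothetical helper written for this report, not part of any library, and it only covers the flat string/enum properties used in this schema rather than full JSON Schema.

```python
import json

# Tool schema copied from the repro steps (descriptions omitted for brevity).
TOOL_SCHEMA = {
    "name": "get_current_weather",
    "parameters": {
        "type": "object",
        "properties": {
            "location": {"type": "string"},
            "unit": {"type": "string", "enum": ["metric", "imperial"]},
        },
        "required": ["location"],
    },
}


def validate_tool_call(call, schema=TOOL_SCHEMA):
    """Return a list of problems; an empty list means the call is compliant.

    Hypothetical helper: handles only flat string properties with optional
    enums, which is all this schema needs. It is not a JSON Schema validator.
    """
    if not isinstance(call, dict):
        return ["tool call is not a JSON object"]
    problems = []
    if call.get("name") != schema["name"]:
        problems.append(f"unexpected tool name: {call.get('name')!r}")
    args = call.get("arguments")
    if not isinstance(args, dict):
        problems.append("missing or non-object 'arguments'")
        return problems
    props = schema["parameters"]["properties"]
    for key in schema["parameters"].get("required", []):
        if key not in args:
            problems.append(f"missing required argument: {key}")
    for key, value in args.items():
        if key not in props:
            problems.append(f"unknown argument: {key}")
        elif props[key].get("type") == "string" and not isinstance(value, str):
            problems.append(f"argument {key} must be a string")
        elif "enum" in props[key] and value not in props[key]["enum"]:
            problems.append(f"argument {key} not in {props[key]['enum']}")
    return problems


expected = json.loads(
    '{"name": "get_current_weather",'
    ' "arguments": {"location": "Seoul,KR", "unit": "metric"}}'
)
# Abbreviated version of the JSON the model actually produced.
observed = json.loads(
    '{"method": "GET",'
    ' "endpoint": "https://api.openweathermap.org/data/2.5/weather",'
    ' "query_params": {"q": "Seoul,KR", "units": "metric"}}'
)

print(validate_tool_call(expected))  # empty list: compliant
print(validate_tool_call(observed))  # name mismatch and missing 'arguments'
```

The observed JSON fails on both counts: it carries no `name` matching the provided tool and no `arguments` object, which is what distinguishes an API request example from a tool call.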

Could you clarify:

Was schema-level tool calling included in training for this model?
If yes, is there a required serving configuration or prompt format to enable strict tool-call emission?

Thank you.

nuxlear

LG AI Research org Jan 13

Hello, @lesj0610. Thank you for your interest.

We checked your case but could not reproduce the same failure.

Here is the snippet we tested:

tools = [{
    "name": "get_current_weather",
    "description": "Get the current weather for a given city",
    "parameters": {
        "type": "object",
        "properties": {
            "location": {"type": "string", "description": "City name with country code (e.g., Seoul,KR)"},
            "unit": {"type": "string", "enum": ["metric", "imperial"]}
        },
        "required": ["location"]
    }
}]

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "지금 서울의 현재 기온이 몇 도인가요?"},
]

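# `tokenizer` and `model` are assumed to be already loaded for this model.
inputs = tokenizer.apply_chat_template(
    messages,
    tools=tools,  # the chat template renders the tool schema into the prompt
    add_generation_prompt=True,
    return_dict=True,  # return a dict so that **inputs can be unpacked below
    return_tensors="pt",
    enable_thinking=False,  # extra kwargs are forwarded to the chat template
).to(model.device)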

output = model.generate(
    **inputs,
    do_sample=True,
    temperature=1.0,
    top_p=0.95,
    max_new_tokens=256,
)

Please note that, when using a tool-augmented chat template, the tools schema should be passed to the HF tokenizer (apply_chat_template), not directly to model.generate().
(For an inference server, please check its documentation on OpenAI-compatible tool calling.)

With this setup, we get the following tool call:

<tool_call>{"name": "get_current_weather", "arguments": {"location": "Seoul,KR", "unit": "metric"}}</tool_call>
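If it helps, here is a minimal sketch of how the decoded text could be turned back into structured calls, assuming the output uses the `<tool_call>...</tool_call>` wrapper shown above. `extract_tool_calls` is a hypothetical helper name, not a library function.

```python
import json
import re

# Non-greedy match so that several tool calls in one output are kept separate.
TOOL_CALL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)


def extract_tool_calls(text):
    """Return every JSON object wrapped in <tool_call>...</tool_call> tags."""
    calls = []
    for match in TOOL_CALL_RE.finditer(text):
        try:
            calls.append(json.loads(match.group(1).strip()))
        except json.JSONDecodeError:
            # Skip malformed payloads instead of failing the whole parse.
            continue
    return calls


decoded = (
    '<tool_call>{"name": "get_current_weather", '
    '"arguments": {"location": "Seoul,KR", "unit": "metric"}}</tool_call>'
)
calls = extract_tool_calls(decoded)
print(calls[0]["name"], calls[0]["arguments"])
```

An output with no `<tool_call>` tags, such as a plain-text explanation, yields an empty list, which makes the two behaviors easy to tell apart in a test.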

It seems the tool schema may not have been passed to the model properly.
If you can share the generation part of your code, we can take a closer look.

lesj0610

Jan 13


edited Jan 13

Thanks for the clarification and the working snippet.

One key detail: my testing was done only via the FriendliAI web UI (I cannot run the 236B model locally). In that UI, I do not have any way to verify whether the tools schema is actually injected into the model inputs (i.e., whether the server uses a tool-augmented chat template like tokenizer.apply_chat_template(..., tools=tools, ...)).

So the failure I observed (API-style JSON explanation instead of a tool call) may be due to the FriendliAI web UI pipeline not passing tools/schema, rather than the base model lacking tool-call training.

Could you confirm whether the FriendliAI web UI currently supports tool calling with schema injection?
If yes, could you share the minimal steps/settings in the UI to enable it, or provide an OpenAI-compatible endpoint example request that triggers tool calls?

If the web UI does not inject tools/schema, then my observation is expected from the deployment, and I will update my report accordingly.

Thank you.

lesj0610

Jan 13

Thanks for the clarification. I was testing via the FriendliAI web UI, where I couldn’t verify whether the tools schema was actually injected. After switching to the FriendliAI OpenAI-compatible API and explicitly passing tools (and tool_choice), the model correctly returned finish_reason: "tool_calls" with a valid get_current_weather tool call. I’ll update my report accordingly.
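For anyone hitting the same issue, here is a rough sketch of the request shape that worked, following the OpenAI-style chat completions format. It only builds the request body and reads a response dict; it makes no network call. The model ID placeholder, the helper names, and the sample response are illustrative assumptions, not copied from the actual API exchange.

```python
import json

# Tool schema from the original report.
WEATHER_TOOL = {
    "name": "get_current_weather",
    "description": "Get the current weather for a given city",
    "parameters": {
        "type": "object",
        "properties": {
            "location": {"type": "string", "description": "City name with country code (e.g., Seoul,KR)"},
            "unit": {"type": "string", "enum": ["metric", "imperial"]},
        },
        "required": ["location"],
    },
}


def build_chat_request(model, user_text, tool, force_tool=False):
    """Build an OpenAI-style chat completions request body with one tool."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": user_text}],
        # OpenAI-style requests wrap each function schema in a "function" entry.
        "tools": [{"type": "function", "function": tool}],
        "tool_choice": "auto",
    }
    if force_tool:
        # Require the model to call this specific tool.
        body["tool_choice"] = {"type": "function", "function": {"name": tool["name"]}}
    return body


def tool_calls_from_response(response):
    """Return (name, arguments) pairs if the first choice ended with tool calls."""
    choice = response["choices"][0]
    if choice.get("finish_reason") != "tool_calls":
        return []
    return [
        (call["function"]["name"], json.loads(call["function"]["arguments"]))
        for call in choice["message"].get("tool_calls", [])
    ]


# "MODEL_ID" is a placeholder; use the ID listed by your provider.
request_body = build_chat_request("MODEL_ID", "지금 서울의 현재 기온이 몇 도인가요?", WEATHER_TOOL)

# Hand-written sample in the OpenAI response shape, for illustration only.
sample_response = {
    "choices": [{
        "finish_reason": "tool_calls",
        "message": {
            "role": "assistant",
            "content": None,
            "tool_calls": [{
                "id": "call_0",
                "type": "function",
                "function": {
                    "name": "get_current_weather",
                    "arguments": "{\"location\": \"Seoul,KR\", \"unit\": \"metric\"}",
                },
            }],
        },
    }]
}
print(tool_calls_from_response(sample_response))
```

Note that in this format the `arguments` field arrives as a JSON-encoded string, so it needs one more `json.loads` before it can be compared against the schema.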
