Generate js and curl snippets using templates #1291


Conversation

Contributor

@Wauplin Wauplin commented Mar 17, 2025

PR built on top of #1273.

This is supposed to be the last PR refactoring inference snippets 🙉
python.ts, curl.ts, and js.ts have been merged into a single getInferenceSnippets.ts, which handles snippet generation for all languages and all providers at once. Here is how to use it:

import { snippets } from "@huggingface/inference";

const generatedSnippets = snippets.getInferenceSnippets(model, "api_token", provider, providerModelId, opts);

It returns a list of InferenceSnippet objects, defined as:

export interface InferenceSnippet {
    language: InferenceSnippetLanguage; // e.g. `python`, `curl`, `js`
    client: string; // e.g. `huggingface_hub`, `openai`, `fetch`, etc.
    content: string;
}
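To illustrate how the returned list could be consumed, here is a minimal, self-contained sketch (the InferenceSnippet shape is copied from above; the grouping helper and sample data are hypothetical, e.g. for rendering per-language tabs in a UI):

```typescript
// Shape returned by getInferenceSnippets, as defined in the PR.
interface InferenceSnippet {
    language: string; // e.g. "python", "curl", "js"
    client: string; // e.g. "huggingface_hub", "openai", "fetch"
    content: string;
}

// Hypothetical helper: group snippets by language so each language tab
// can list its available clients.
function groupByLanguage(snippets: InferenceSnippet[]): Map<string, InferenceSnippet[]> {
    const groups = new Map<string, InferenceSnippet[]>();
    for (const snippet of snippets) {
        const existing = groups.get(snippet.language) ?? [];
        existing.push(snippet);
        groups.set(snippet.language, existing);
    }
    return groups;
}

// Illustrative sample data, not actual generated snippets.
const snippets: InferenceSnippet[] = [
    { language: "js", client: "fetch", content: "// fetch-based call" },
    { language: "js", client: "huggingface.js", content: "// InferenceClient call" },
    { language: "python", client: "huggingface_hub", content: "# InferenceClient call" },
];

const grouped = groupByLanguage(snippets);
console.log([...grouped.keys()]); // [ "js", "python" ]
console.log(grouped.get("js")!.map((s) => s.client)); // [ "fetch", "huggingface.js" ]
```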

How to review?

It's hard to track all the atomic changes made to the inference snippets, so the best way IMO to review this PR is to check the generated snippets in the tests. Many inconsistencies in the URLs, sent parameters, and indentation have been fixed.


What's next?

Contributor

@SBrandeis SBrandeis left a comment


I haven't reviewed all the snippets yet, but conceptually looks really good 🔥
I have left some comments


const client = new InferenceClient("api_token");

const data = fs.readFileSync("sample1.flac");
Contributor


Maybe let's add an import statement for this

Side note: I think the fs API is only available in a NodeJS context 😅
The equivalent in the Browser is the File Reader API

I think it's fine if the snippets are only compatible with Node, for simplicity - thoughts @julien-c @coyotte508 ?
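For reference, a minimal sketch of the environment difference (the blobToBytes helper and demo Blob are illustrative, not part of the PR): fs.readFileSync is Node-only, while Blob/File and their async arrayBuffer() API work in both the browser and Node >= 18.

```typescript
// fs.readFileSync("sample1.flac") only works in Node. A Blob (or a File from
// an <input type="file"> element) works in both the browser and Node >= 18,
// where Blob is a global. Illustrative helper, not part of the PR:
async function blobToBytes(blob: Blob): Promise<Uint8Array> {
    return new Uint8Array(await blob.arrayBuffer());
}

// In a browser, the call would look like: await blobToBytes(fileInput.files[0])
const demo = new Blob(["fLaC"]); // 4 ASCII bytes, standing in for audio data
blobToBytes(demo).then((bytes) => console.log(bytes.length)); // prints 4
```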

Member


In the README.md we have these:

await hf.automaticSpeechRecognition({
  model: 'facebook/wav2vec2-large-960h-lv60-self',
  data: readFileSync('test/sample1.flac')
})
await hf.imageToImage({
  inputs: new Blob([readFileSync("test/stormtrooper_depth.png")]),
  parameters: {
    prompt: "elmo's lecture",
  },
  model: "lllyasviel/sd-controlnet-depth",
});
await hf.zeroShotImageClassification({
  model: 'openai/clip-vit-large-patch14-336',
  inputs: {
    image: await (await fetch('https://placekitten.com/300/300')).blob()
  },
  parameters: {
    candidate_labels: ['cat', 'dog']
  }
})

Contributor Author


Note that for now I've only reproduced what we already have in https://huggingface.co/openai/whisper-large-v3-turbo?inference_api=true&inference_provider=hf-inference&language=js. I'm fine with changing this, but I'd prefer to do it in a follow-up PR.

My personal preference would be to align with the Python snippet:

from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="hf-inference",
    api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxx",
)
output = client.automatic_speech_recognition("sample1.flac", model="openai/whisper-large-v3-turbo")

Contributor Author


opened #1294 as a follow-up issue

return result;
}

query({ inputs: "My name is Sarah Jessica Parker but you can call me Jessica" }).then((response) => {
Contributor


😄 (not a change request)

Suggested change
query({ inputs: "My name is Sarah Jessica Parker but you can call me Jessica" }).then((response) => {
query({ inputs: "My name is Giovanni Giorgio, but everybody calls me Giorgio" }).then((response) => {


const data = fs.readFileSync("sample1.flac");

const output = await client.automaticSpeechRecognition({
Contributor


More general remark: the stand-alone methods (automaticSpeechRecognition) are correctly typed, while the methods on the InferenceClient class (client.automaticSpeechRecognition) are not.

Until this is fixed, I would advocate to use the stand-alone functions for better user experience

Contributor Author


opened #1294 as a follow-up issue

Comment on lines +1 to +11
import { InferenceClient } from "@huggingface/inference";

const client = new InferenceClient("{{ accessToken }}");

const output = await client.{{ methodName }}({
    model: "{{ model.id }}",
    inputs: {{ inputs.asObj.inputs }},
    provider: "{{ provider }}",
});

console.log(output);
Contributor


Following my remark about InferenceClient types - applies to all huggingface.js snippets I think

Suggested change

import { {{ methodName }} } from "@huggingface/inference";

const output = await {{ methodName }}({
    model: "{{ model.id }}",
    inputs: {{ inputs.asObj.inputs }},
    provider: "{{ provider }}",
    accessToken: "{{ accessToken }}"
});

console.log(output);

Contributor Author


opened #1294 as a follow-up issue

@@ -0,0 +1,21 @@
{% if provider == "hf-inference" %}
Contributor


What's the reason for not outputting a snippet when the provider is external?

Contributor Author

@Wauplin Wauplin Mar 18, 2025


hmm, I don't remember the reason 🤔 I reused what existed before (see https://huggingface.co/black-forest-labs/FLUX.1-schnell?inference_api=true&inference_provider=hf-inference&language=js vs https://huggingface.co/black-forest-labs/FLUX.1-schnell?inference_api=true&inference_provider=together&language=js).

Probably because inputs are not exactly the same depending on the provider. But now that we use makeRequestOptions it shouldn't be an issue anymore.

Contributor Author


fixed in 18d87cd

Wauplin and others added 7 commits March 18, 2025 12:16
Co-authored-by: Simon Brandeis <33657802+SBrandeis@users.noreply.github.com>
…b.com:huggingface/huggingface.js into generate-js-and-curl-snippets-using-templates
Co-authored-by: Simon Brandeis <33657802+SBrandeis@users.noreply.github.com>
Co-authored-by: Simon Brandeis <33657802+SBrandeis@users.noreply.github.com>
Contributor Author

Wauplin commented Mar 18, 2025

Thanks for the review @SBrandeis and @coyotte508! Given how big this PR is, I'd rather not update the current snippets too much. They are mainly based on what existed in js.ts and curl.ts. I have created #1294 to keep track of what has been mentioned above. I'll now merge into #1273 to proceed with the final cleanup.

@Wauplin Wauplin merged commit 186a347 into fix-openai-inference-snippets Mar 18, 2025
5 checks passed
@Wauplin Wauplin deleted the generate-js-and-curl-snippets-using-templates branch March 18, 2025 16:50