Commit

edits to jg12 page
janand-octo authored Sep 5, 2024
1 parent bdd6155 commit fd9a132
Showing 1 changed file with 13 additions and 14 deletions.
27 changes: 13 additions & 14 deletions fern/docs/media-gen-solution/rest-apis/generate/juggernautXII.mdx
@@ -5,12 +5,12 @@ slug: media-gen-solution/rest-apis/generate/juggernautXI

Generate an image using a Juggernaut XII model (SDXL based checkpoint).

OctoAI's Juggernaut XII API supports both `Full` and `Lightning` checkpoints for text-to-image and image-to-image use cases, and works with custom assets like LoRAs, VAEs, and textual inversions.
OctoAI's Juggernaut XII API supports `Full` checkpoints for text-to-image and image-to-image use cases, and works with custom assets like LoRAs, VAEs, and textual inversions.
It also supports ControlNets. For more details, refer to [SDXL-ControlNets](https://octo.ai/docs/media-gen-solution/rest-apis/generate/images/controlnet-sdxl).

Recommended values to get optimal results from Juggernaut XII models are:
- `steps` - Between 20 and 30 for the Juggernaut XII Full checkpoint and 4 to 6 for the Juggernaut XII Lightning checkpoint.
- `cfg_scale` - Between 3 and 6 for the Juggernaut XII Full checkpoint and 1 to 2 for the Juggernaut XII Lightning checkpoint.
- `steps` - Between 20 and 30 for the Juggernaut XII Full checkpoint.
- `cfg_scale` - Between 3 and 6 for the Juggernaut XII Full checkpoint.
- `sampler` - `DPM_PLUS_PLUS_2M_KARRAS`, though others will work as well. Regular samplers include `DDIM`,`DDPM`,`DPM_PLUS_PLUS_2M_KARRAS`,`DPM_SINGLE`,`DPM_SOLVER_MULTISTEP`,`K_EULER`, `K_EULER_ANCESTRAL`,`PNDM`,`UNI_PC`.
Premium samplers (2x price) include `DPM_2`, `DPM_2_ANCESTRAL`,`DPM_PLUS_PLUS_SDE_KARRAS`, `HEUN` and `KLMS`.
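
As a rough illustration of the recommendations above, the following Python sketch flags settings that fall outside the suggested ranges for the Full checkpoint. The helper name `check_recommended` is hypothetical and not part of any OctoAI SDK; the ranges and sampler names are copied from the list above.

```python
# Hypothetical helper (not an OctoAI API); values are taken from the list above.
REGULAR_SAMPLERS = {
    "DDIM", "DDPM", "DPM_PLUS_PLUS_2M_KARRAS", "DPM_SINGLE",
    "DPM_SOLVER_MULTISTEP", "K_EULER", "K_EULER_ANCESTRAL", "PNDM", "UNI_PC",
}
PREMIUM_SAMPLERS = {
    "DPM_2", "DPM_2_ANCESTRAL", "DPM_PLUS_PLUS_SDE_KARRAS", "HEUN", "KLMS",
}


def check_recommended(steps, cfg_scale, sampler):
    """Return a list of notes; an empty list means nothing was flagged."""
    notes = []
    if not 20 <= steps <= 30:
        notes.append(f"steps={steps} is outside the recommended 20-30 range")
    if not 3 <= cfg_scale <= 6:
        notes.append(f"cfg_scale={cfg_scale} is outside the recommended 3-6 range")
    if sampler in PREMIUM_SAMPLERS:
        notes.append(f"{sampler} is a premium sampler (2x price)")
    elif sampler not in REGULAR_SAMPLERS:
        notes.append(f"{sampler} is not a listed sampler")
    return notes


print(check_recommended(25, 4.5, "DPM_PLUS_PLUS_2M_KARRAS"))
```

The notes are advisory only: values outside the recommended ranges may still be accepted by the API.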

@@ -26,34 +26,34 @@ The headers of the request must include an Authentication Token in the authoriza
**Generating with a prompt**: Commonly referred to as **text-to-image**, this mode generates an image from text alone. The required parameters are:

- `prompt` - text to generate the image from
- `checkpoint` - Here you can specify the Juggernaut XI checkpoints (Full or Lightning) from the OctoAI asset library. Supported values: `octoai:rundiffusion-juggernaut-xi`, `octoai:rundiffusion-juggernaut-xi-lightning`
- `checkpoint` - Here you can specify the Juggernaut XII checkpoint from the OctoAI asset library. Supported value: `octoai:rundiffusion-juggernaut-xii`

**Generating with a prompt and an image**: Commonly referred to as **image-to-image**, this mode also generates an image from text but uses an existing image as the starting point. The required parameters are:

- `prompt` - text to generate the image from
- `checkpoint` - Here you can specify the Juggernaut XI checkpoints (Full or Lightning) from the OctoAI asset library. Supported values: `octoai:rundiffusion-juggernaut-xi`, `octoai:rundiffusion-juggernaut-xi-lightning`
- `checkpoint` - Here you can specify the Juggernaut XII checkpoint from the OctoAI asset library. Supported value: `octoai:rundiffusion-juggernaut-xii`
- `init_image` - the image to use as the starting point for the generation. Argument takes an image encoded as a string in base64 format.
- `strength` - controls how much influence the image parameter has on the output image
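
A minimal sketch of how an image-to-image request body might be assembled in Python. The function name `build_img2img_payload` is hypothetical; the field names and checkpoint value come from the parameter list above, and the `strength` bounds (greater than 0, at most 1.0, default 0.8) follow the parameter reference on this page.

```python
import base64


def build_img2img_payload(prompt, image_bytes, strength=0.8):
    """Assemble an image-to-image request body from raw image bytes."""
    if not 0 < strength <= 1.0:
        raise ValueError("strength must be greater than 0 and at most 1.0")
    return {
        "prompt": prompt,
        "checkpoint": "octoai:rundiffusion-juggernaut-xii",
        # The API expects the image as a base64-encoded string.
        "init_image": base64.b64encode(image_bytes).decode("ascii"),
        "strength": strength,
    }


# In practice the bytes would come from a file, e.g. open("input.jpg", "rb").read()
# ("input.jpg" is a placeholder name).
payload = build_img2img_payload("a watercolor fox", b"abc")
print(payload["init_image"])
```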

**Generating with a prompt and a custom asset**: This mode generates an image from text but uses either a custom checkpoint, LoRA, textual inversion, or VAE. Note that using a custom asset increases generation time.

- `prompt` - text to generate the image from
- `checkpoint` - Here you can specify the Juggernaut XI checkpoints (Full or Lightning) from the OctoAI asset library. Supported values: `octoai:rundiffusion-juggernaut-xi`, `octoai:rundiffusion-juggernaut-xi-lightning`
- `checkpoint` - Here you can specify the Juggernaut XII checkpoint from the OctoAI asset library. Supported value: `octoai:rundiffusion-juggernaut-xii`
- `loras` - Here you can specify LoRAs, in name-weight pairs, either from the OctoAI asset library or your private asset library.
- `textual_inversions` - Here you can specify textual inversions and their corresponding trigger words.
- `vae` - Here you can specify variational autoencoders.

**Inpainting with a prompt**: Inpainting replaces or edits specific areas of an image. This makes it a useful tool for image restoration like removing defects and artifacts, or even replacing an image area with something entirely new. Inpainting relies on a mask to determine which regions of an image to fill in. The required parameters are:

- `prompt` - text to generate the image from
- `checkpoint` - Here you can specify the Juggernaut XI checkpoints (Full or Lightning) from the OctoAI asset library. Supported values: `octoai:rundiffusion-juggernaut-xi`, `octoai:rundiffusion-juggernaut-xi-lightning`
- `checkpoint` - Here you can specify the Juggernaut XII checkpoint from the OctoAI asset library. Supported value: `octoai:rundiffusion-juggernaut-xii`
- `init_image` - the image to use as the starting point for the generation. Argument takes an image encoded as a string in base64 format.
- `mask_image` - area of the picture to inpaint. Argument takes an image encoded as a string in base64 format.

**Outpainting with a prompt**: Outpainting is the process of using an image generation model like Stable Diffusion to extend beyond the canvas of an existing image. Outpainting is very similar to inpainting, but instead of generating a region within an existing image, the model generates a region outside of it. The required parameters are:

- `prompt` - text to generate the image from
- `checkpoint` - Here you can specify the Juggernaut XI checkpoints (Full or Lightning) from the OctoAI asset library. Supported values: `octoai:rundiffusion-juggernaut-xi`, `octoai:rundiffusion-juggernaut-xi-lightning`
- `checkpoint` - Here you can specify the Juggernaut XII checkpoint from the OctoAI asset library. Supported value: `octoai:rundiffusion-juggernaut-xii`
- `init_image` - the existing image you’d like to outpaint. You need to create a source image that places your original image within a larger canvas. Argument takes an image encoded as a string in base64 format.
- `mask_image` - a black and white mask representing the extended area. Argument takes an image encoded as a string in base64 format.
- `outpainting` - Argument takes a boolean value indicating whether the request requires outpainting. If so, special preprocessing is applied for better results. Defaults to `false`; set it to `true` to use outpainting.
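
The canvas and mask preparation described above can be sketched with plain Python arithmetic. This is only an illustration under stated assumptions: the original image is centered on the larger canvas, and white (255) marks the extended area while black (0) marks the original image. The page does not state which color denotes the area to generate, so verify the convention before relying on it. Both function names are hypothetical.

```python
def center_box(orig_w, orig_h, canvas_w, canvas_h):
    """Return (left, top, right, bottom) of the original image centered on the canvas."""
    if orig_w > canvas_w or orig_h > canvas_h:
        raise ValueError("canvas must be at least as large as the original image")
    left = (canvas_w - orig_w) // 2
    top = (canvas_h - orig_h) // 2
    return left, top, left + orig_w, top + orig_h


def build_mask(canvas_w, canvas_h, box):
    """Build a grayscale mask as rows of pixel values.

    Assumption: 255 (white) = extended area, 0 (black) = original image.
    """
    left, top, right, bottom = box
    return [
        [0 if left <= x < right and top <= y < bottom else 255
         for x in range(canvas_w)]
        for y in range(canvas_h)
    ]


box = center_box(512, 512, 1024, 1024)
print(box)
```

The resulting rows would still need to be written out as an image file and base64-encoded before being sent as `mask_image`, with `outpainting` set to `true` in the request.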
@@ -71,7 +71,6 @@ The resolution of the generated image will be 1 megapixel. The default resolutio
### **Pricing**

- Juggernaut XII (Full): ***$0.0104*** per image, 1024x1024, 30 steps; billed per image
- Juggernaut XII (Lightning): ***$0.0065*** per image, 1024x1024, 30 steps; billed per image
- Fine tuning job SDXL: ***$0.25*** per tune, using the 500 step default
- Inpainting: Same price as corresponding Juggernaut XII image
- Outpainting: Same price as corresponding Juggernaut XII image
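
For a back-of-the-envelope estimate, the per-image price can be multiplied out as in the sketch below. It assumes the listed Full price applies unchanged to every image at 1024x1024 and 30 steps, and that the 2x premium-sampler multiplier mentioned on this page applies to the per-image price; other settings may be billed differently, so treat the result as an approximation. `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

# Price for Juggernaut XII (Full), as listed above.
FULL_PRICE_PER_IMAGE = Decimal("0.0104")
PREMIUM_SAMPLERS = {
    "DPM_2", "DPM_2_ANCESTRAL", "DPM_PLUS_PLUS_SDE_KARRAS", "HEUN", "KLMS",
}


def estimate_cost(num_images, sampler="DDIM"):
    """Estimate the cost in USD; premium samplers are assumed to double the price."""
    multiplier = 2 if sampler in PREMIUM_SAMPLERS else 1
    return FULL_PRICE_PER_IMAGE * num_images * multiplier


print(estimate_cost(4))          # 4 images, regular sampler
print(estimate_cost(4, "HEUN"))  # 4 images, premium sampler
```

`Decimal` is used instead of `float` so that small currency amounts do not pick up binary rounding error.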
@@ -91,11 +90,11 @@ Check [Pricing Page](https://octo.ai/docs/getting-started/pricing-and-billing) f
- `prompt` (string [up to 77 tokens], Required): A string of text describing the image to generate. You can use prompt weighting, e.g. `(A tall (beautiful:1.5) woman:1.0) (some other prompt with weight:0.8)`. The weight of a token is the product of the weights of all brackets it is a member of. The brackets, colons and weights do not count towards the number of tokens.
- `negative_prompt` (string, Optional): Text describing image traits to avoid during generation.
- `sampler` (string, Optional): A string specifying which scheduler to use when generating an image. Defaults to `DDIM`. Regular samplers include `DDIM`,`DDPM`,`DPM_PLUS_PLUS_2M_KARRAS`,`DPM_SINGLE`,`DPM_SOLVER_MULTISTEP`,`K_EULER`, `K_EULER_ANCESTRAL`,`PNDM`,`UNI_PC`. Premium samplers (2x price) include `DPM_2`, `DPM_2_ANCESTRAL`,`DPM_PLUS_PLUS_SDE_KARRAS`, `HEUN` and `KLMS`.
- `cfg_scale` (double, Optional): Floating-point number representing how closely to adhere to the prompt description. Must be a positive number no greater than 50.0. Recommended value is between 3 and 6 for the Juggernaut XII Full checkpoint and between 1 and 2 for the Juggernaut XII Lightning checkpoint.
- `cfg_scale` (double, Optional): Floating-point number representing how closely to adhere to the prompt description. Must be a positive number no greater than 50.0. Recommended value is between 3 and 6 for the Juggernaut XII Full checkpoint.
- `image_encoding` (enum, Optional): Define which encoding process should be applied before returning the generated image(s). Allowed values: `jpeg` `png`
- `num_images` (integer, Optional): Integer representing how many output images to generate with a single prompt/configuration. Defaults to 1. Allowed values: 1-16.
- `seed` (union, Optional): Integer or list of integers representing the seeds of the random generators. Fixing the random seed is useful when attempting to reproduce a specific image. Must be greater than 0 and less than 2^32.
- `steps` (integer, Optional Defaults to 30): Integer representing how many steps of diffusion to run. Must be greater than 0 and less than or equal to 200. Recommended value is between 20 and 30 for the Juggernaut XII Full checkpoint and between 4 and 6 for the Juggernaut XII Lightning checkpoint.
- `steps` (integer, Optional Defaults to 30): Integer representing how many steps of diffusion to run. Must be greater than 0 and less than or equal to 200. Recommended value is between 20 and 30 for the Juggernaut XII Full checkpoint.
- `init_image` (string, Optional): The image (encoded as a base64 string) to use as the starting point for the generation. This parameter is for Image-to-Image generation and Inpainting.
<Note> Use .jpg format to ensure best latency </Note>
- `strength` (double, Optional): Floating-point number indicating how creative the Image-to-Image generation should be. Must be greater than 0 and less than or equal to 1.0. Defaults to 0.8. This parameter is for Image-to-Image generation.
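
To make the prompt-weighting rule for `prompt` concrete (a token's weight is the product of the weights of every bracket that contains it), here is a small illustrative parser. It is a sketch, not the service's actual implementation: it splits on whitespace rather than using the model's tokenizer, and it treats a bracket closed without an explicit weight as 1.0, which is an assumption.

```python
import re

# "(" opens a group, ":<number>)" closes it with a weight, ")" closes it unweighted.
TOKEN = re.compile(r"\(|:(\d+(?:\.\d+)?)\)|\)|[^\s():]+")


def token_weights(prompt):
    """Return (word, effective_weight) pairs for a weighted prompt."""
    words = []        # mutable [word, weight] entries, in prompt order
    open_groups = []  # index into `words` where each open bracket started
    for match in TOKEN.finditer(prompt):
        tok = match.group(0)
        if tok == "(":
            open_groups.append(len(words))
        elif tok.startswith(":") or tok == ")":
            if not open_groups:
                continue  # unbalanced closing bracket: ignored in this sketch
            start = open_groups.pop()
            weight = float(match.group(1)) if match.group(1) else 1.0
            # Every word inside this bracket, including nested ones, is scaled.
            for entry in words[start:]:
                entry[1] *= weight
        else:
            words.append([tok, 1.0])
    return [(word, weight) for word, weight in words]


example = "(A tall (beautiful:1.5) woman:1.0) (some other prompt with weight:0.8)"
for word, weight in token_weights(example):
    print(word, weight)
```

In the example prompt, `beautiful` sits inside two brackets and so receives 1.5 × 1.0 = 1.5, while every word in the second bracket receives 0.8.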
@@ -136,7 +135,7 @@ curl -X POST "https://image.octoai.run/generate/sdxl" \
--data-raw '{
"prompt": "A still frame from a commercial of a DeLorean parked in bustling times square, rainy night shot with droplets of water falling",
"negative_prompt": "Blurry, low-res, poor quality",
"checkpoint": "octoai:rundiffusion-juggernaut-xi",
"checkpoint": "octoai:rundiffusion-juggernaut-xii",
"width": 1024,
"height": 1024,
"num_images": 1,
@@ -163,7 +162,7 @@ def _process_test(url):
payload = {
"prompt": "A still frame from a commercial of a DeLorean parked in bustling times square, rainy night shot with droplets of water falling",
"negative_prompt": "Blurry, low-res, poor quality",
"checkpoint": "octoai:rundiffusion-juggernaut-xi",
"checkpoint": "octoai:rundiffusion-juggernaut-xii",
"width": 1024,
"height": 1024,
"num_images": 1,
@@ -208,7 +207,7 @@ const octoai = new OctoAIClient({
const { images } = await octoai.imageGen.generateSdxl({
prompt: "A still frame from a commercial of a DeLorean parked in bustling times square, rainy night shot with droplets of water falling",
negative_prompt: "Blurry, low-res, poor quality",
checkpoint: "octoai:rundiffusion-juggernaut-xi",
checkpoint: "octoai:rundiffusion-juggernaut-xii",
width: 1024,
height: 1024,
num_images: 1,