Enhance Your Images with FOFR SDXL Fine-Tuning Cognitive Actions

24 Apr 2025
Enhance Your Images with FOFR SDXL Fine-Tuning Cognitive Actions

In the realm of image generation and modification, the fofr/sdxl-text API provides powerful Cognitive Actions that enable developers to fine-tune images based on text prompts. The standout capability here is the Execute SDXL Fine-Tuning action, which allows for intricate image processing through customizable parameters. This action is particularly adept at handling the word "FOFR" within a variety of image contexts, leveraging features like inpainting and img2img modes. By utilizing these pre-built actions, developers can save time and effort while enhancing their applications with advanced image processing functionalities.

Prerequisites

To get started with the Cognitive Actions provided by the fofr/sdxl-text API, developers will need:

  • An API key for the Cognitive Actions platform.
  • Basic knowledge of sending HTTP requests.
  • Familiarity with JSON format, as the input and output will be structured as JSON objects.

Authentication typically involves passing the API key in the request headers, ensuring secure access to the Cognitive Actions.

Cognitive Actions Overview

Execute SDXL Fine-Tuning

The Execute SDXL Fine-Tuning action allows users to perform exploratory fine-tuning on text prompts with a focus on the word "FOFR". This action supports both image processing modes: img2img and inpainting. Developers can customize various parameters, including image dimensions, guidance scales, and even the application of watermarks.

Input

The input for this action is structured as follows:

{
  "mask": "http://example.com/mask.png",
  "seed": 12345,
  "image": "http://example.com/input_image.png",
  "width": 1024,
  "height": 1024,
  "prompt": "The word FOFR, 3d text in a forest",
  "refineStyle": "no_refiner",
  "loraIntensity": 0.6,
  "opposingPrompt": "Something unwanted",
  "enableWatermark": true,
  "promptIntensity": 0.8,
  "refinementSteps": 10,
  "outputImageCount": 1,
  "guidanceIntensity": 7.5,
  "highNoiseFraction": 0.8,
  "selectionScheduler": "K_EULER",
  "inferenceStepsCount": 50
}
  • Required Fields: prompt, image, width, height
  • Optional Fields: mask, seed, refineStyle, loraIntensity, opposingPrompt, enableWatermark, promptIntensity, refinementSteps, outputImageCount, guidanceIntensity, highNoiseFraction, selectionScheduler, inferenceStepsCount

Example Input

Here’s a practical example of the input JSON payload needed to invoke this action:

{
  "width": 1024,
  "height": 1024,
  "prompt": "The word FOFR, 3d text in a forest",
  "refineStyle": "no_refiner",
  "loraIntensity": 0.6,
  "enableWatermark": true,
  "promptIntensity": 0.8,
  "outputImageCount": 1,
  "guidanceIntensity": 7.5,
  "highNoiseFraction": 0.8,
  "selectionScheduler": "K_EULER",
  "inferenceStepsCount": 50
}

Output

The response from the action typically returns an array of image URLs:

[
  "https://assets.cognitiveactions.com/invocations/3722d9ce-e83d-4c59-b876-182f33040d7c/9bdba6ce-6f9e-4772-856c-62d82177f9b1.png"
]

This output provides the generated images based on the specifications provided in the input.

Conceptual Usage Example (Python)

Here’s how you might call this action using Python:

import requests
import json

# Replace with your Cognitive Actions API key and endpoint
COGNITIVE_ACTIONS_API_KEY = "YOUR_COGNITIVE_ACTIONS_API_KEY"
COGNITIVE_ACTIONS_EXECUTE_URL = "https://api.cognitiveactions.com/actions/execute"  # Hypothetical endpoint

action_id = "91f3adb4-1f3c-4e25-bd4e-d2cc8d98b559"  # Action ID for Execute SDXL Fine-Tuning

# Construct the input payload based on the action's requirements
payload = {
    "width": 1024,
    "height": 1024,
    "prompt": "The word FOFR, 3d text in a forest",
    "refineStyle": "no_refiner",
    "loraIntensity": 0.6,
    "enableWatermark": True,
    "promptIntensity": 0.8,
    "outputImageCount": 1,
    "guidanceIntensity": 7.5,
    "highNoiseFraction": 0.8,
    "selectionScheduler": "K_EULER",
    "inferenceStepsCount": 50
}

headers = {
    "Authorization": f"Bearer {COGNITIVE_ACTIONS_API_KEY}",
    "Content-Type": "application/json"
}

try:
    response = requests.post(
        COGNITIVE_ACTIONS_EXECUTE_URL,
        headers=headers,
        json={"action_id": action_id, "inputs": payload}  # Hypothetical structure
    )
    response.raise_for_status()  # Raise an exception for bad status codes (4xx or 5xx)

    result = response.json()
    print("Action executed successfully:")
    print(json.dumps(result, indent=2))

except requests.exceptions.RequestException as e:
    print(f"Error executing action {action_id}: {e}")
    if e.response is not None:
        print(f"Response status: {e.response.status_code}")
        try:
            print(f"Response body: {e.response.json()}")
        except json.JSONDecodeError:
            print(f"Response body: {e.response.text}")

In this example, the action ID and input payload are clearly defined, and the code demonstrates how to send a request to the Cognitive Actions endpoint. Note that the endpoint URL and request structure are illustrative and may vary based on actual implementation.

Conclusion

The Execute SDXL Fine-Tuning action offers developers a robust way to enhance images by leveraging customizable text prompts and various image processing features. By integrating this action into your applications, you can provide users with dynamic image generation capabilities tailored to their needs. Explore further use cases such as creating unique art, modifying existing images, or automating design tasks to fully harness the power of FOFR SDXL Fine-Tuning!