Enhance Your Images with the microphonist/xray_model Cognitive Actions

24 Apr 2025
Enhance Your Images with the microphonist/xray_model Cognitive Actions

In the world of image processing, the ability to enhance and modify images with precision and creativity is invaluable. The microphonist/xray_model offers a set of Cognitive Actions designed to facilitate image manipulation through advanced techniques, including inpainting. With these actions, developers can easily integrate sophisticated image generation capabilities into their applications, allowing for customizable output and improved user experiences.

Prerequisites

To start using the Cognitive Actions provided by the microphonist/xray_model, you will need the following:

  • An API key for the Cognitive Actions platform to authenticate your requests.
  • Basic familiarity with JSON structure and HTTP requests.
  • A Python environment with the requests library installed.

Authentication typically involves passing your API key in the request headers, allowing access to the various actions available.

Cognitive Actions Overview

Generate Image with Inpainting

Description:
This action enables the creation of enhanced images using mask-based inpainting. You can customize dimensions and optimize model settings for either speed or precision, and it also supports additional LoRA weight integration for more detailed outputs.

Category: image-processing

Input

The input for this action requires a JSON object with the following schema:

{
  "prompt": "string (required)",
  "mask": "string (optional, uri)",
  "seed": "integer (optional)",
  "image": "string (optional, uri)",
  "model": "string (optional, default: 'dev')",
  "width": "integer (optional, 256-1440)",
  "goFast": "boolean (optional, default: false)",
  "height": "integer (optional, 256-1440)",
  "loraScale": "number (optional, default: 1, range: -1 to 3)",
  "megapixels": "string (optional, default: '1')",
  "aspectRatio": "string (optional, default: '1:1')",
  "outputFormat": "string (optional, default: 'webp')",
  "guidanceScale": "number (optional, default: 3, range: 0 to 10)",
  "outputQuality": "integer (optional, default: 80, range: 0 to 100)",
  "promptStrength": "number (optional, default: 0.8, range: 0 to 1)",
  "numberOfOutputs": "integer (optional, default: 1, range: 1 to 4)",
  "additionalWeights": "string (optional)",
  "additionalLoraScale": "number (optional, default: 1, range: -1 to 3)",
  "disableSafetyChecker": "boolean (optional, default: false)",
  "numberOfInferenceSteps": "integer (optional, default: 28, range: 1 to 50)"
}

Example Input:

{
  "model": "dev",
  "goFast": false,
  "prompt": "XRYxray of a man smiling ",
  "loraScale": 1,
  "megapixels": "1",
  "aspectRatio": "1:1",
  "outputFormat": "webp",
  "guidanceScale": 5.46,
  "outputQuality": 80,
  "promptStrength": 0.8,
  "numberOfOutputs": 1,
  "additionalLoraScale": 1,
  "numberOfInferenceSteps": 40
}

Output

The action will return a JSON array containing the URIs of the generated images. For example:

[
  "https://assets.cognitiveactions.com/invocations/a2dc1752-81cc-45bf-ade9-0837dab2b9d1/9913896e-17b2-4836-8348-8795af64a130.webp"
]

Conceptual Usage Example (Python)

Here’s how you might call the Generate Image with Inpainting action using Python:

import requests
import json

# Replace with your Cognitive Actions API key and endpoint
COGNITIVE_ACTIONS_API_KEY = "YOUR_COGNITIVE_ACTIONS_API_KEY"
COGNITIVE_ACTIONS_EXECUTE_URL = "https://api.cognitiveactions.com/actions/execute" # Hypothetical endpoint

action_id = "eac66e51-7998-43ce-9edf-44aa6d00cd28" # Action ID for Generate Image with Inpainting

# Construct the input payload based on the action's requirements
payload = {
  "model": "dev",
  "goFast": False,
  "prompt": "XRYxray of a man smiling ",
  "loraScale": 1,
  "megapixels": "1",
  "aspectRatio": "1:1",
  "outputFormat": "webp",
  "guidanceScale": 5.46,
  "outputQuality": 80,
  "promptStrength": 0.8,
  "numberOfOutputs": 1,
  "additionalLoraScale": 1,
  "numberOfInferenceSteps": 40
}

headers = {
    "Authorization": f"Bearer {COGNITIVE_ACTIONS_API_KEY}",
    "Content-Type": "application/json"
}

try:
    response = requests.post(
        COGNITIVE_ACTIONS_EXECUTE_URL,
        headers=headers,
        json={"action_id": action_id, "inputs": payload}
    )
    response.raise_for_status()  # Raise an exception for bad status codes (4xx or 5xx)

    result = response.json()
    print("Action executed successfully:")
    print(json.dumps(result, indent=2))

except requests.exceptions.RequestException as e:
    print(f"Error executing action {action_id}: {e}")
    if e.response is not None:
        print(f"Response status: {e.response.status_code}")
        try:
            print(f"Response body: {e.response.json()}")
        except json.JSONDecodeError:
            print(f"Response body: {e.response.text}")

In this code snippet:

  • Replace YOUR_COGNITIVE_ACTIONS_API_KEY with your actual API key.
  • The action_id is set to the specific action you want to call.
  • The payload is structured according to the action's input schema, ensuring all required fields are included.

Conclusion

The microphonist/xray_model Cognitive Actions provide powerful tools for developers looking to enhance and manipulate images programmatically. By leveraging the Generate Image with Inpainting action, you can create visually stunning outputs tailored to your application's needs. Whether you're building creative applications, automated workflows, or enhancing user-generated content, these actions open up new possibilities for image processing.

Explore how you can integrate these capabilities into your projects today, and consider experimenting with additional parameters to achieve optimal results!