Case study - III

Cloudhands

"Can he structure complexity?"

From Models to Outcomes: Designing a Unified AI Experience

Leading design for a 0→1 multi-modal AI platform that unified image, video, audio, text, and code generation into a single guided workflow.

Role

Head of Design

Team

Product, Engineering, PM

Timeline

2025

Focus

AI UX, multi-modal workflows, 0→1

Outcome

Fragmented tools → unified platform in beta

01 - Context

Cloudhands is a multi-modal AI platform designed to simplify how users interact with AI tools.

Instead of navigating separate products for text, images, video, and code, the platform brings everything into a single, unified experience, enabling users to focus on outcomes rather than underlying models.

02 - The Problem

AI capabilities are powerful but fragmented.

Users often need:

  • Multiple tools

  • Multiple subscriptions

  • Different interaction patterns

To accomplish even simple tasks, users must understand which model to use and how to use it—creating unnecessary complexity.

03 - Key Insight

Users don’t care about models—they care about outcomes.

This insight reframed the product from a collection of tools into a cohesive system designed around user intent.

04 - Exploration

Early design exploration focused on how to unify diverse AI capabilities into a single, understandable experience.

Key questions included:

  • Should the experience be tool-based or intent-based?

  • How much should users control vs. automate?

  • Can different modalities share a common interaction model?

05 - Core System

The platform evolved into a multi-layered system that balances flexibility and consistency:

Multi-Modal Generation

  • Chat

  • Image

  • Speech

  • Audio

  • Code

All generation types were designed to feel cohesive while supporting their unique outputs.

Assistant Layer

A conversational interface that acts as a central entry point for interacting with different capabilities.

Recipes System

v1: Pre-filled prompts for quick access

v2: Structured forms for guided generation

This allowed users to move from exploration to repeatable workflows.

Model Comparison (Arena)

A feature enabling users to compare outputs across different models, helping them understand tradeoffs without requiring deep technical knowledge.

06 - concept Exploration

One of the biggest challenges was designing across multiple modalities without fragmenting the experience.

We explored:

  • Dedicated interfaces per tool vs. shared layouts

  • Prompt-first vs. guided workflows

  • Flexible vs. structured input systems

This led to a hybrid approach that balances consistency for usability with flexibility for power users.

Expanding Beyond Tools

Future directions explored extending the platform into domain-specific applications:

  • Health & wellness

  • Personal finance

  • Career development

  • Relationships & communication

  • Home & lifestyle

  • Education

This shifts the platform from a tool aggregator to an AI-powered life operating system.

Multi-Step Intelligence

Another direction explored multi-step prompts—where outputs from one action feed into the next—enabling more complex workflows and automation.

07 - Tradeoffs

Innovation vs. Speed

To move quickly, we prioritized consistency across experiences rather than building fully specialized interfaces for each modality.

  • Benefit: Faster development and easier scalability

  • Tradeoff: Less tailored experiences for specific use cases

Feature Expansion vs. Deep Insight

We focused heavily on expanding capabilities to differentiate the product.

  • Benefit: Broader feature set and competitive positioning

  • Tradeoff: Less time spent deeply analyzing user behavior and optimizing core flows

08 - Outcome

Cloudhands reached beta as a unified AI platform capable of supporting multiple modalities within a single system.

The product successfully demonstrated how a cohesive experience can reduce friction and make AI tools more accessible, even in a rapidly evolving space.

09 - Whats Next

Future iterations focus on increasing intelligence and automation:

  • More personalized AI workflows

  • Smarter orchestration across tools

  • Autonomous multi-step task execution

The long-term vision is to evolve from a unified interface into a proactive, intelligent system that anticipates user needs.

10 - What i'd do differently

Reflecting on the process, there are several areas I would approach differently to strengthen both product direction and outcomes:

Lead with data earlier

Begin with deeper user conversations and behavioral insights to ground decisions in real user needs rather than assumptions.

Define clearer testing expectations

Establish upfront timelines and success criteria for experiments to ensure we gather meaningful data before making directional decisions.

Narrow the MVP scope

Focus on a smaller set of high-impact features to validate core value more quickly, rather than expanding feature breadth early in the product lifecycle.

— LET'S TALK —

Have a problem

Have a problem

worth exploring?

worth exploring?

worth exploring?

RitzieWilliams@gmail.com

RitzieWilliams@gmail.com

© MMXXVI · Ritzie Williams