There’s been a lot of buzz around Manus lately, positioning it as a go-to AI solution for complex tasks like coding and deep research. Naturally, I was curious. Does it live up to the hype, or is it another case of sizzle over steak? I decided to put it through its paces, comparing it with some other tools I often use. Let’s see how it fared.
First up, coding. I decided to give Manus a reasonably complex, real-world-ish task using a zero-shot approach – no hand-holding, just the prompt. I wanted to see how close it could get straight out of the box.
I asked Manus to tackle a multi-part development task:
Well, it certainly took its time. After chewing through the prompt, Manus reported using 594 “credits.” Based on their pricing ($39 for 3,900 credits, https://manus.im/), that single attempt cost roughly $5.94.
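If you want the back-of-the-envelope math spelled out, here it is as a tiny Python sketch. It simply assumes the $39 / 3,900-credit plan from the pricing page is the relevant one; the numbers are the ones quoted above, nothing more.

```python
# Rough per-attempt cost, assuming the $39 / 3,900-credit Manus plan.
PRICE_PER_CREDIT = 39 / 3900   # ~= $0.01 per credit
credits_used = 594             # credits reported for this single coding attempt

print(f"Estimated cost: ${credits_used * PRICE_PER_CREDIT:.2f}")  # -> $5.94
```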
And the deliverable? Let’s just say it wasn’t quite ready for production. Or development. Or staging. The solution simply didn’t work. It seemed to stumble over some package names and apparently didn’t think to quickly check the internet to resolve ambiguities.
Looking at the output, my gut feeling is that the underlying model might not be playing in the same league as heavyweights like Claude 3.7 Sonnet, Gemini 2.5 Pro, or GPT-4.1. For coding tasks like this, I’d probably stick with something like Windsurf paired with a strong model like Claude 3.7 Sonnet – my experience suggests that combo is not only more effective but likely cheaper too.
Okay, maybe coding isn’t Manus’s forte. The chatter suggests it excels at deep research. So, I gave it a task requiring detailed, multi-faceted information gathering.
I needed a comprehensive guide on registering a Limited Liability Partnership (LLP) in India specifically for exporting software services to the EU. I asked for everything:
I ran a similar test using OpenAI’s DeepResearch feature. Both tools took a fair bit of time, which is expected for deep dives. OpenAI, however, started by asking some clarifying questions to refine the scope – a sensible approach.
Manus, on the other hand, gave me a running commentary on its process, including showing the specific web pages it was browsing. Honestly, this felt more like a gimmick than a genuinely useful feature for this application. Did I really need to see the browser window flicker?
Manus clocked in at 424 credits for this task, translating to about $4.24. How does that compare? Estimating OpenAI’s cost is tricky, as DeepResearch is bundled into the ChatGPT Plus subscription. But if we very roughly allocate the $20 subscription cost across its 10 monthly DeepResearch uses (ignoring all other GPT-4 usage, DALL-E, etc.), we could ballpark it at ~$2 per research task. Even with caveats, Manus looks pricier here too.
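Here’s the same rough arithmetic, with the obvious caveat baked in: the OpenAI figure pretends the entire $20 subscription exists only for its ten monthly DeepResearch runs, which flatters neither tool but keeps the comparison simple.

```python
# Back-of-the-envelope comparison for the research task (all figures approximate).
manus_cost = 424 * (39 / 3900)   # 424 credits at ~$0.01 each -> ~$4.24
openai_cost = 20 / 10            # $20 plan spread over 10 DeepResearch runs -> ~$2.00,
                                 # ignoring everything else the subscription includes

print(f"Manus ~ ${manus_cost:.2f} vs. OpenAI DeepResearch ~ ${openai_cost:.2f}")
```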
More importantly, based on this experience, tools like OpenAI DeepResearch (and likely Google’s equivalent, though untested here) seem better suited for this kind of in-depth research task.
Based on my little experiment, I’m left unconvinced.
For Coding: It struggled significantly, produced non-working code, and seemed expensive for the attempt. Tools like Windsurf combined with top-tier models feel more reliable and cost-effective.
For Research: While it completed the task, the process felt less refined than competitors like OpenAI DeepResearch, and it still came out looking like the more expensive option.
My takeaway? Manus might be getting a lot of attention, but in its current state, it seems like a costlier and less capable alternative to other established solutions for both coding and research. The hype, for now, seems greater than the reality.