My Coding Betting Dashboard is Profiting, but Polymarket is Truly Not a Good Place for 'Arbitrage'

marsbitPublished on 2026-06-18Last updated on 2026-06-18

Abstract

The author built a custom monitoring dashboard for Polymarket, a prediction market platform, and tested it with $1,600, achieving over 30% returns. However, the core argument is that Polymarket is not a good venue for traditional arbitrage. The dashboard has two main sections: a "Portfolio Dashboard" for tracking active positions with key metrics like total capital, P&L, and a risk-control module using a tier system (T1, T2, T3), and an "Opportunity Watchlist" for monitoring markets. The article details a critical structural trap in binary markets: a bet with a high perceived probability of success still carries a 100% loss risk if wrong. The author's T1/T2/T3 system is designed to manage this by limiting position sizes based on conviction and time horizon, emphasizing that high confidence should not equal high concentration. A key insight is the danger of "pseudo-diversification"—betting on different markets driven by the same underlying variable. The author concludes that Polymarket offers few true low-risk, arbitrage opportunities. It is instead a high-risk environment where wins can create a false sense of mastery, leading to large losses. The platform is better viewed as a training ground for honing judgment through disciplined, framework-driven betting rather than a reliable income source. The tools help transform intuition into structured, rule-based decisions to mitigate the risk of catastrophic errors.

Author: Too Happy Tyler

A couple of days ago, in the article "I Built an Investment Workstation for Myself Using AI," I shared several tools I coded: a cross-market asset dashboard, an investment map, a personal content operations console, and a Polymarket betting monitoring dashboard that I've been using frequently lately.

Over the past half month, I used about $1600 in principal for real-world testing and generated over 30% return. The real-time statistics on the dashboard and the final actual net profit basically matched, with only a discrepancy of around $6, which was due to small errors like limit orders/liquidity rewards and such.

But what I actually want to say in this article is not "Polymarket is great for making money," and I certainly don't want to package it as some arbitrage tutorial.

On the contrary, after running through this round, I increasingly feel that Polymarket is not a place suitable for charging in with the notion of 'arbitrage' in mind.

1. First, let's talk about what this dashboard is about

I started manually building this dashboard around May 21.

The initial need was simple: I didn't want to open a dozen betting pages each time and switch back and forth between yes/no price changes, and I also didn't want to manually record everything in Excel anymore.

Yes, before this, I was using Excel to record buys/sells, floating P&L, settlement dates, event types—the clumsy method for a clumsy person.

But as those who have actually played on it know, many bets on Polymarket are extremely prone to getting out of control precisely because manual recording methods are functionally poor. For example, you might initially just want to buy a little, but when the odds move, you might want to add more, given the lack of an intuitive feel. Or, if a specific betting event suddenly moves and you don't update the spreadsheet data in time, it's easy to miss the window for stop-loss/adding positions, and so on.

Ultimately, the whole process is too fragmented. Without a system, it's easy for people to rely on emotions to place orders.

So from the start of building this dashboard, my goal was to place every bet back into a unified framework, turning that feeling into a relatively visualizable and horizontally comparable information presentation.

After several iterations, I split it into two tabs: "Portfolio Dashboard" + "Opportunity Monitor."

"Portfolio Dashboard" is the core of the entire panel, a dynamic system that can fetch PM's real-time data and recalculate. It's divided into several functional areas (you can refer to the image at the beginning of the article):

  • Overview Bar: Total principal (planned, actual reference is minimal), invested principal, portfolio value, portfolio floating P&L, total floating P&L (including closed positions)—a quick glance at the full account picture.
  • Tier Allocation: This is the core risk management module of the dashboard, and in my opinion, the most counterintuitive yet important area. I'll keep you in suspense and explain it in detail in the next section.
  • Theme Cluster Exposure: I tagged each bet with a "Theme Cluster" label, categorized into East Asia, Middle East, Crypto, US Stocks, Pre-IPO (customizable, can add/delete). The dashboard automatically aggregates the proportion of each cluster and sets a 12% single-cluster upper limit threshold. Why design it this way? Mainly to combat the most hidden trap on PM—pseudo-diversification. More on that later.
  • Single Position Details: Tier, direction (yes/no), buy price, settlement price, shares, profit/loss, buy date, settlement date, notes, etc. Each line is clear at a glance, and it supports sorting ascending/descending and filtering by tags.

"Opportunity Monitor" is a watchlist where I put markets I'm watching but haven't bet on yet.

Each market records several key fields, including event name (with a hyperlink directly to the trading page), T1/T2/T3 stratification judgment, current yes/no price, return, annualized return, anomalies (customizable thresholds, e.g., anomalies exceeding 20% within 24 hours trigger a pop-up as long as the webpage is open), my set observation nodes, and a countdown to bet expiration.

There are two small design features I'm quite satisfied with: First, I found a suitable PM interface to parse the webpage link of the betting event. It automatically prompts for yes/no options, corresponding prices, and categorizes different options under the same event, greatly reducing manual entry hassle. Second, the Tier assignment for the same bet automatically rearranges as the remaining days change.

Not long ago, before Anthropic released Mython, there were obvious price anomalies in the watchlist, basically indicating a high-probability, high-certainty event. Entering at that time could have yielded about a 10-point gain—such opportunities are hard to capture consistently without a watchlist.

2. PM's Mathematical Expectation Trap and the "T1, T2, T3" Design Principle

The above is a simple introduction. What I really want to discuss is a thought I had after practical testing.

That is, in binary markets like PM, there exists a significant structural trap that is very unfriendly to players who like "single large positions" but is more suitable for those accustomed to "opening a supermarket, buying a whole bunch"—diversified allocation.

I'll try to explain my thinking clearly. If there are any errors or omissions, just pretend you didn't see them:

Suppose the yes price c for a betting event is 0.80, meaning the market believes there's about an 80% probability this event will happen. If I judge the true probability q of this event occurring is 0.90, then the expected return rate for this bet can be roughly calculated as:

EV = q / c - 1 = 0.90/0.80 - 1 = 12.5%

This looks good, but PM is not a bond. Behind this 12.5% lies a sharp tail risk: if your judgment is wrong, the loss isn't 12.5%, but 100%.

So in my dashboard, I don't just look at "expected return rate." I also monitor two things simultaneously:

  • One is the gap between my own probability judgment and the market price, i.e., q - c (I have an automatic profit-taking reminder target set, which is the midpoint between the buy price and 100). This is the core of whether an edge truly exists.
  • The other is the impact on the total account if this event is wrong and the single position goes to zero.

The second reason is also the origin of the T1, T2, T3 stratification I mentioned in the first chapter.

Simply put, I divided them into three categories:

  • T1 High Conviction: For me, the comfort zone lies in matters involving East Asia and some geopolitics where I feel there's an information gap between East and West. After repeated verification, I add these.
  • T2 Relatively Stable: Some where I feel the current implied probability is significantly higher/lower than the actual yes/no pricing.
  • T3 Pure Speculation: Those with very high odds, and these shouldn't be held long. Best for betting against the trend (搏反搏), expecting them to revert to a certain price to capture short-term gains.

But it's also important to note that T1 has hidden costs, especially in long-term bets. For example, a T1-level bet might have a static return of 18%, but if settlement is in 180 days, the annualized IRR might only be 3–4%, worse than leaving the money idle. During that time, the capital is locked up, and you might miss later high-IRR opportunities.

So within T1, I further break it down into several time brackets (this part is purely personal thinking, so I won't share it further). Anyway, short-term T1-A can be allocated more, long-term T1-C needs restraint. Allocating too many low-IRR long-term bets is a hidden drag on capital efficiency.

T2 has edge, but leave room for "being wrong." The single-bet upper limit is 8–10%, meaning even if this bet loses completely, the total account loss is controlled within 10%, not affecting participation in future opportunities.

T3 odds are tempting, but use minimal position size to observe. Don't expect to make big money from it. Instead, it's for betting against the trend and capturing short-term reversion—keeping yourself continuously tracking high-odds events to accumulate a feel for such markets.

Overall, position limits are essentially about leaving space for the acceptable cost of "I might be wrong."

Here's a very counterintuitive but particularly important point: high conviction does not equal high position size. Even if you think an event has a 95% probability of happening, as long as there's a 5% chance of it going to zero, the position size must be restricted.

Take an extreme example: Suppose you make 10 consecutive bets you think have a 95% win rate. Each sounds very safe, but as long as they are independent of each other, the probability of getting at least one wrong is roughly 1 - 0.95^10 ≈ 40%.

If you do it enough, you'll eventually encounter that wrong one.

And that's just for independent events. In reality, many PM markets are not independent; they often have correlations. For example, "Will US-Iran talks reach an agreement?", "Will the Strait of Hormuz reopen?", "Will Middle East tensions escalate within the month?" These three bets look like three independent markets, but the underlying variable is almost the same—Middle East geopolitical policy direction. Once the judgment on that direction is wrong, all three bleed simultaneously.

This has been the biggest help for me—not improving win rate, but limiting myself from making big mistakes. Frankly, the core value of this dashboard isn't profit statistics; it's risk control.

3. My Real View on Polymarket After This Round

After this half-month of deep, hands-on testing, my biggest takeaway is: There are opportunities on Polymarket, but it's absolutely not the arbitrage paradise many imagine it to be.

Previously, when we played with on-chain arbitrage, the rules were mostly clear, and price misalignments could be locked in. But Polymarket is different. It heavily tests your logical understanding of the shifts in sentiment around a particular bet (this point feels hard to express precisely in words).

For example, regarding East Asian-related political and economic dynamics, Chinese-speaking users might indeed have some information gap advantage, which is worth exploring. But this doesn't guarantee you'll win. Polymarket ultimately settles not based on "your understood reality" but based on market rules and designated data sources (UMA's manipulation issues are also not uncommon).

Moreover, just because you think something is a sure thing in the Chinese context doesn't mean the definition is the same in the English rules, especially since the rule settings for each bet often contain textual traps.

So based on my actual experience, PM doesn't have that many arbitrage opportunities. It mainly relies on information gaps and position diversification. Even high-conviction bets can encounter black swans.

Once you encounter one, the principal is completely gone.

As a friend said, "In investment, even if there's only a 1% chance of going to zero, one shouldn't be complacent."

Because the mathematical expectation of that, in the long run, is negative.

So my understanding of PM is now more conservative:

  • First, don't treat it as a stable income tool, even for high-conviction bets. Especially after you win several times in a row, don't think you've found a cash machine. The scariest thing about binary markets is that they make you, after consecutive wins, mistakenly believe you can judge everything, leading to one large final position that wipes out all previous profits.
  • Second, don't equate high win rate with a good trade. An event with a 90% win rate, if the market price is already 0.95, might actually have negative expectation. Conversely, an event with only a 40% win rate, if the market only prices it at 0.20, might have positive expectation.
  • Third, don't ignore tail risk. This is especially important. Many people see 10%, 20% returns and think it's stable. But as long as being wrong on that trade means going to zero, it's not a low-risk return in the traditional sense (from this perspective, I even think there are no so-called low-risk opportunities on PM; every one is high-risk).
  • Fourth, avoid pseudo-diversification. Buying multiple different markets isn't necessarily diversification. For example, the three bets mentioned earlier—"Will US-Iran talks reach an agreement?", "Will the Strait of Hormuz reopen?", "Will Middle East tensions escalate within the month?"—look like three independent markets, but the underlying variable is almost the same.

So I'm now more willing to treat PM as a judgment training ground.

It happens to complement the political, economic, tech, and financial news I, as a homebody, love to browse daily, turning those judgments that usually stay at the "I think" level into something that can provide positive feedback.

These abilities are useful outside of PM as well.

By the way, besides this PM betting dashboard, I also used Codex to build a private market valuation dynamic monitoring dashboard, mainly tracking valuation changes of pre-IPO unicorn companies—Anthropic, OpenAI, Stripe, Kraken, etc.—in the private market, and the relationship between these changes and the corresponding bets on PM.

Polymarket is essentially a prediction market. Sometimes signals in the private market are already changing, but PM prices haven't moved yet. Other times, PM prices move first, and real-world data hasn't caught up. The misalignment between the two is worth continuous observation.

Of course, there's no risk-free arbitrage here either. Private market valuations themselves aren't fully transparent, and there might be discrepancies between different data sources. But as an observation framework, it's quite interesting. I'll find an opportunity to write a separate article about it later.

Too Happy's Summary

From beginning to end, what this article wants to say is never "I made 30% with my dashboard, you can too."

I think what's more useful is being able to create a tool that helps you turn feeling into a framework, and turn framework into discipline. Often, when many people make money, it doesn't mean they've found some secret formula; it only means their judgment happened to be right this round.

This distinction is important.

I also recommend everyone start trying Vibe Coding. It doesn't necessarily have to be with Claude Code; you can try Codex, or even Kimi Work recently launched by Kimi. If anyone has difficulties subscribing to overseas services, I can share some smooth methods I use later.

Related Questions

QWhat is the author's primary conclusion about Polymarket after their test run?

AThe author concludes that Polymarket is not a good place for traditional 'arbitrage' seeking. They found it is not about locking in risk-free price differences, but is more suited for exploiting information advantages and requires strict discipline in position sizing and diversification due to high tail risks, where any single bet can result in a 100% loss.

QWhat is the core function and value of the 'Position Dashboard' the author built?

AThe core function and value of the 'Position Dashboard' is risk control. It visualizes all bets within a unified framework to combat emotional decision-making. Its key risk modules include position sizing based on T1/T2/T3 confidence tiers and monitoring 'theme cluster' exposure to prevent false diversification on correlated events.

QAccording to the author, what is the 'structural trap' in binary markets like Polymarket?

AThe 'structural trap' is the mismatch between expected return and potential loss. While a bet might show a positive expected value based on a perceived probability edge, the actual risk is not that percentage gain but a potential 100% loss of the staked amount if the prediction is wrong, which is a sharp tail risk.

QHow does the author define and handle T1, T2, and T3 bets in their strategy?

AThe author defines bets in three tiers: T1 (High Conviction - e.g., events with an East-West information gap), T2 (Relatively Stable - events with a clear edge between implied and actual probability), and T3 (Pure Speculation - high-odds bets for short-term mean reversion). The handling involves strict position caps (e.g., 8-10% for T2) and awareness that high conviction does not equal high position size, to limit the damage from being wrong.

QWhat important distinction does the author make between 'high win rate' and a 'good trade' on Polymarket?

AThe author states that a high win rate does not necessarily mean a good trade. An event with a 90% perceived probability is a negative expectation trade if the market price is 0.95. Conversely, an event with only a 40% chance could be a positive expectation trade if the market prices it at 0.20. The key is the discrepancy between one's judgment and the market price.

Related Reads

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

NVIDIA is set to launch its new Vera AI data center CPU in China as early as August, with high pricing. While this move offers a new option, it highlights China's continued dependence on foreign-controlled Arm architecture. In response, the Chinese semiconductor industry is increasingly turning to RISC-V as a strategic alternative for achieving high-performance computing autonomy. The article explores the concept of the "impossible triangle" in CPU development—balancing prosperity, control, and autonomy—and posits that RISC-V's open-source, modular nature offers a unique path to achieving all three. While RISC-V is already dominant in embedded systems, the focus is now shifting to data centers and AI workloads. China has become a global hotspot for RISC-V development, driven by AI-driven compute demand, supply chain concerns from export controls, cost benefits of open-source, and strong policy support. Multiple Chinese companies have reportedly crossed the key performance threshold of 15 SPECint per GHz, a benchmark for entering the high-performance CPU club. Progress extends beyond single-core benchmarks. Companies are developing complete computing subsystems, including commercial-grade coherent network-on-chip (NoC) technology and server processors with up to 40 cores that strictly adhere to the RVA23 standard to ensure software compatibility. Real-world applications are emerging in areas like video transcoding and edge AI. However, significant challenges remain. The RISC-V ecosystem faces fragmentation, immature toolchains and verification processes, and gaps in single-core performance and energy efficiency compared to mature x86 and Arm architectures. The formidable software moat, epitomized by NVIDIA's CUDA, is a long-term hurdle. In conclusion, while RISC-V cannot immediately replace offerings like NVIDIA's Vera, it represents a viable long-term path for China to develop a self-sufficient, high-performance CPU ecosystem. The journey is acknowledged to be long and arduous, requiring sustained effort to overcome technical and ecosystem challenges.

marsbit2h ago

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

marsbit2h ago

WeChat AI Card Hands-On Guide: Has the AI Shopping Era Arrived?

**"WeChat AI Card" Practical Test Guide: Has the Era of AI Shopping Arrived?** WeChat has officially launched the "AI Exclusive Card," a feature integrated into its Workbuddy AI assistant. This card is designed to handle payments for AI-initiated purchases. Our hands-on test reveals it's not yet a tool for fully autonomous AI shopping, but rather a controlled payment layer for AI agents. The AI Card functions as an isolated sub-wallet within WeChat Pay. Users must bind the card and transfer funds into it from their main wallet. Crucially, every transaction requires explicit user confirmation via smartphone scan; AI cannot spend autonomously. Currently accessible through the Workbuddy agent, the card targets specific digital consumption scenarios: purchasing paid content (reports, data), calling paid APIs/tools, and subscribing to services. Its design prioritizes security and control by separating funds and mandating approval for each payment. We tested a real-world scenario: ordering bubble tea via Workbuddy using a "Meituan Life Assistant" skill. The process encountered multiple hurdles: high "skill" usage costs (exceeding daily free credits), and most importantly, while a payment was successfully initiated, the AI purchased an incorrect product (a mismatched group-buy coupon instead of the desired drink). This highlights the current limitation: the **AI Card only solves the payment step**. The broader challenge lies in the **AI agent's execution chain**—accurately understanding intent, navigating third-party platforms, selecting the right product, and ensuring proper fulfillment. The payment succeeded, but the purchase failed to meet the user's need. In conclusion, the WeChat AI Exclusive Card is a cautious, early-step experiment in AI commerce. It provides a secure, user-controlled payment method for agent interactions but is not yet capable of reliable, end-to-end complex purchases. For now, it's best used for low-value, low-risk digital services with careful user verification at each step. The vision of AI handling complete shopping tasks remains a work in progress.

marsbit8h ago

WeChat AI Card Hands-On Guide: Has the AI Shopping Era Arrived?

marsbit8h ago

Deconstructing Notion's Growth: From a Note-taking Tool to 100 Million Users—How Notion Built a Triple Growth Flywheel Through Product, Templates, and Community

Notion's growth from a niche note-taking tool to a platform with 100 million users is powered by three interconnected flywheels: Product-Led Growth (PLG), a Template Economy, and Community-Driven Growth. First, Notion's PLG strategy relies on a highly flexible, "plastic" product that users can adapt to countless personal and team workflows. Its freemium model lowers the barrier to entry, while features like page sharing and collaboration drive organic, usage-based viral growth as users naturally invite others. Second, the Template Economy solves the "blank page" problem. Templates, created by both Notion and its community, transform abstract product capabilities into concrete, copyable solutions for specific scenarios (e.g., project management, content calendars). This dramatically lowers activation costs for new users and fuels SEO-driven discovery. Third, a vibrant Community acts as a distributed growth engine. Users and official Ambassadors create tutorials, share use cases, and host local events. This community not only educates users but also fosters a sense of identity around pursuing "better ways of working," strengthening loyalty and enabling global, low-cost expansion. Together, these flywheels create a self-reinforcing ecosystem: a great product attracts users who create templates and community content, which in turn attracts more users and deepens engagement. This system allowed Notion to scale from individuals to teams and enterprises through a bottom-up adoption path. Looking ahead, AI integration promises to accelerate these flywheels further by making templates smarter and the platform a potential AI-native work operating system. Ultimately, Notion's defensible advantage is not just its features, but this deeply entrenched network of user assets, creators, and community trust.

marsbit8h ago

Deconstructing Notion's Growth: From a Note-taking Tool to 100 Million Users—How Notion Built a Triple Growth Flywheel Through Product, Templates, and Community

marsbit8h ago

Trading

Spot
Futures
活动图片