Gemini 3's METR 50% time horizon

157

1.5kṀ150k

Dec 31

1.6%

<1.5h

11%

1.5h - 2h

30%

2h - 2.5h

28%

2.5h - 3h

19%

3h - 3.5h

3.5h - 4h

4h - 5h

1.3%

5h - 6h

6h - 7h

0.6%

7h - 8h

0.3%

8h - 9h

0.1%

9h - 10h

0.1%

10h - 11h

0.1%

11h - 12h

0.1%

>=12h

This market will resolve to the highest 50% time horizon, as reported by METR, for any Gemini 3 model released within a month of the first Gemini 3 announcement.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Left bounds inclusive, right bounds exclusive.

Time horizon could vary based on the set of tasks used to measure it, so this market will be based on the time horizon for the most comprehensive set of tasks reported by METR (as of 2025, largely software and engineering tasks). This will be ambiguous if METR stops publishing time horizons across all of their autonomy tasks and only publishes separate results for different subsets; I might N/A in that scenario.

People are also trading

Will GPT-5.1 have a longer METR time horizon than Gemini 3?

59% chance

Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?

77% chance

Gemini 3 Pro exits preview by EOY?

18% chance

Best AI time horizon by August 2026, per METR?

Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?

77% chance

Gemini 3 Deep Think available on API in 2025?

16% chance

How many will Gemini 3.0 achieve? [Read description]

2.19

Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?

Sort by:

@Bayesian Hello, why did the market got closed ?

why did it got closed ??? there is no answer wtf

@Amonium bc the close date was set too soon. fixed

bought Ṁ20 YES

@Bayesian Thank you.

Why are wee all in hold ?!

https://x.com/GregHBurnham/status/1993509024097292388?s=20

some useful references here perhaps

How does this resolve if METR doesn't evaluate any Gemini 3 model which is released within a month?

@jim I think the “within a month” thing means any model of Gemini’s released within a month of the first announcement, not METR’s analysis

@bens yes, but it's not guaranteed that any Gemini models which meet this condition will be evaluated by METR.

opened a Ṁ25,000 NO at 1.0% order

@jim i'll bet they will

but if they don't then ~~obviously it would resolve to <1.5h~~ jk it would resolve N/A

oh no they probably won't that's devastating i forgot they waited for general access before testing gemini 2.5 pro

sold Ṁ176 NO

@Bayesian yeah

opened a Ṁ7 YES at 28% order

@Bayesian what do you mean by general access?

@MaxLennartson The currently available model is gemini 3 pro preview. General access is when they remove all modifiers and sctually call the model gemini 3 pro in the api and such

@Bayesian It looks like they are calling it Gemini 3 pro.

Just reaffirming that i think this market is moronic

@jim You are going to lose alot of mana when the results come out

@Blocksterpen3 well, I have orders up so feel free to take my manas

you guys are criminally retarded

gemini 3.0 will make the best METR 50% time horizon go from 2h17min to 5h+, noting that gemini 2.5 pro, which was already very good, had a 50% time horizon of 39 minutes.

i wanna have what you're having jim

@Bayesian what I'm having is faith in METR-style time-horizon extrapolation. 5h is at the high end, but it's not <2% (nor <5% tbc)

@jim So even higher than 5%?

@HenryE yes

@jim Gemini 2.5 is 39 minutes, well below o3.

A straight line is about 80 minutes. You have to also bet that Google won't be staying behind on this metric which it historically has been.

There's another market that gives about 50% it will tie gpt-5, which is consistent with the 3 to 4 months behind theory, albeit with some catch-up relative to Gemini 2.5

People are also trading

Will GPT-5.1 have a longer METR time horizon than Gemini 3?

59% chance

Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?

77% chance

Gemini 3 Pro exits preview by EOY?

18% chance

Best AI time horizon by August 2026, per METR?

Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?

77% chance

Gemini 3 Deep Think available on API in 2025?

16% chance

How many will Gemini 3.0 achieve? [Read description]

2.19

Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?

78% chance

People are also trading

People are also trading

Related questions