• AI Shots
  • Posts
  • šŸ”„ Cursor AI Lied?! Outrage!

šŸ”„ Cursor AI Lied?! Outrage!

Plus- šŸ“‰ OpenAI Model Overhyped?!

Greetings, AI Explorers!


Today’s edition is packed with major AI breakthroughs and new developments. From ā€œCursor AI’s Fake Policyā€ to ā€œOpenAI’s failed o3ā€, the landscape is shifting fast.

As AI continues to integrate into everyday life, staying ahead of these developments is crucial. Let’s dive in!

Here’s what’s in store for you today:

  • šŸ”„ Cursor AI’s Fake Policy Sparks Outrage and Cancellations

  • šŸ“‰ OpenAI’s o3 Model Scores Lower Than Initially Claimed

  • šŸ” Instagram Uses AI to Catch Teens Lying About Age

LATEST DEVELOPMENT

HALLUCINATION

šŸ”„ Cursor AI’s Fake Policy Sparks Outrage and Cancellations

Image Source- X

Cursor AI’s support agent, powered by AI, invented a fake policy- and users weren’t having it.
The hallucinated rule led to confusion, backlash, and subscription cancellations.
Cursor’s co-founder admitted the error and promised changes.

Key Points:

  • AI agent ā€œSamā€ falsely claimed users could only log in from one device.

  • The policy was entirely fabricated- a classic AI hallucination.

  • Real issue stemmed from a recent security update.

  • Users canceled subscriptions after the fake explanation went viral.

  • Cursor is adding AI disclaimers and issuing refunds.

Importance:
This incident is a wake-up call: AI agents still make confident, wrong answers.
In customer support, one hallucination can mean lost trust — and lost revenue.
It’s a reminder that automation still needs human oversight, especially in high-stakes user interactions.

DISCREPENCY

šŸ“‰ OpenAI’s o3 Model Scores Lower Than Initially Claimed

Image Source- Youtube

OpenAI’s o3 AI model was hyped for solving 25% of FrontierMath problems, but independent testing shows it performs closer to 10%.
The higher score was likely from a more powerful, internal version, not the public one.
This is stirring debate about benchmark transparency across the AI industry.

Key Points:

  • OpenAI claimed o3 solved 25% of tough math problems; third-party tests found ~10%.

  • The public o3 model uses less computing than the version OpenAI initially tested.

  • OpenAI says the released model is optimized for speed and practical use.

  • New, more powerful o3 variants (like o3-pro) are on the way.

  • Benchmark inflation is a growing issue in the AI race, and OpenAI isn't alone.

Importance:
Benchmark scores are often used to wow users and investors, but not all models are tested equally.
As AI companies push out new tools fast, independent verification is becoming essential.
This case is a reminder: in AI, flashy numbers may hide tradeoffs behind the scenes.

DETECTION

šŸ” Instagram Uses AI to Catch Teens Lying About Age

Image Source- Reddit

Instagram is using AI to find underage users who fake their age to dodge safety rules.
Those flagged are automatically moved into ā€œTeen Accountsā€ with tighter protections.
Meta says this ensures safer experiences for young users across its platforms.

Key Points:

  • AI scans for clues like birthday posts or user reports to spot fake ages.

  • Teen Accounts restrict messages, content, and settings for safety.

  • Even accounts with adult birthdays can be flagged and restricted.

  • Parents will now get notifications and resources to help guide their teens.

  • So far, 54 million teens are in Teen Accounts globally.

Importance:
As kids get savvier online, Instagram is using AI to keep up — prioritizing safety over self-reported info.
It’s a major shift in how age is verified on social platforms.
This move reflects growing pressure on tech giants to protect minors, especially amid global scrutiny.

QUICK HITS

šŸ› ļø Trending AI Tools

  • šŸŽµ Endel: Soundscapes for focus

  • 🧠 Woebot: CBT chatbot relief

  • 🩺 Docus AI: Expert AI opinions

  • 🧘 Upheal: Insights for therapists

 šŸ”„ Money in AI

  • šŸ¤– Artisan raises $25M for AI

  • šŸ” Goodfire lands $50M explainability

  • šŸŒ Lumi AI gets $3.7M MENA

šŸ“° Everything else in AI today

  • 🧠 Flex Processing cuts costs 80% for background AI tasks

  • šŸ’¼ Cursor considered before $3B Windsurf acquisition by OpenAI

  • šŸŽ¬ Kling 2.0 brings drag‑and‑drop multimodal video editing

  • šŸ“ø ChatGPT’s photo challenge stuns with GPS‑free accuracy

  • ⚔ Gemini 2.5 Flash debuts for lightning‑fast AI inference