Google’s AI Health Coach Is Hallucinating Workout...

From Fitbit to Google Health: An AI Coach at the Center

Google is folding Fitbit into a broader wellness ecosystem, with the old Fitbit app now transitioning into the new Google Health app for Fitbit and Pixel Watch devices. At the heart of this shift is the Google Health Coach AI, a chatbot-style trainer designed to interpret your activity, sleep, and nutrition data and turn it into personalized guidance. Many familiar Fitbit features survive, but some social tools and playful touches—like sleep “animals”—are gone, while more of the in-depth analytics and coaching features now sit behind a Google Health Premium paywall. The AI coach is positioned as a key reason to subscribe, offering tailored workout plans, mindfulness content, and conversational check-ins. That makes its reliability more than a cosmetic concern: if the assistant misreads your data or injects fictional activity, it undermines the promise of smarter, more personal health tracking.

Google’s AI Health Coach Is Hallucinating Workouts—and Exposing the Limits of Fitness AI

Phantom Runs and Sleep Scores: When AI Hallucination Meets Your Body

Early testers are already catching Google Health Coach AI in some troubling mistakes. In one hands-on report, the coach correctly referenced a user’s previous night’s sleep and a real workout—but then confidently congratulated them on a five-mile run that never happened. When challenged, the AI eventually admitted the fabrication yet suggested the user might simply have failed to record the run, shifting blame rather than clearly acknowledging an error. Another reviewer saw a banner celebrating a sleep score of 99, only to find an actual score of 85 in the detailed stats. These AI hallucination workouts and misreported metrics are more than quirky bugs; they distort a person’s fitness history and can mislead decision-making about rest, training load, or recovery. In a context where users expect objective health tracking accuracy, invented runs and inflated scores are a serious red flag.

Shallow Coaching, Chatbot Stacks, and the Trust Problem

Beyond hallucinated workouts, the quality of coaching itself is under scrutiny. Testers describe the Google Health Coach AI’s guidance as verbose but “pretty shallow,” often defaulting to generic lectures rather than nuanced, context-aware advice. The bot tends to interpret almost any input as a prompt for a canned monologue, which can feel condescending rather than supportive. Its attempts to show sources sometimes backfire, linking to irrelevant pages—including a Reddit thread where the only reply was someone pasting an answer from another chatbot, illustrating how circular and self-referential AI advice can become. Earlier preview versions kept “memories” that users could delete; now, a more conventional chat history seems to keep conversations somewhat more on track, but frustrations remain. For a tool marketed as a premium health companion, these issues erode trust at exactly the moment users are being asked to rely on AI to interpret sensitive, personal data.

Convenience vs. Precision: Where the AI Coach Already Works

Not everything about Google Health Coach AI is broken. Testers report that logging everyday activity through conversation can be genuinely convenient. You can describe meals or upload screenshots from other fitness apps, and the system will estimate calories and macros or parse workout details. When one user logged a specific frozen burrito, the AI’s estimate for calories and protein landed reasonably close to the nutrition label—good enough for rough tracking, if not for clinical precision. The coach can also process photos of written workouts from a gym whiteboard, then ask follow-up questions like how long the session took or whether to adjust serving sizes. Still, it occasionally misses obvious details, such as logging “1 rep” of a sled push instead of the intended 50 yards, or failing to roll zone minutes into a weekly cardio load. These mixed results highlight a tension: the system is already useful for quick, low-stakes logging, but unreliable where accuracy truly matters.

What Google’s Bugs Reveal About AI in Health Tracking

Taken together, the Google Health app bugs point to a deeper challenge: modern AI is powerful at pattern-matching and language generation, but still prone to confabulation when pressed for specifics. In a chat interface, every response is expected to be fluent and confident—even when the underlying data are incomplete or ambiguous—which is exactly how phantom workouts and incorrect sleep scores slip through. For casual wellness advice, users might tolerate some fuzziness, but health tracking accuracy is non-negotiable when people are adjusting training plans, sleep schedules, or stress management strategies. Google appears to be iterating quickly, with hallucinations reportedly less frequent than in the earliest preview builds and conversation context somewhat improved. Still, the current behavior underscores a gap between AI capabilities and real-world expectations for health monitoring. Until that gap narrows, any AI-powered coach should be treated as a helpful assistant, not a definitive authority on your body.

Google’s AI Health Coach Is Hallucinating Workouts—and Exposing the Limits of Fitness AI

From Fitbit to Google Health: An AI Coach at the Center

Phantom Runs and Sleep Scores: When AI Hallucination Meets Your Body

Shallow Coaching, Chatbot Stacks, and the Trust Problem

Convenience vs. Precision: Where the AI Coach Already Works

What Google’s Bugs Reveal About AI in Health Tracking

You May Also Like