Take the quiz based on this newsletter: five lucky participants will get a chance to win a coffee mug!

OpenAI’s Sam Altman revealed that users who add extra words like “please” and “thank you” are unknowingly driving up server costs—by tens of millions of dollars. Every polite phrase increases token usage, boosting energy consumption and compute loads, making even simple chats surprisingly expensive to run.
This unusual revelation shows how human habits, even well-mannered ones, can scale into massive operational and environmental costs in the AI age. As AI becomes a daily tool, balancing friendly interactions with efficiency might become a serious design challenge for the future of responsible tech.
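
To see how a few polite words can add up, here is a rough back-of-the-envelope sketch. Everything in it is an assumption made for illustration: the word count stands in for a real tokenizer, and the chat volume and the price per million tokens are placeholder numbers, not figures from OpenAI.

```python
# Rough illustration only: every number below is a made-up assumption,
# not a figure reported by OpenAI.

def extra_tokens(plain: str, polite: str) -> int:
    """Approximate the added tokens by counting whitespace-separated words."""
    return len(polite.split()) - len(plain.split())


def added_cost(tokens_per_chat: int, chats: int, usd_per_million_tokens: float) -> float:
    """Total extra spend for the added tokens across all chats."""
    return tokens_per_chat * chats * usd_per_million_tokens / 1_000_000


plain = "Summarize this article"
polite = "Please summarize this article, thank you"

added = extra_tokens(plain, polite)  # word count stands in for a real tokenizer
cost = added_cost(added, chats=1_000_000_000, usd_per_million_tokens=1.0)
print(f"{added} extra tokens per chat, about ${cost:,.0f} over a billion chats")
```

The sketch only counts the extra words a user types. Separate messages such as a standalone "thank you" also trigger a full model response, which is where most of the real-world cost would likely come from.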


OpenAI has launched Codex CLI, an open-source AI coding assistant that lives right in your terminal. Designed to understand and modify code in real-time, it supports text and image inputs, and offers multiple working modes—from gentle suggestions to full automation. It’s powered by OpenAI’s lightweight o4-mini model and built to integrate seamlessly into any developer’s local workflow.
Codex CLI brings AI closer to the everyday developer—not as a replacement, but as a powerful sidekick. It empowers coders to build smarter, faster, and more privately. And being open-source, it invites the community to co-create the next era of AI-enhanced software development.

In a massive AI-powered cleanup, Google suspended over 39 million ad accounts in 2024 for fraudulent activity. Using advanced AI models, it flagged suspicious behavior, such as fake businesses and deepfake scams, before it could do real harm. The result? Billions of bad ads and deceptive pages wiped off the web.
This isn’t just about ads—it’s about restoring trust in the internet. As scams get smarter, AI is becoming our frontline defense. Google’s bold move shows how AI can protect users at scale and could set the standard for safer digital ecosystems worldwide.

Simplify Job Search is an AI-powered platform that helps job seekers optimize resumes, assess ATS scores, and get personalized job recommendations, streamlining the path to employment.

AI Efficiency Is Becoming the Real Competitive Edge: With Google exploring approaches like TurboQuant, the focus is clearly shifting from building larger models to making them more efficient. As compute costs rise, the ability to run AI faster and cheaper is becoming a defining factor in long-term scalability.
AI Is Embedding Itself Into Everyday Workflows: Anthropic's integration of Claude into Microsoft Word highlights a major shift: AI is no longer a separate destination. Instead, it is being built directly into the tools people already use, making adoption more seamless and practical.
The Rise of AI Agents Signals a New Phase: OpenAI's push toward autonomous AI agents reflects a broader industry transition. AI is moving beyond responding to prompts and toward executing tasks independently, marking the beginning of more action-oriented systems.
AI Is Transitioning From Assistance to Execution: Across the industry, there is a clear shift from AI as a support tool to AI as an active participant in workflows. Systems are increasingly designed to complete tasks end-to-end, reducing manual effort and redefining productivity.
The AI Race Is Now About Deployment, Not Just Capability: This week reinforces a key trend: building powerful models is no longer enough. The real competition is now about how efficiently AI can be deployed, where it is integrated, and how effectively it can deliver real-world value.

Google Gemma 4 Open Model Release: Google released Gemma 4 under an open Apache 2.0 license, allowing developers to freely use, modify, and deploy the model commercially. The update focuses on efficient performance, strong reasoning capabilities, and support for lightweight deployments — making advanced AI more accessible beyond large-scale infrastructure.
Anthropic Claude AI Microsoft Word Integration: Anthropic introduced a Claude AI integration for Microsoft Word, enabling users to generate, edit, and refine content directly within documents. The feature is designed to streamline writing workflows and reduce friction between AI tools and everyday productivity software.
OpenAI AI Agents Capability Expansion: OpenAI expanded its work on AI agents, focusing on systems that can execute multi-step tasks, interact with tools, and automate workflows. The update signals a move beyond conversational AI toward more action-oriented systems capable of handling complex real-world tasks.
Nvidia AI Inference Optimization Updates: Nvidia introduced new optimizations across its AI inference stack, improving model serving efficiency and reducing compute costs. The update targets large-scale deployment environments, reinforcing Nvidia’s push toward full-stack AI infrastructure.
Perplexity AI Workflow & File Handling Enhancements: Perplexity AI rolled out improvements to its platform, enabling better document handling, multi-step query execution, and workflow-based interactions. The update strengthens its positioning as a productivity-focused AI tool beyond traditional search.