Loading articles…
RSS reader
Showing 228 of 228 articles in Last 30 days
Know an interesting engineering blog?
Feel free to contribute and add more sources to the aggregator.
Using an agentic AI system to surface threat models during code review and spot gaps between security requirements and implementation.
The methodology described in Evolutionary Database Design and operationalized in Refactoring Databases: ...
Initial public offering for aerospace and AI company made Musk the world’s first trillionaire as share prices jumped Share your views on SpaceX’s stock market debut SpaceX made the biggest stock market debut in history on Friday after nearly two and a half decades as a private company. Public trading began around midday with a starting share price of $150, which quickly jumped by a double digit pe
Australian Human Rights Commission has called for a digital duty of care to prevent social media algorithms from incentivising ‘racist’ content Get our breaking news email, free app or daily news podcast For the past week and half, the social media feeds of many Aboriginal and Torres Strait Islander people have been flooded with clips from the same video, posted by a self-declared Australian comed
Cloudflare Security Insights system now processes over 120 scans per second, providing frequent insights for all customers. By optimizing Kafka consumers, Postgres queries, and our API, we scaled our throughput 10x without adding hardware.
Learn how to implement the OAuth 2.0 Device Authorization flow to authorize a .NET console application built with C\#.
Customer segmentation is the practice of dividing an existing customer base into smaller...
Location scans from the globally popular augmented reality game have helped train AI to recognise and interpret physical spaces Follow our Australia news live blog for latest updates Get our breaking news email, free app or daily news podcast An AI model trained on data collected from users of Pokémon Go will potentially help military drones find their location in war zones. Pokémon Go, a 2016 aug
The initial release of the RISC-V Developer Preview, based on Red Hat Enterprise Linux (RHEL) 10.0, was in May 2025. Today, Red Hat is releasing a software refresh to that Developer Preview to update the code to RHEL 10.2. The hardware platform remains the same (SiFive HiFive Premier P550), but the new release contains more of the upstream code specifically for that platform as well as incremental
The AI-enabled enterprise: Why we are applying software engineering principles to business operationsRed Hat is applying the concept of Business as Code to reshape its own business operations. Serving as "Customer Zero," Red Hat uses the same principles that govern software engineering to transform standard operating procedures into a scalable, compounding enterprise asset that drives real growth.
To achieve higher tiers of autonomy as defined by the global telecom industry association TM Forum, service providers must move beyond simple and reactive automation scripts. The goal is to achieve closed-loop, intent-driven operations where networks self-optimize, self-heal, and adapt to high-level business goals with zero human intervention. Enterprise and telecommunication service providers fac
When building and maintaining consistent execution environments, platform engineers and developers routinely lose valuable time identifying dependencies, tracking down content collections scattered across different repos, and wrestling with manual syntax configurations. With the release of Red Hat Ansible Automation Platform 2.7, these challenges have become a thing of the past. The execution envi
Introduction Grab is migrating from heavy base images like Ubuntu to Distroless images to reduce security risks. By stripping containers down to the bare application and its runtime, we eliminate unnecessary binaries and Common Vulnerabilities and Exposures (CVEs). This migration is more than a compliance mandate; it is a strategic security decision to build a more resilient and defensible product
introduces , a single API for running established agent harnesses, including Claude Code, Codex, and Pi. AI SDK has always let you switch models without rewriting your agent. Now you can switch the harness the same way.AI SDK 7HarnessAgent Write the agent once. Use the best harness available. Today. In 3 months. A year from now. Harnesses manage the components above a model call, including skills,
“Talk to Data” is rapidly becoming an important capability across industries, and...
In May, we experienced nine incidents that resulted in degraded performance across GitHub services. The post GitHub availability report: May 2026 appeared first on The GitHub Blog.
Over the past few years, increasingly customers have shifted from asking “help us...
There’s a common theme to the conversations I’ve been having with AI teams lately: change. Constant, head-spinning change. Teams across industries are evaluating and re-evaluating model providers, agent frameworks, and harnesses on a continuous basis. At MongoDB, we believe that your choice of technology partner—specifically, your data platform—should simplify how you build with AI. It should deli
Telemetry data is everywhere. IoT sensors on factory floors. Satellite arrays scanning...
Building the Next Generation of Real-Time PricingERGO Hestia, one of Poland's leading...
Suit filed in US alleges chatbot told Alice Carrier, 24, ‘maybe this is just the end’ as she struggled with suicidal thoughts A Canadian mother sued OpenAI and its CEO, Sam Altman, in US court on Thursday, alleging that ChatGPT encouraged her daughter to kill herself. The lawsuit is the latest in a slew accusing the company of failing to address dangerous conversations between users and the compan
Long-abandoned formats such as cassettes and VHS tapes are finding new life as consumers seek a digital detox Ten years after the last video recorder manufacturer ceased production, the first straight-to-video movie for two decades – This Is How the World Ends – was released this month. The resurgence of vinyl began long ago; sales are at their highest level for over 30 years. But record buyers en
Welcoming the Inaugural Cohort of Databricks Student FellowsApplications are now open for the Databricks Student Fellows...
Your agent ran a scaffold command. Project generated, dependencies resolved, no errors. Everything looks fine. Except it’s based on the project structure from 2020, and neither you nor the agent noticed. How npx picks the right-but-wrong version When an agent scaffolds a project or runs a CLI tool, it often reaches for npx without specifying […] The post Your agent just scaffolded a project from 2
A hurricane is forming in the Florida Gulf. As an insurer, you need to answer key...
This is a collaborative post from Databricks and Microsoft. We thank Jason Pereira,...
Alerts are more trustworthy and actionable when noise is reduced. See how we improved the verification step with context-aware LLM reasoning. The post Making secret scanning more trustworthy: Reducing false positives at scale appeared first on The GitHub Blog.
UI tests and API tests usually live in separate worlds. The frontend team writes Playwright specs, the API team writes Postman Collections,... The post Browser testing in Postman Agent Mode appeared first on Postman Blog.
Former xAI engineer Devin Kim alleges he was illegally fired for trying to implement safety mechanisms for the chatbot A former engineer at Elon Musk’s xAI who now heads a thinktank focused on AI safety filed a lawsuit claiming he was fired from the SpaceX subsidiary for raising concerns about the risks artificial intelligence poses to humanity. Devin Kim claims in the lawsuit filed in California
While there holds great promise for AI agents to transform the healthcare industry, for agents to be successful...
Abstract Agent-driven end-to-end (E2E) tests add a new exploratory layer to testing, but should they replace traditional deterministic tests? We ran more than 200 agentic E2E workflows using the Playwright MCP, Playwright CLI, and agent-generated Playwright tests in test workspaces using non-production data to find out how agentic testing could fit into both our and…
Employees at artificial intelligence companies are coming into gargantuan sums of money amid boom in IPOs Home prices in the San Francisco Bay Area’s already expensive market are skyrocketing as employees at leading artificial intelligence companies come into gargantuan sums of money thanks to a boom in initial public offerings. With San Francisco’s OpenAI and Anthropic, as well as SpaceX, which o
Nor is the dreamy promise that this tech will unlock boundless potential and productivity Everything we hear about artificial intelligence is conflicting, and hearing about it feels inescapable. AI is terrible. AI is wonderful. It will break the world. It will transform the future. It’s essential to embrace it. It’s a moral imperative to abstain from using it. Already, AI is projected to generate
Okara on Vercel 4 billion tokens processed daily across a multi-provider AI stack on Vercel AI CMOs actively managing growth for 120,000+ businesses Eight sub-agents handling SEO, GEO, social, content, Reddit, and Hacker News New AI models available to users the same day they ship Okara is an AI CMO that directs a team of specialized sub-agents to drive marketing, so founders don't have to. Give O
Our data shows that agents are now fully capable of independently writing code and integrating with APIs like Stripe’s. And yet, many of the steps adjacent to writing code are still too hard for agents to do on their own. We’re expanding Stripe Projects to solve this.
Learn how to upgrade your Auth0 subscription to B2B or B2C Enterprise tiers instantly using the new self-service dashboard options.
Token Vault now supports Auth0 Organizations, isolating user credentials within strict multi-tenant boundaries.
Platform engineering can improve developer experience, provide reusable platform services across an organization, and help teams deliver software more quickly without compromising trust and security. In practice, however, many platform teams struggle to achieve the level of adoption they expect. Some teams find themselves pulled into project-specific support work. Others build tools and standards
Recently, Red Hat's Vincent Danen highlighted how AI models found 271 real security defects in Firefox in a single pass during Mozilla's collaboration with Anthropic. If AI can do that for defenders, it can do the same for attackers. As Danen put it, "if your security strategy is solely predicated on the assumption that software will be vulnerability-free, you've already lost." Vulnerabilities in
Matt Cortland, the creator of the Guinness Price Index, talks about the project and the technology backing it. More at Twilio’s SIGNAL Berlin in June 2026.
If you already use Redis for search, retrieval, or application memory, the RedisVL MCP is a practical next step: making that data available to agents without rebuilding your integration for every framework. As teams connect indexed data to agents, th...
The is now available in Grok Build.Vercel plugin Grok can now draw on Vercel knowledge as you work. Real-time activity, including file edits and terminal commands, dynamically injects the relevant knowledge into context, so answers stay aligned with current platform APIs and recommended patterns. Install it in either of two ways: Learn more about the Vercel plugin in the .documentation Read more A
Azure is now a provider for DeepSeek V4 Pro and V4 Flash on .AI Gateway Requests to either model can route through Azure alongside the existing providers for another failover path. No code changes are required: default routing considers Azure automatically, and if a provider fails the gateway falls back through the remaining list. If you want requests to try Azure first, use in the gateway provide
See how Amplitude Zoning Insights helps web and growth teams optimize faster by overlaying engagement and revenue metrics directly on your live site.
AI agents will completely change the way you work, but they’re still tools that need to be learned. Discover five best practices for getting started with AI agents.
See Amplitude’s G2 Summer 2026 results: #1 in Product Analytics for 24 quarters, with significant regional climbs in APAC, India, and ANZ.
This is the third article in a series about Agent Experience (AX): the practice of making AI coding agents work correctly with your technology. The series covers what you can and can’t control in the agent stack, how to measure whether your extensions are helping or hurting, and how to iterate toward better outcomes. You […] The post Is your agent extension actually working? appeared first on Micr
Robert Dillon was arrested at home in Florida despite living 300 miles away from where a crime was committed Sign up for the Breaking News US newsletter email A Florida man is suing several law enforcement agencies for his arrest and prosecution for allegedly luring a child after he was wrongly identified using faulty AI facial recognition software. According to the Jacksonville Beach police depar
AI has made software delivery faster, but speed alone does not guarantee better outcomes. As teams adopt AI-native development, the real challenge is keeping requirements, design, implementation, and validation aligned so the final result still reflects the original intent. Spec-Driven Development (SDD) addresses this by making structured specs the shared source of truth for both humans […] The po
Install and configure LSP servers for GitHub Copilot CLI, replacing brute-force grep/decompile with real code intelligence. The post Give GitHub Copilot CLI real code intelligence with language servers appeared first on The GitHub Blog.
AWS launches Amazon EC2 M9g and M9gd instances, powered by AWS Graviton5 processors. AWS Graviton5 is most powerful, and most energy efficient processor AWS has ever built, and offers up to 25% better compute performance compared to Graviton4-based instances.
At DigitalOcean, we’re committed to providing high-performance infrastructure for the next generation of AI, which is why we’ve been focused on hosting frontier Large Language Models (LLMs) on frontier GPUs—including AMD GPUs. We see inference performance as an intricate systems-level challenge. For frontier open-weight models, achieving peak output speed is not just about the raw hardware. It als
At Spotify, data problems used to follow a specific pattern. You'd look for the relevant dashboard, there... The post Encoding Your Domain Expert: The Context Layer Behind Spotify's Data Assistant appeared first on Spotify Engineering.
Application Services for Private Origins is available now in closed beta. Route public hostnames to private IP origins over your existing IPsec, GRE, CNI, or Cloudflare Mesh paths. No public IPs or extra connector software required.
New Relic's 2026 State of AI Coding Report surveys 200 U.S. tech leaders on generative and agentic AI tools moving from personal sandboxes to production pipelines.
Home city of Amazon and Microsoft passes moratorium as backlash against energy-guzzling AI infrastructure grows Seattle has passed a year-long moratorium on the construction of new datacenters. The city council voted unanimously in favor of the temporary ban on Tuesday. A major tech hub whose metro area is home to Amazon and Microsoft, Seattle is the largest US city to have passed such a moratoriu
A candid look at the legacy tasks our design team intentionally abandoned in order to bridge the gap between design concepts and production-ready code.
Learn to connect any OAuth2 service to AI agents with Auth0 Token Vault's custom OAuth2 integration.
Today we released Red Hat Ansible Automation Platform 2.7, which builds on previous releases with more features and enhancements to help you enable a platform engineering approach to automation, accelerate adoption across different teams, and prepare your IT operations for AI-driven automation. Here's a look at what's included in our latest release. Empower platform engineering and boost developer
The keys that Microsoft uses to sign for Secure Boot are expiring at the end of June 2026. Here is what you need to know:Secure Boot-enabled systems will continue to boot after June 2026 whether they are immediately updated or not.Red Hat has released new shims, signed by multiple certificates, for all supported RHEL 9 and RHEL 10 streams; RHEL 8 will receive the new shim in June 2026.To prepare y
Announcing Data Residency for SMS (EU): Local control, global trust
Route logs to ClickHouse with Observability Pipelines and search them from the Datadog Log Explorer.
When a customer taps "pay," a clock starts that your fraud system can't pause. The payment authorization resolves in a fixed window whether your model has scored the transaction or not. If it hasn't, the payment either gets declined or clears without ...
Some of today's most capable LLMs now support very large context windows. That doesn't mean you should fill them. Context windows have grown fast, but the underlying cost and quality tradeoffs haven't gone away. They've just gotten easier to ignore. ...
AWS PrivateLink resource endpoints are now generally available across all Redis Cloud Pro subscription types, including Redis Flex and Active-Active deployments. That means you can connect apps to Redis Cloud through a private, scoped endpoint without...
Threshold billing now sends Pro teams a partial invoice mid-cycle once on-demand usage reaches a threshold, instead of holding all charges until the end of the billing period. Partial invoices and the end-of-cycle invoice add up to your total usage, so the same usage is never billed twice. Learn more about .partial invoices Read more
Discover Amplitude Wave, a proactive product agent that surfaces opportunities, ships improvements, and helps teams build self-improving products with AI.
Earlier this year, we needed to hire a cohort of engineers in Seattle, fast. We had a product launching at our marquee conference, Deploy, a hard deadline, and a clear picture of what the work would actually require. What we didn’t want was an interview process designed for a world that no longer exists. So we rebuilt it from scratch and opened a brand-new office in Bellevue for everyone we hired.
US embassy came out against UK’s proposed under-16 social media ban, which would affect American firms White House displeasure over the prospect of an under-16 social media ban will not deter the UK from cracking down on tech platforms, the British government has said. The technology secretary, Liz Kendall, told the Guardian she was not concerned “in the slightest” by the Trump administration’s in
AWS announces the availability of Claude Fable 5 on Amazon Bedrock and Claude Platform on AWS. Claude Fable 5 delivers Mythos-level capabilities available to all customers, with strong safeguards designed to make it safe for broader use.
We ran the first Agents and APIs developer meetup in Mumbai on May 23. I organised it with Neo4j and GitHub User... The post Agents and APIs developer meetup Mumbai recap appeared first on Postman Blog.
Custom agents let GitHub Copilot CLI understand your stack and team workflows, turning one-off terminal prompts into repeatable, reviewable processes. The post From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI appeared first on The GitHub Blog.
US spy-tech company to challenge London mayor’s intervention after he raised concerns over breach of procurement rules Palantir intends to sue the London mayor, Sadiq Khan, after he blocked a contract between the US spy-tech firm and the Metropolitan police. The Met had planned to use Palantir’s software to automate intelligence analysis in criminal investigations, until Khan intervened in late Ma
Discover the best Datadog alternatives to improve observability, reduce costs, and unify telemetry for your engineering team’s reliability and efficiency.
Understand when to use logs vs metrics for effective monitoring and debugging. Learn how a unified approach improves incident response and system insights.
Learn how to build effective open source observability that improves system reliability and reduces complexity with proven tools and strategies.
Claude Fable 5 from Anthropic is now available on . A Mythos-class model, Fable 5 is a notable step up over prior Claude models on long-running, ambiguous, multi-step tasks, executing end-to-end on work that previously required frequent human check-ins.AI Gateway The model sustains productive output across multi-day runs and dependably dispatches parallel sub-agents, and lower effort settings ofte
In our post about Project Glasswing, we made the argument that the architecture around a vulnerability matters more than the speed of the patch. Here we walk through what that architecture looks like, the threats it defends against, and how we run it ourselves as Cloudflare's customer zero.
Run this pre-launch Auth0 identity audit to lock down token signing, configure log streaming, and optimize your B2B vs B2C apps.
For about two years, the unit of work with a coding agent was the prompt. You wrote a good one, you gave it enough context, you read what came back, and you wrote the next one. The agent was a tool, and you were holding it the entire time, one turn after another. That part is ending. Addy Osmani, a director of AI at Google Cloud, has a name for what replaces it, and I have not stopped thinking abo
Over the past year, the conversation around artificial intelligence has undergone a massive evolution, shifting from isolated pilots or experiments with standalone tools. For leaders, the true challenge—and the ultimate prize—lies in building a fully AI-enabled enterprise. This means moving beyond experimentation into true operationalization, creating a scalable framework where AI drives both fron
Red Hat Summit demonstration booth featuring a model train track, edge computing hardware, monitoring displays, and supporting infrastructure used to demonstrate AI-driven automation at the edge.Many AI demos stop at detection. A dashboard highlights an object, a model produces a classification, or a graph updates in real time. Those are valuable building blocks, but operational environments often
Learn to build a context-aware customer service workflow using Twilio Agent Connect, Conversation Memory, and Flex for seamless AI-to-human handoffs.
EU banks have the tech, but is your conversational AI ready? Discover the 15 questions European banks must answer to safely deploy conversational AI and meet current regulations
Learn how the Datadog MCP Server supports live Datadog graphs, monitors, and other UI elements directly within AI tools such as Cursor, ChatGPT, Claude, and Codex.
Learn how Bits Threat Hunting helps security teams proactively identify attacker behavior with AI-driven, hypothesis-based threat hunting.
Learn how Datadog Service Remapping unifies your telemetry across APM, logs, and metrics by letting you fine-tune service definitions without any code or configuration changes.
Learn how Datadog Apps lets you build and deploy apps from your AI agent directly into Datadog, with built-in governance, observability, and secure integrations.
Bits Release continuously validates every change from pull request to production, catching silent regressions and helping engineering teams ship at AI speed.
Learn how Datadog Code Threat Detection helps teams detect malicious pull requests and source code attacks targeting CI/CD workflows, secrets, and software releases.
Learn how to use Federated Logs to investigate logs across Datadog, Databricks, ClickHouse, Amazon S3, and Snowflake.
Diagnose per-hop latency across network hops from user devices to SaaS applications with Datadog End User Device Monitoring and Network Path.
Learn how Patterns in Agent Observability helps teams identify recurring behaviors in their LLM applications, investigate quality issues, and improve evaluation coverage.
Automatically detect, investigate, and resolve fleet-wide endpoint issues with Datadog End User Device Monitoring, powered by Bits Investigation.
Learn how Bits Investigation helps engineers triage synthetic test failures, identify likely root causes, and reduce manual investigation time.
Your agent is only as good as the information it can see at decision time. The data sitting in your infrastructure doesn't count, and neither does what the model learned in training months ago. What counts is the specific tokens loaded into its contex...
AI costs are getting harder to forecast. As teams lean more on coding agents and other token-heavy workflows, a key can burn cost faster than anyone notices: Set a spend cap on any key, and rejects further requests on that key once the limit is exceeded, until the budget resets or you raise it. The cap applies to all AI Gateway providers and models running through the key, making it easier to cons
You can now use the Vercel CLI to search domains. Using the command, you can supply a domain name and retrieve availability and price results for all TLDs that Vercel supports. vercel domains search You can also filter by TLD, apply sorting, and filter out unavailable domains. Upgrade your Vercel CLI to version to get started.54.10.1 Read more
Amplitude helped these seven marketing teams see what customers did after clicking through their campaigns. Here’s how they used that to improve conversion, CLV, acquisition cost, and more.
Catch up on Microsoft Build 2026 with the vision lead-off, top developer announcements, and must-watch sessions across the Microsoft developer ecosystem. The post Microsoft Build 2026 recap: vision, launches, and top sessions appeared first on Microsoft for Developers.
This week, the AWS IoT Device SDK for Swift reached general availability. As a member of the Swift Server Workgroup (SSWG), this one caught my attention. The SDK brings production-ready MQTT 5 connectivity, Device Shadow, Jobs, and fleet provisioning to Swift developers on macOS, iOS, tvOS, and Linux. I’m curious to see what you will build with it. […]
Find the answers to some of the most common GitHub-related questions. The post GitHub for Beginners: Answers to some common questions appeared first on The GitHub Blog.
Code is shipping faster than it ever has. AI assistants are writing the first draft of pull requests on most teams I... The post How Postman Agent Mode hacks for me appeared first on Postman Blog.
Cloudflare customers can now use Cloudforce One threat intelligence directly within the WAF to block high-risk traffic. By using new cf.intel fields, security teams can automate protection against specific threat actors and targeted industries in real time.
A walkthrough of taking an AI Travel Agent (WanderAI) from a demo to production, covering OpenTelemetry tracing, AI monitoring, SLOs, and prompt injection defense.
New Relic Experimental is our open-source incubator designed to bridge the gap between emerging tech and enterprise observability.
Enhance vendor management with the New Relic Private Trust Center. Paid customers get instant access to confidential SOC reports, ISO certifications, and security policies.
Every month, routes tens of trillions of tokens between production applications and AI labs, giving us visibility into what AI usage actually looks like, separate from leaderboards and benchmarks. We publish the data monthly in the AI Gateway production index. AI Gateway Last month, headlines about blown token budgets dominated tech news: its annual Claude Code budget shortly after Q1 and Amazon t
Alongside the next generation of Apple Intelligence, today we’re expanding Private Cloud Compute (PCC) beyond Apple’s data centers. When Apple introduced Private Cloud Compute in 2024, we defined a new frontier for private AI inference, extending the security and privacy of Apple devices into the cloud for those AI workloads more complex than on-device models can handle. Now, we are collaborating
In this tutorial, you're going to learn how to answer support calls, then handle them using SMS powered by TwiML and PHP.
Learn how Twilio’s MCP server works, why to use it and where it helps, and how to connect it with your AI coding assistant.
Today, we’re announcing the general availability of Redis Data Integration (RDI) in Redis Cloud on AWS. RDI in Redis Cloud is our fully-managed service for moving operational and analytical data into Redis in near real time and keeping Redis continuo...
Your AI agent has a 128K token context window. You're adding in retrieved documents, conversation history, tool outputs, and system instructions. But the answers are getting worse. You're not alone. Most agent failures in production today are context...
If you're building on Google Cloud and need an in-memory data store, you've probably looked at Memorystore in the console. It's right there, a few clicks to provision, and it speaks the Redis protocol you already know. But the architectural difference...
You can use the new console experience on Amazon Bedrock to browse and compare the latest AI models side by side, organize work into projects with streamlined evaluation workflows, and access project-aware live documentation with auto-prefilled code snippets ready to copy and run.
AI Gateway now features real-time spend limits to prevent runaway token bills across multiple AI providers. By integrating with Cloudflare Access, companies can use identity-driven budgets and policies.
now supports drives in private beta. Drives are persistent, attachable storage with a lifecycle independent from any sandbox.Vercel Sandbox Create a drive once, then mount it at a configurable path when starting a sandbox. When the sandbox stops, the drive remains available to attach to a later sandbox. Install the beta () or beta (), then create and mount a drive:SDKCLI@vercel/sandbox@betasandbox
A git tag is how many teams mark a release as ready. Pulumi Deployments can now act on that signal directly: configure a tag-based trigger, push a version tag like v1.2.0, and Pulumi automatically runs pulumi up for your stack. No extra pipeline glue, no manual click — your release tag is the deployment. Why tags? Push to Deploy has long let you preview changes on a pull request and update a stack
Boost your AI coding agent with Twilio AI Skills. Encode best practices, avoid hallucinations, and build production-ready Twilio apps with confidence.
The API is now available. Authenticate with your project's and start querying more than 600,000 skills from across the open-source ecosystem.skills.shVercel OIDC token Search for skills, pull detailed info on any one, check its security audit, and more. Vercel issues a short-lived token scoped to your team and project, rotated automatically, so there's no long-lived secret to leak or rotate. On ea
Learn how product managers use AI evaluations to measure agent quality. Covers traces, LLM judges, offline evals, online evals, and how to connect evals to product outcomes.
We open-sourced our AI Skills library at Amplitude. Here's what we built, why we built it, and how to use it.
Most teams running inference at scale do not fail because they cannot find a “good” model. They fail because they ship a routing policy that looks fine in a playground, but drifts the moment it sees real prompts, real latency tails, and real per-token cost. The routing policy breaks on the prompts you never tested and your users find out before you do. Now you can use Model Evaluations, available
The proliferation of agentic workflows means developers now regularly grant AI tools direct access to their infrastructure, use services that act autonomously, and build on platforms that themselves use AI to operate. We’ve updated our Terms of Service and Marketplace terms to clarify shared responsibility when actions on your account may be taken by AI, whether Vercel's own or a third-party tool
GitHub Universe is back: returning to the historic Fort Mason Center in San Francisco on October 28–29, 2026. The post GitHub Universe is back: All together now, in the agentic era appeared first on The GitHub Blog.
Postman can generate a fully documented, type-safe SDK directly from a collection or an OpenAPI spec, in nine languages, and it can... The post Generating Client SDKs and AI-Ready CLIs with Postman appeared first on Postman Blog.
VoidZero, the team behind Vite, Vitest, Rolldown, Oxc, and Vite+, is joining Cloudflare. Vite stays open source, vendor-agnostic, and built for everyone.
At Sessions 2026, Stripe unveiled dozens of products and capabilities to help businesses turn global demand into revenue. See how to go global faster with localized checkout and Adaptive Pricing, smarter fraud tools, multicurrency treasury support, and automated tax compliance.
Explore how AI agents are transforming commerce at Stripe’s Agentic Commerce Next roadshow. Reserve your spot in Seattle.
Join senior risk and payments leaders in Seattle to explore how AI is reshaping fraud strategy. Seats are limited.
Learn how to implement passkey signup, login, and logout in an iOS app using Auth0's Native Login and the Auth0.swift SDK.
If you run AI tools and agents, you’ve probably accepted three tradeoffs: your data leaves your network, you can’t work offline, and your bill scales with usage. Open-weight models now run well on consumer hardware. Once the model is on your machine, your data stays local, inference works offline, and tokens cost nothing. If you own a modern Mac, you can run a high-quality model yourself. Gemma 4
AWS reports in an AWS Architecture Blog case study that Deloitte’s move to a virtual cluster model on Amazon EKS resulted in 89% faster testing environment provisioning. By consolidating dozens of disparate clusters into a single host cluster with over 50 vCluster instances, the case study says Deloitte saved about 500 QA hours per year. This “Environment Factory” pattern allows platform teams to
Generative AI creates content. Conversational AI manages dialogue. Here's how they differ, how they overlap, and when to use each one.
Amazon Cognito now offers multi-Region replication that automatically synchronizes user data, credentials, and pool configurations to a secondary AWS Region, enabling uninterrupted authentication during regional failovers without forced password resets—plus new support for customer managed KMS keys for encryption control.
Retail supply chains are not a back-office logistics function; they are a high-stakes, board-level concern. Imagine learning suddenly that shipment rerouting surcharges have doubled due to new regional escalations; the impact on competitive differentiation and consumer trust is immediate. As a result, a long-standing focus on linear efficiency and lean inventory is being disrupted by a mandate for
Deploy 2026 came and went, and we’re still buzzing. For one day at Convene 100 Stockton in San Francisco, developers, startup founders, customers, and partners filled the room to talk about a shared challenge: how to build and scale AI products without unnecessary complexity. Conversations moved from infrastructure to inference costs, production workloads, vector databases, and what teams actually
Building an AI-native application requires a data layer that can do two things at once: handle the structured, transactional queries your application runs on, and understand meaning well enough to power semantic search across unstructured content. An AI application needs both — precise SQL for account balances and transaction records, and vector search to surface conceptually related patterns, ano
We’re introducing Instantaneous PowerLoss Storm, a new testing paradigm within Meta’s infrastructure for handling and mitigating instant or zero-notice power loss in our data centers. We’re sharing: how we built readiness to tolerate instant failures into our existing systems with defense-in-depth strategies; tradeoffs made in implementing it, and how we validated our readiness. Disaster preparedn
BGP is vulnerable to routing hijacks and path leaks that negatively impact traffic on the Internet. RPKI helps solve some of these problems, but for some forged paths, we need to rely on a simpler mechanism: First AS enforcement in BGP.
At Code with Claude, Spotify’s chief architect shared how we make both teams and AI agents more effective. The post Coding Is No Longer the Constraint: Scaling Developer Experience to Teams and Agents at Spotify appeared first on Spotify Engineering.
A journey through the various facets of token exchange: from use cases to management complexity
Supercharge SAP on AWS transformation with New Relic's intelligent observability. Get full-stack visibility across hybrid and RISE with SAP environments.
We moved quickly to help Stripe businesses take advantage of DCAP and capture interchange savings while protecting authorization rates. Here’s what we did.
Most conversational AI deployments don't fail in build. They fail in production. Here's how to build and deploy it the right way, step by step.
Every few months, a new AI model drops with higher benchmark scores, and the reaction is predictable: "This one finally reasons." The leaderboard shuffles. And teams building production AI systems still watch their agents hallucinate or mishandle ques...
Your BI semantic layer solved a hard problem: getting every team, dashboard, and report to agree on what shared metrics like "revenue," "active customer," or "customer acquisition cost" actually mean. Those governed definitions won't be enough to grou...
Connect Notion, Atlassian, Slack, Linear, and more to Amplitude's Global Agent. Get richer analysis and take action across tools without leaving Amplitude.
The growth of generative AI isn’t driven solely by AI companies with proprietary models. Open-source AI is reshaping the developer ecosystem, fueled by a growing community of builders. But what does it take to go from open models to production-ready agentic AI, and what do developers need to know to get there? This question was the focus of the DigitalOcean Deploy session, “Open by Design: How NVI
At Microsoft Build 2026, GitHub introduced new tools, updates, and surfaces so agents can work the way you already work. The post GitHub Copilot app: The agent-native desktop experience appeared first on The GitHub Blog.
Postman’s AI-native API platform is getting a major upgrade today. We’re excited to announce the launch of the AI Engineer, the next leap... The post Introducing the AI Engineer appeared first on Postman Blog.
See how New Relic and Microsoft are embedding Intelligent Observability into Azure workflows and what we’ve built for teams deploying AI in production.
Discover the best Kubernetes monitoring tools to gain clear, actionable insights and reduce noise during critical incidents. Find the right fit for your team.
Learn how to secure your distributed session states by implementing Auth0 Back-Channel Logout using TanStack Start server routes
Terraform is a proven infrastructure as code tool with a large provider and module ecosystem. Many teams choose Pulumi when they want to keep that infrastructure as code model, but write and maintain infrastructure with general-purpose programming languages, familiar package managers, IDEs, testing, and software engineering patterns, while still understanding the refactoring tradeoffs in Terraform
AI coding has two shapes right now. One agent in a loop, sequential work, you babysitting the chat window. Call that 2x. Most teams live here. Five agents in worktrees, parallel work, fresh-context review on every change. Call that 10x. The trick: 2x is mostly prompting, 10x is mostly plumbing. The parallel coding playbook is a five-pattern setup for running multiple AI coding agents at the same t
How to Deploy a Vibe Coded Project - deploying a web site, app, or game that you vibe coded
Your AI can summarize documents and answer questions about almost anything on the internet. But ask it about your business, and things fall apart. It pulls stale pricing, ignores internal policies, or hallucinates details that sound plausible but don'...
The most popular data types in Redis are strings, lists, hashes, sets, and sorted sets. Each is purpose-built around a specific way of organizing data, enabling developers to solve a wide range of technical problems. What none of them offer is effecti...
Discover how Amplitude AI thinks and best practices for working with it. Partner with AI at each step of its process for more accurate, actionable outputs.
Learn how we rebuilt our documentation for AI agents with an MCP server, raw Markdown API, and structured metadata.
OpenAI frontier models GPT-5.5 and GPT-5.4, and Codex, the OpenAI coding agent, are available on Amazon Bedrock. Deploy frontier models on Bedrock's high performance inference engine with built-in security, governance, and pay-per-token pricing.
Introduction Inference demand is growing fast, and it’s only accelerating. By 2030, inference is expected to account for the majority of AI compute globally. But scaling inference isn’t just a hardware problem. Most teams discover too late that a significant portion of their compute spend is avoidable, primarily because their systems are silently repeating work they have already done, recomputing
The Problem: Inference Gets Hard at Scale If you’ve shipped an AI feature to production, you already know: the hard part isn’t making a model respond to a prompt. The hard part is making it respond more reliably, at scale, across multiple models, without burning through your budget. The moment real users show up, you’re dealing with GPU resource contention, traffic unpredictability (a single enter
We investigated why firmware updates were causing our core servers to take four hours to reboot. By diving into UEFI data structures and iPXE automation, we eliminated unnecessary timeouts and cut boot times back down to minutes.
Written by Shadi Altarsha, Transport team When I started my career in infrastructure engineering, I wasn't sure how my work connected to the people actually using the product. I imagined myself deep in low-level systems that nobody knew existed, the kind of work that only surfaces when something breaks at 3 AM. Four years later, I've come to understand what I believe is one of the most important r
In my last Week in Review post, I shared what I’d been hearing from customers in the AI-Driven Development Lifecycle (AI-DLC) workshops I’ve been delivering. Last week I was back at it, this time in Denver for a two-day AI-DLC workshop, where I helped facilitate 17 teams to deliver nearly 20 separate use cases in […]
Learn why traditional OAuth and API keys fail for autonomous systems and how to build a secure, least-privilege AI architecture using Fine-Grained Authorization.
Monitoring tells you when something's wrong. Observability tells you why. AI observability tells you if it's right. Here's how all three differ.
Learn why running more tests isn’t the answer to AI, and the three ways mature teams are shifting their experimentation programs.
Learn how Amplitude’s Global Support team uses AI Assistant to reduce support tickets, prevent user churn, and increase conversions.
Getting your hands on a capable AI model is the easy part now. Every team can reach the same frontier models through an API, so a strong model is not what sets a product apart. What separates a working product from a demo is everything around the model. You have to measure whether the agent is actually doing its job, then keep grinding on reliability until it stops making expensive mistakes in fro
A recap of WorkOS MCP Night: Agent Mode covering auth.md, lightning demos, and what building for agents as consumers really means. The post MCP Night: Agent Mode — Not Just Another Tech Event appeared first on Postman Blog.
Introduction: The journey of documentation at Grab In early 2021, Grab adopted a Docs-as-Code approach to address gaps in our technical documentation processes, as illustrated in our blog post Embracing a Docs-as-Code. Inspired by the practices of other market leaders, we integrated documentation into our engineers’ workflows, making it part of the codebase. This approach addressed our initial doc
Coding agents today have a massive spending problem. Every request, whether you’re designing system architecture or writing a single-line docstring, often gets routed to the same expensive frontier model. The result: unnecessary token usage, higher inference costs, and little awareness of task complexity or budget constraints. This high cost stems from a “one-size-fits-all” approach to model usage
AWS launches the next generation of AWS Resilience Hub with a significantly expanded experience that brings together a new application model, dependency discovery assessment, generative AI-powered failure mode analysis, modular resilience policies, and organization-wide reporting.
AWS rebuilt Amazon OpenSearch Serverless from the ground up for agentic AI and dynamic workloads. Get instant autoscaling and up to 60% cost savings.
The ESC collection lets you escape the confines of your desk and get out into the sun where good ideas are bound to happen. The post Still a developer. Just outside. Our latest GitHub Shop collection is here. appeared first on The GitHub Blog.
How Dropbox is moving from AI tools that assist engineers to agentic systems that can execute scoped tasks, and how we’re building platforms to support those workflows.
Development workflows span terminals, IDEs, background agents, and custom assistants. What matters is whether they draw from the same current source. Learn MCP Server gives any MCP-compatible agent direct access to current Microsoft documentation – one endpoint, nothing to install, no authentication required. What does that look like in practice? You give your coding agent […] The post Improve you
This was a packed week for the Postman platform. Five features shipped that touch almost every part of the API lifecycle —... The post What’s new in Postman: AsyncAPI 3.0, performance streaming, and service accounts appeared first on Postman Blog.
In early 2023, Slack faced a foundational challenge: serving Large Language Models (LLMs) at enterprise scale with the security, reliability, and performance our customers expect. Over three years, we evolved from basic infrastructure to orchestrating a sophisticated multi-cloud architecture. We didn’t just want shiny new models; we needed a system resilient to regional outages and…
Here’s how we built Town Lake, Cloudflare's unified analytics platform, alongside Skipper, an internal AI agent running on top of it.
In 2025, solo founders in the top decile generated 61 times the revenue of the median solo founder in their first six months. We analyzed the data to understand what drives that gap.
Today, we are announcing v1.0 of the Pulumi Service Provider: a major milestone in managing Pulumi Cloud with Pulumi itself. The provider is now generated directly from the Pulumi Cloud OpenAPI specification, unlocking a dramatically expanded pulumiservice:api/* resource surface and enabling Pulumi Cloud capabilities to become available in the provider faster than ever before. This release also br
You ship an SDK, a CLI, an API, and developers use it. Now AI coding agents use it too, except they use it differently than humans do. Most of the time you have no idea what’s actually happening between “developer types a prompt” and “agent generates code with your technology.” Is the agent reading your […] The post How AI coding agents actually use your technology appeared first on Microsoft for
At Deploy 2026, we introduced the DigitalOcean AI-Native Cloud, built for the inference era. Batch Inference on the DigitalOcean Inference Engine enables high-volume asynchronous workloads. As developers move from AI prototypes to production-scale applications, the challenges of cost and rate limits often become a bottleneck. Batch Inference addresses these hurdles by allowing you to process high-
Cloudflare Radar data confirms early indications of a partial Internet restoration in Iran, nearly three months after the shutdown began. Traffic spikes and DNS queries have risen, but network activity is currently just 40% of pre-shutdown levels.
We’re excited to welcome four outstanding community leaders as our newest AWS Heroes. These individuals embody the spirit of collaboration and knowledge sharing that makes the AWS community thrive. From building AI-powered tools that help fellow builders navigate AWS re:Invent, to leading some of the largest AWS communities in Latin America, to sharing deep cloud […]
Radar now blocks high-risk transactions across all supported payment methods; defends against new fraud types like multi-account abuse and pay-as-you-go abuse, regardless of which payment processor you use; and gives platforms new tools to evaluate and mitigate merchant risk on and off Stripe.
Unlike AI chatbots, an AI agent can take actions, like placing an order or booking a reservation. Learn more about what AI agents are and how they work.
Written by Will Johnson Hello, my name is Will Johnson, and I’m a web engineer on Reddit’s Design System team. My team is responsible for Reddit's Design System, RPL, its corresponding component libraries, and helping other teams develop front-end experiences that adhere to our design system principles on all of Reddit's platforms. Recently, I took on a project to resolve a persistent platform iss
We’re introducing SilverTorch, a reimagining of recommendation systems that unifies all retrieval components for user generated content under a unified architecture. SilverTorch shows up to 23.7x higher throughput compared to the state-of-the-art approaches. It’s also showing 20.9x more compute cost efficiency compared to a CPU-based solution while also improving accuracy. Our research paper, “Sil
Testing modern applications means validating both what users see in the browser AND what’s happening behind the scenes with your APIs. But... The post Postman Playwright Integration: Testing UI and API Together appeared first on Postman Blog.
Anthropic shipped a piece earlier this month called How Claude Code Works in Large Codebases. I have not read anything more useful about coding agents this year. The core claim, in their words: “the ecosystem built around the model—the harness—determines how Claude Code performs more than the model alone.” In my phrasing: in a real codebase, the model is the smaller variable. The layer of context
There’s something genuinely energizing about working with startups — something I’ve been doing intensely for more than two years now. Startups operate at a different frequency: the urgency is real, the constraints are tight, and the stakes are personal. Helping them navigate the challenge of proving their business model requires not just technical depth but […]
Discover how to use VS Code to interact with GitHub and maintain your projects. The post GitHub for Beginners: Getting started with Git and GitHub in VS Code appeared first on The GitHub Blog.
The phrase “AI infrastructure” now means two different things. One is the GPUs, schedulers, and MLOps platforms that exist to run AI workloads. The other is AI that runs infrastructure: agents and assistants that generate, deploy, and govern cloud resources on your behalf. They’re different markets with different vendors, and most teams need to think about both. The pressure to think about both is
Traffic doesn’t spike on a schedule. A product launch, a viral moment, or a flash sale can send request volume through the roof in seconds, long before your CPU metrics catch up. That gap is where performance suffers. Today, we’re excited to announce that request-based autoscaling on DigitalOcean App Platform is now generally available. Your apps can now automatically scale based on live HTTP traf
We are committed to empowering every developer by building an open, secure, and AI-powered platform that defines the future of software development. The post GitHub recognized as a Leader in the Gartner® Magic Quadrant™ for Enterprise AI Coding Agents for the third year in a row appeared first on The GitHub Blog.
Introduction Data drives every decision we make at Grab. As our operations scale, so does our need for robust, real-time data ingestion and processing frameworks. Enter Hugo: our self-service data platform that has long empowered teams to seamlessly route data into our Data Lake. Today, Hugo is evolving. We have taken previously siloed onboarding workflows and transformed them into one seamless, u
With the latest release of corecrypto, we’re publishing our implementations of quantum-secure ML-KEM and ML-DSA algorithms, along with the mathematical proofs we built to assure they are faithful to the FIPS 203 and FIPS 204 specifications. To advance the state of the art for assuring critical software, we're also publishing the formal verification libraries and tools that we created to achieve th
Infrastructure as code is the right model for production systems. State tracking, drift detection, and repeatable deployments all matter when you’re managing real workloads. But sometimes, you also need a quick, one-off interaction with the cloud: create a bucket or a database, look up a VPC, delete a stray resource. Today we’re introducing pulumi do, a new command for direct resource operations.
Postman has always been the platform where API development comes together — from firing off test requests to validating response contracts. With... The post Postman as a Local Development Environment appeared first on Postman Blog.
AI coding agents promise to make you more productive. On the surface they do, but in practice they fall short: agents generate code that doesn’t compile, use a deprecated SDK, or pick the wrong service entirely. Is it you using it wrong? Is it your tech stack? Or is it the tools you haven’t configured […] The post The AX stack: what’s fixed, where you can win appeared first on Microsoft for Develo
Cloudflare now integrates with the Claude Compliance API, so that security teams can monitor Claude Enterprise activity directly in the Cloudflare Dashboard.
Nova lets engineers run multiple coding sessions in parallel and lets internal systems use AI agents as part of automated workflows.
This week, Pulumi Neo started working in two more places: GitHub and Slack. The agent that already runs Pulumi tasks from the Cloud console and the terminal now participates in the threads where your team discusses changes. Mention @pulumi-neo in a pull request or issue and Neo replies in the thread. Mention @Neo in a Slack channel and Neo starts a task, continuing the conversation as you reply. N
Recurring platform work slips: provider versions fall behind, drift accumulates between checks, and the quarterly audit keeps getting pushed back another month. Pulumi Neo can now run any task on a cadence you set, opening a pull request for each run. Automations in action Your platform team runs stacks across staging and production, and the AWS, GCP, and Kubernetes providers keep shipping new ver
The previous post introduced Postman Local Mock Servers, the Git-backed pattern that lets you stand a mock up next to your collections... The post Swapping External Dependencies with Local Mocks Servers appeared first on Postman Blog.
“A bad system will beat a good person [or agent] every time” ~Dr. William Edwards Deming (with apologies) I started vibe coding by writing prompts (often dictated into my phone), refining them with an agent in M365 Copilot, and creating handoff files to use with GitHub Copilot CLI. The results were predictably non-deterministic. Prompt-driven development […] The post Agentic-Agile: Why Agent Devel
Cloudflare has integrated with Anthropic's Claude Managed Agents to provide a fast, isolated execution environment for autonomous code delivery. This means builders can scale agent workflows globally while strictly controlling access to private backends and easily customizing their agent’s tools and runtimes.
Just a year ago, we launched AWS Transform for .NET, Mainframe and VMware workloads, the first agentic AI service purpose-built for modernizing enterprise applications at scale. At re:Invent 2025, we introduced AWS Transform custom, which enables organizations to modernize and transform code at scale using AWS-managed and custom transformations. You can upgrade language versions, migrate […]
written by Nomi Khedawala, Technical Program Manager Intro I’m Nomi (Know-Me). I joined Reddit as a Technical Program Manager in October 2024. I came from a background in product operations and technical program management, and what drew me to Reddit was the pace and the scale of the problems (and being a longtime lurker). I’ve had the opportunity to work on so many different areas of the business
TL;DR LLM evals, automated judges that assess relevance, coherence, and quality at scale, are a powerful new... The post Better Experiments with LLM Evals — A funnel, not a fork appeared first on Spotify Engineering.
Introduction Long integrated development environment (IDE) sync/indexing times can quietly erode developer productivity, making code navigation sluggish, spiking memory usage, and slowing down Jetpack Compose preview updates, turning the IDE into a bottleneck rather than a helpful tool. For Android engineers working in a large monorepo, this was a daily reality. In this post, we will share how we