Loading articles…
RSS reader
Showing 203 of 203 articles in Last 30 days
Know an interesting engineering blog?
Feel free to contribute and add more sources to the aggregator.
We are delighted to welcome Daniel Aw in his new role as the general manager for Red Hat’s Asia Pacific region. In this role, Daniel will lead the next phase of the company’s growth in the region, and advance customer success with AI and open hybrid cloud technology. Daniel brings more than three decades of international experience driving customer satisfaction, sales, and revenue growth. He is kn
The hidden cost killing your AI apps roadmapAcross leading tech organizations building AI-native apps...
Musk’s lawsuit accuses Altman of fraud, while OpenAI says that Musk is ‘motivated by jealousy’ A trial between two of Silicon Valley’s biggest tycoons kicked off on Monday in California, the culmination of a years-long bitter feud. Elon Musk has accused Sam Altman of betraying the founding agreement of the non-profit they started together, OpenAI, by changing it to a for-profit enterprise. Jury se
Written by Spencer Koch (u/securimancer), Nathan Handler (u/nhandlerOfThings) and Pratik Lotia (u/wind_lectric) A visual metaphor of Dr. Cowsnoo using a security-recommended vehicle and decisively crushing a chaotic, dilapidated pile of legacy technology. When you are running dozens of AWS accounts, each with its own legacy OAuth proxy that you can barely track down on GitHub, along with bastions
Starting June 1, your Copilot usage will consume GitHub AI Credits. The post GitHub Copilot is moving to usage-based billing appeared first on The GitHub Blog.
Late March took me to Seattle for the Specialist Tech Conference, one of the most energizing gatherings of AWS specialists from around the world. It was an incredible opportunity to connect with peers, exchange experiences, and go deep on the latest advancements in Generative AI and Amazon Bedrock — and a powerful reminder of something […]
The invisible problem with agentic AIMost enterprises are experimenting with autonomous AI agents...
After Donald Trump’s second election, I realised the insidious hold my phone had over my life. So I turned to something I’d loved in childhood to better occupy my attention After a long day of looking at screens for work, I used to go to bed and stare at my phone until I fell asleep. When not doomscrolling news headlines, I’d crash out to hateful comments on social media or revisit workplace drama
Starting April 29th, the maximum retention policy for Hobby plans will be capped at 30 days. Deployments outside your retention window will be automatically removed. This excludes your 10 most recent production deployments and any aliased deployments, which continue to be preserved regardless of retention settings. Pro and Enterprise plans are not affected. Learn more about .Deployment Retention R
How close are we to the sci-fi vision of autonomous humanoid robots? I visited 11 companies in five Chinese cities to find out By Chang Che. Read by Vincent Lai Continue reading...
Learn how to bridge the "Identity Wall" in B2B SaaS. Carlos Mostek shares an architectural blueprint for Enterprise SSO and robust multi-tenant data isolation.
At Red Hat, our deep focus on security doesn't stop at the code, it extends to how we communicate vulnerability information to our partners and customers. Based on valuable feedback from our partner community, Red Hat Product Security is announcing a major evolution in our security data ecosystem—the complete overhaul of our Common Security Advisory Framework (CSAF) and Vulnerability Exploit eXcha
In Norse mythology, the god Heimdall protects the rainbow bridge Bifröst from invaders, a bridge that connects the realm of the gods to the realm of humans. This legend provided the name for Exercise HEIMDALL, a unique initiative held this past February in the High North of Norway. Organized by the NATO Center of Excellence - Cold Weather Operations, the exercise focused on innovating in extreme e
Enterprise AI has officially moved past the "can we build it?" phase. Business leaders aren't just asking how to train a model, they’re asking how to scale, protect, and operationalize these systems to drive return on investment (ROI) without losing control of their data.At Red Hat Summit 2026, you'll be able to dive deep into technical architecture, partner integrations, and other topics through
Read about Amplitude’s 2026 AI Week, when we stopped business as usual to uplevel AI skills and become more AI first across our entire company.
Today, we are excited to introduce the next generation of Genie. The new Genie can...
Tesla chief believes Altman broke company’s founding agreement – and legal battle promises to be explosive The bitter rivalry between two of the tech world’s most powerful men arrives in court this week, as Elon Musk’s lawsuit against Sam Altman and OpenAI heads to trial in Oakland, California. The case is set to feature some of the biggest names in Silicon Valley, and its outcome could affect the
Discrepancy in forecasts raises questions over government planning for net zero One vision of the UK’s future involves a decarbonised economy powered by clean, renewable energy. Another involves making the UK an AI superpower. The government departments responsible for these two visions do not appear to have agreed on their numbers. Continue reading...
While emerging technology is banned from the Palme d’Or, an upstart movement is gaining investment and attention In Cannes’ darkened screening rooms, the supposed future of cinema flickered into life this week and it was strange. The first edition of the World AI film festival (WAIFF) showcased visions of men with fish scales erupting from their necks and seaweed from their mouths, a heroine with
Secondhand car buyers urged to carefully inspect vehicles, while owners told to beware tests that are suspiciously quick Rise of the ‘ghost owner’: 18,000 UK vehicles in use without proper records You have just bought a secondhand car. It was older than you wanted, but were reassured because it had recently passed its MOT. Within a few days, you notice a problem with the steering and take it into
Met says AI software unearthed rule-breaking ranging from work-from-home violations to suspected corruption The Metropolitan police have launched investigations into hundreds of officers after using an AI tool built by the controversial tech company Palantir to root out rogue cops. The software was deployed by the Met over the course of a week, surveilling staff members using data the force has re
As AI erases the bottom rungs of the corporate ladder, some gen Z workers skip the entry level to become their own CEOs When Ashley Terrell graduated from the University of Hawaii in 2024, she planned to find a job in marketing, maybe for a tech company. She had a bachelor’s degree in business administration and a college résumé that included a student marketing job for Red Bull. But after months
A crypto tycoon is giving record-breaking amounts to Farage’s party. But little is known about his motives Shortly before Christmas 2022, Chakrit Sakunkrit, owner of the Kamalaya Wellness Sanctuary on the Thai island of Koh Samui, invited 200 guests to spend a few days celebrating his 60th birthday. One sultry afternoon, Sakunkrit and a small group gathered around a table near the shore, surrounde
Getting a model to answer 10 inference requests concurrently is tricky but simple enough; getting it to handle 2,000 engineers hitting a coding assistant with long contexts, all day, without runaway costs, is where teams stall. A working endpoint is only the beginning. Teams need to identify the supporting hardware and wire up the right components—serving, scaling, observability, and cost guardrai
What Changed in the April 2026 MRM GuidanceOn April 17, 2026, the Federal Reserve,...
GPT-5.5 is OpenAI's strongest frontier model for agentic enterprise work, complex...
Revised figures increase fears about energy-intensive datacentres worsening climate emergency The UK government vastly underestimated the climate impact of artificial intelligence, it has emerged, after officials raised their estimate of carbon emissions from AI by a factor of more than 100. According to new data quietly published this week, energy use by AI datacentres in the UK could cause the e
Operational databases — also called online transaction processing (OLTP) databases...
GPT-5.5 is now available on .Vercel AI Gateway There are 2 variants: GPT-5.5 and GPT-5.5 Pro. Both models are tuned for long-running agentic work across coding, computer use, knowledge work, and scientific research, and are more token-efficient than the previous generation. GPT-5.5 is stronger at agentic coding and long-horizon work where the model needs to hold context across a large system and c
Let’s address the most common pitfalls and misconceptions developers encounter when implementing the Backend for Frontend (BFF) pattern
Neo already helps your team manage Pulumi infrastructure, but no infrastructure team works inside Pulumi alone. Pages come from PagerDuty, telemetry from Datadog or Honeycomb, follow-ups from Linear or Jira. Most of the job is shuttling context between those tools. Today we’re launching the Integration Catalog for Pulumi Neo: one place to connect Neo to the tools your team already uses, so your ag
Running Llama 70B as an on-demand cloud inference endpoint costs roughly $16,000 per month. Running Llama 8B costs about $734. For teams where an 8B model meets the quality bar for their workload, that gap is very hard to ignore.The question enterprise teams are asking is rarely, "how do we get the most powerful model?" It is almost always, "how do we get a model that's fast enough, accurate enoug
5 reasons to go with your team to Red Hat Summit 2026Red Hat Summit is where the global community comes together to solve the industry's biggest challenges, and there is no better way to navigate that future than with your team by your side. Register today to join us in Atlanta, May 11-14. Learn more Red Hat Further Drives Digital Sovereignty for the AI Era with Red Hat OpenShift on Google Cloud D
Disruption in the virtualization market has not slowed down. The fallout from industry licensing and packaging changes continues to push organizations into decisions they were not planning to make this year, and for many, the timelines are getting shorter, not longer. Over the past 12 months, we have worked with hundreds of organizations navigating exactly this situation, and at Red Hat Summit 202
Looking at the release notes or changelogs for QEMU upstream, you might notice that there's something new in version 11.0:SEV-SNP and TDX machines can now be reset.This is a feature we at Red Hat helped implement. The motivations and associated challenges have been explained in detail in a FOSDEM 2026 presentation. Before this feature was available, some confidential guests (AMD SEV-based guests)
Extending confidential computing from individual workloads to the entire cluster is a new frontier in cloud-native security.Today, Red Hat is announcing the Developer Preview of confidential clusters for Red Hat OpenShift, a new feature of OpenShift that extends confidential computing to the cluster infrastructure level. Confidential clusters establish hardware-rooted trust across every node in an
Amplitude AI Assistant is a chatbot builder that uses product data to personalize support and capture rich customer feedback.
Databricks is excited to partner with OpenAI on GPT-5.5, their latest frontier model....
In large-scale cloud environments, unpredictable hypervisor crashes carry real operational cost. While traditional reactive monitoring that relies on static thresholds and post-hoc alerts were once the industry standard, this monitoring misses the non-linear, stochastic signals that precede hardware failure. In an era where high availability is the norm, the transition from reactive observation to
Want to build AI agents with JavaScript that go beyond basic chat completions? Agents that reason, call tools, and pull from knowledge bases on their own? We put together a free, open source course to help you get there. LangChain.js for Beginners is 8 chapters and 70+ runnable TypeScript examples. Clone the repo, add your […] The post LangChain.js for Beginners: A Free Course to Build Agentic AI
Ten years ago, I came back to Reddit. Twenty years and change ago (October 2005, if we're being exact), I got a call from u/spez while I was grabbing coffee with a labmate. Paul Graham had suggested he hire me as Reddit's first engineer. I said yes before I hung up the phone. This week, I’ve decided to step down as CTO and take on a new role as Reddit's first Senior Technical Fellow. The last deca
We first introduced Lakeflow Designer at Data and AI Summit last year. Since then,...
Our journey to truly understand our customer experience began with a hard look at our internal availability numbers at the start of 2025. We saw something uncomfortable: the numbers didn’t match our customers’ reality. Our monthly availability oscillated between 99.5% and 99.9%. Those peaks and valleys depended more on whether we declared a high-severity incident that month than on how the platfor
DeepSeek V4 is now available on .Vercel AI Gateway There are 2 model variants: DeepSeek V4 Pro and DeepSeek V4 Flash. A 1M token context window is the default across both models. DeepSeek V4 Pro focuses on agentic coding, formal mathematical reasoning, and long-horizon workflows. It handles feature development, bug fixing, and refactoring across stacks, with tool use that works across harnesses li
How AI Agents Can Implement Auth0 Quickly and Efficiently.
Learn how to mitigate the top five AI agent security risks, including over-privileged tools and memory poisoning, using OWASP 2026 standards and OpenFGA.
Policy authors who need external credentials or environment-specific configuration have had to hardcode values or manage them outside of Pulumi. Policy packs can now reference Pulumi ESC environments, bringing centralized secrets and configuration management to your policies. The problem Pulumi policy packs let you enforce rules across your infrastructure, but some policies need more than just the
Organizations are facing difficult decisions on choices of how to accelerate cloud-native innovation without sacrificing the stability of existing business critical applications and systems. Red Hat OpenShift Virtualization running on Red Hat OpenShift Dedicated provides the answer, offering a unified foundation that runs both containerized applications and virtual machines (VMs) on the same infra
Learn how to use TCP, UDP, and ICMP protocols in network path testing to pinpoint and diagnose application performance issues faster.
Learn about the latency between different product signals when running experiments, so you can prioritize fixes immediately and drive growth.
See how Tira used Amplitude AI agents and MCP to cut analysis cycles from a week to a day and get leaders acting on data faster.
IntroductionIn the Databricks intelligence platform, we regularly explore and use...
How we used Honk, Backstage, and Fleet Management to ease the pain of migrating thousands of datasets. The post Background Coding Agents: Supercharging Downstream Consumer Dataset Migrations (Honk, Part 4) appeared first on Spotify Engineering.
There is a question circulating in boardrooms and data leadership meetings right now that goes something like this...
The Model Context Protocol (MCP) is quickly becoming a common way for AI agents to discover and use tools. It provides a consistent interface to databases, APIs, file systems, and third-party services, which makes it easier to plug capabilities into agent workflows. However, MCP standardizes the execution surface without defining how that surface should be […] The post Securing MCP: A Control Plan
Written by Roman Levitas and Tim Zhu. TL;DR The operational pitfalls of Kubernetes sidecars are well-documented: resource limits that quietly throttle your app, scaling constraints that force wasteful over-provisioning, and cascading failures that are maddeningly hard to diagnose. At Reddit, we ran headfirst into all of them—with our experimentation infrastructure, of all things. Reddit's Decider
We know how to scale traditional web services: throw a load balancer in front of stateless microservices and horizontally scale your CPU instances as traffic grows. Large Language Models break this playbook because LLM inference is fundamentally stateful, bottlenecked by memory bandwidth rather than raw compute, and bound to physical hardware interconnects. Scaling LLM inference isn’t just a matte
The subtle inventiveness that reduced cold start setup from seconds to 200μs.
Panics in Rust Workers were historically fatal, poisoning the entire instance. By collaborating upstream on the wasm‑bindgen project, Rust Workers now support resilient critical error recovery, including panic unwinding using WebAssembly Exception Handling.
New Relic and AWS have surpassed $1 billion in AWS Marketplace transactions. Here's the story behind the milestone, and where the partnership is headed.
This comprehensive guide outlines the implementation of a secure framework for authentication and authorization in Gemini Enterprise Agent Platform Runtime.
Somewhere in your company right now, a developer is building an AI agent. Maybe it’s a release agent that cuts tags when tests pass. Maybe it’s a cost agent that shuts down idle EC2 overnight. It’s running, it’s in production, and there’s a decent chance the platform team doesn’t know it exists. This isn’t a thought experiment. OutSystems just surveyed 1,900 IT leaders and the numbers are rough: 9
This blog post will show you how to filter out landline numbers before sending SMS notifications in Rust using Twilio's Lookup v2 API.
Learn how millions of minutes of AI calls helped us cure the awkward digital pause, and how we are redefining agent infrastructure at SIGNAL 2026.
Read how security engineers and analysts can focus on what actually requires human judgment in cloud security investigations when AI handles the time-intensive steps.
Collect structured developer feedback in Datadog and analyze responses alongside operational data by using Datadog Forms and Sheets.
See how Datadog helps Google Cloud teams evaluate AI agents, optimize GPU and TPU infrastructure, and strengthen security.
Outgoing CEO took stood up for users in battle with FBI but concessions abroad undermine claims of protecting ‘fundamental right’ In his 15 years as Apple’s top executive, Tim Cook has projected an image of the company as a champion of privacy rights. As he prepares to leave that role in September, that legacy has come back into focus. Cook trumpeted the iPhone maker’s commitment to privacy at hom
We have moved past the point where a 70GB model was considered “heavy.” With the rise of models like DeepSeek-V3, the GLM series, and other massive Mixture-of-Experts (MoE) architectures, the industry is now grappling with weights exceeding 700GB in optimized formats—and well over 1.2TB in full precision. And parameters keep climbing—Epoch’s AI data tracks frontier models now reaching into the tri
We’ve fundamentally transformed Facebook Groups Search to help people more reliably discover, sort through, and validate community content that’s most relevant to them. We’ve adopted a new hybrid retrieval architecture and implemented automated model-based evaluation to address the major friction points people experience when searching community content. Under this new framework, we’ve made tangib
As AI assistants and privacy proxies challenge the capabilities of traditional bot detection, the Web needs new models for accountability. We believe that control should remain with the client, and that an open ecosystem of anonymous credentials is key to preserving user privacy while protecting origins from abuse.
GPT Image 2 is now available on .Vercel AI Gateway OpenAI's newest image model supports detailed instruction following, accurate placement and relationships between objects, and rendering of dense text across multiple aspect ratios. The model can render fine-grained elements including small text, iconography, UI elements, dense compositions, and subtle stylistic constraints, at up to 2K resolution
Learn how CheckMate for Auth0 has evolved over six months to provide automated, open-source security audits for your CI/CD pipeline.
Empower customers with self-service admin using Auth0's My Organization API. This guide shows how to build a Next.js dashboard in your SaaS app so they can manage their own settings, reducing your overhead.
tips and tricks for learning how to count segments using Twilio's automated SMS in ways that will save you budget and time
Connect Azure DevOps to Datadog to analyze code health, accelerate troubleshooting, and enforce quality standards across your software delivery life cycle.
Learn how Datadog’s UK availability zone on AWS enables organizations to host observability data in the UK while maintaining end-to-end visibility across their environments.
Learn more about Amplitude AI Assistant. Our in-product support agent knows your users, acts on their behalf, and measures whether it actually helped.
We're making these changes to ensure a reliable and predictable experience for existing customers. The post Changes to GitHub Copilot Individual plans appeared first on The GitHub Blog.
The open source Git project just released Git 2.54. Here is GitHub’s look at some of the most interesting features and changes introduced since last time. The post Highlights from Git 2.54 appeared first on The GitHub Blog.
Claude Opus 4.7 arrives in Amazon Bedrock with improved agentic coding and a 1M token context window. AWS Interconnect reaches general availability with multicloud private connectivity and a new last-mile option. Plus, post-quantum TLS for Secrets Manager, new C8in/C8ib EC2 instances, and more.
Agents Week 2026 is a wrap. Let’s take a look at everything we announced, from compute and security to the agent toolbox, platform tools, and the emerging agentic web. Everything we shipped for the agentic cloud.
We built our internal AI engineering stack on the same products we ship. That means 20 million requests routed through AI Gateway, 241 billion tokens processed, and inference running on Workers AI, serving more than 3,683 internal users. Here's how we did it.
Learn about how we built a CI-native AI code reviewer using OpenCode that helps our engineers ship better, safer code.
Kimi K2.6 from Moonshot AI is now available on .Vercel AI Gateway The model focuses on long-horizon coding tasks, with generalization across languages such as Rust, Go, and Python and across front-end, devops, and performance optimization work. K2.6 can turn simple prompts into complete front-end interfaces with structured layouts. For autonomous, proactive agents that run continuously across mult
The Pulumi Cloud REST API reference is now generated directly from the live OpenAPI spec at build time. Every endpoint, parameter, request body, and response schema you see on the page comes from the same spec the API itself publishes. The docs now stay in sync with the API automatically! Why this matters The previous REST API reference was a set of handwritten pages. That meant every new endpoint
Pulumi Cloud now supports Bitbucket Cloud as a first-class VCS integration, joining GitHub, GitLab, and Azure DevOps. Connect your Bitbucket workspace to deploy infrastructure on every push, preview changes on pull requests, spin up ephemeral review stacks, and get AI-powered change summaries — all without an external CI/CD pipeline. Deploy infrastructure from Bitbucket Connect a Bitbucket reposit
Order confirmed. Now what? These 17 post-purchase email types turn one-time buyers into loyal customers—with timing tips and real examples.
Reverse ETL vs. The Private Cloud: A Conceptual Survival Guide
Learn to connect Google Gemini with Twilio Voice using ConversationRelay and Python's FastAPI. Follow our guide for real-time AI conversations and interactive voice apps.
France's CNIL now requires explicit consent for email open tracking pixels. Learn what's changing, the deliverability exception, and what to do next.
As AI moves from experimental chat interfaces to production-grade agents, the need for a foundational memory layer to transform these AI-powered tasks into stateful models is apparent. The absence of a robust memory layer causes agents to lose vital statefulness, leading to: Inability to maintain long-term recall. Without persistent memory to track context across sessions, an agent might recognize
See how we created an emoji list generator during the Rubber Duck Thursday stream. The post Building an emoji list generator with the GitHub Copilot CLI appeared first on The GitHub Blog.
Changes to the status page will provide more specific data, so you'll have better insight into the overall health of the platform. The post Bringing more transparency to GitHub’s status page appeared first on The GitHub Blog.
The Agent Readiness score can help site owners understand how well their websites support AI agents. Here we explore new standards, share Radar data, and detail how we made Cloudflare’s docs the most agent-friendly on the web.
Today, we’re excited to give you a sneak peek of our support for shared compression dictionaries, show you how it improves page load times, and reveal when you’ll be able to try the beta yourself.
Soft directives don’t stop crawlers from ingesting deprecated content. Redirects for AI Training allows anybody on Cloudflare to redirect verified crawlers to canonical pages with one toggle and no origin changes.
Running LLMs across Cloudflare’s network requires us to be smarter and more efficient about GPU memory bandwidth. That’s why we developed Unweight, a lossless inference-time compression system that achieves up to a 22% model footprint reduction, so that we can deliver faster and cheaper inference than ever before.
Cloudflare Agent Memory is a managed service that gives AI agents persistent memory, allowing them to recall what matters, forget what doesn't, and get smarter over time.
By migrating our request handling layer to a Rust-based architecture called FL2, Cloudflare has increased its performance lead to 60% of the world’s top networks. We use real-user measurements and connection trimeans to ensure our data reflects the actual experience of people on the Internet.
We are launching Flagship, a native feature flag service built on Cloudflare’s global network to eliminate the latency of third-party providers. By using KV and Durable Objects, Flagship allows for sub-millisecond flag evaluation.
Retention policies no longer delete the latest preview deployment for branches with open or unmerged pull requests. Previously, deployments for active branches could be removed if they exceeded the configured retention window. This means you can safely use shorter retention windows without risking losing active preview deployments. This change applies to all plans. Your 10 most recent production d
Zo Computer on Vercel Death by a thousand adapters AI SDK + AI Gateway: two layers, one integration 20x improvement in reliability Scaling to a million personal cloud owners 20x reduction in retry rate (7.5% → 0.34%) 99.93% chat success rate (up from 98%) P99 latency cut 38% (131s → 81s) New models added in less than 1 minute Average latency improved 25.7% P95: 46s → 34s (25% improvement) P99: 131
Stop looking for an AI design playbook. Learn how Auth0 built a capability for AI-native design that reduced cycle times by 50%.
Read about why A/B testing makes sense for a wide variety of engineering purposes—not just growth and product.
Datadog Governance Console centralizes usage insights and automates policy enforcement to reduce risk, control costs, and improve observability at scale.
We’re sharing insights into Meta’s Capacity Efficiency Program, where we’ve built an AI agent platform that helps automate finding and fixing performance issues throughout our infrastructure. By leveraging encoded domain expertise across a unified, standardized tool interface these agents help save power and free up engineers’ time away from addressing performance issues to innovating on [...] Rea
Learn how Github uses eBPF to detect and prevent circular dependencies in its deployment tooling. The post How GitHub uses eBPF to improve deployment safety appeared first on The GitHub Blog.
We’re sharing lessons learned from Meta’s post-quantum cryptography (PQC) migration to help other organizations strengthen their resilience as industry transitions to post-quantum cryptography standards. We’re proposing the idea of PQC Migration Levels to help teams within organizations manage the complexity of PQC migration for their various use cases. By outlining Meta’s approach to this work [.
AWS launches Claude Opus 4.7 in Amazon Bedrock, Anthropic's most intelligent Opus model for advancing performance across coding, long-running agents, and professional work. Claude Opus 4.7 is powered by Amazon Bedrock's next generation inference engine, purpose-built for generative AI inferencing and fine-tuning workloads.
he Agentic Shift demands new observability. IDC shares 4 key trends, including the Agent Economy and AI-Ready Infrastructure, making agentic tracing and AI observability crucial.
is now generally available. Vercel Flags Vercel Flags is a feature flag provider built into the Vercel platform. Create and manage feature flags with targeting rules, user segments, and environment controls directly in the Vercel Dashboard. The provides a framework-native way to define and use these flags within Next.js and SvelteKit applications, integrating directly with your existing codebase:F
Claude Opus 4.7 from Anthropic is now available on .Vercel AI Gateway Opus 4.7 is optimized for long-running, asynchronous agents and handles complex, multi-step tasks with reliable agentic execution. The model shows gains on knowledge-worker tasks, particularly where it needs to visually verify its own outputs. Opus 4.7 is also stronger at programmatic tool-calling with image-processing libraries
The gap between prototypes and production-ready systems is huge. Code that's trivial to run locally falls apart the moment it needs to handle failures, restarts, and real traffic. Framework defined infrastructure solved this for web applications. When you deploy, Vercel infers the right configuration from the app itself. Workflows extends that model to long-running systems. Instead of managing a s
Build your own future with Twilio Flex: a composable, open platform for seamless employee conversations embedded directly into your workflows.
Use Datadog IaC Security to catch GitHub Actions misconfigurations in the diff, before they reach production.
Learn how Datadog Observability Pipelines helps teams transform and normalize logs and metrics from OpenTelemetry.
Load balancing for LLMs is fundamentally different from load balancing for traditional services like web servers, APIs, or databases. Prompt caching is the reason. Prompt caching typically cuts input token costs by 50-90% and can reduce Time to First Token (TTFT) latency by up to 80%, but those gains assume your request lands on the replica that already has the relevant prefix cached. Under naive
Learn about the productivity tool one GitHub engineer built, and how AI supported the development process. The post Build a personal organization command center with GitHub Copilot CLI appeared first on The GitHub Blog.
We’re sharing recent policy updates that developers should know about, updating our Transparency Center with the full year of 2025 data, and looking to what’s ahead. The post Developer policy update: Intermediary liability, copyright, and transparency appeared first on The GitHub Blog.
You can now access Bytedance's latest state-of-the-art video generation model, Seedance 2.0, via with no other provider accounts required.AI Gateway Seedance 2.0 is available on AI Gateway in two variants: Standard and Fast. Both share the same capabilities. Standard produces the highest quality output, while Fast prioritizes generation speed and lower cost. Seedance 2.0 is strong at maintaining m
Vercel is reducing the price of Turbo build machines by 16%. All builds are now priced at $0.0035 per CPU per minute. With this new model: This change will begin rolling out on April 27, and will appear on invoices for the current billing cycle as "Build CPU Minutes". Learn more about or monitor your Builds usage from .build machine pricingProject Usage Read more Turbo machines, with 30 CPUs, are
Learn why identity is the core of AI architecture and how to manage non-human identities to build secure, scalable, and trusted AI agents.
Discover how Cisco Systems used Amplitude and Autocapture to accelerate adoption by 20% and build a lasting culture of data-driven innovation.
We analyzed 27K sessions with Amplitude's Global Agent using our Agent Analytics tool. Here's what we found out about how real users are prompting our agent.
Today, we’re announcing the general availability of AWS Interconnect – multicloud, a managed private connectivity service that connects your Amazon Virtual Private Cloud (Amazon VPC) directly to VPCs on other cloud providers. We’re also introducing AWS Interconnect – last mile, a new capability that simplifies how you establish high-speed, private connections to AWS from your […]
Learn to find and exploit real-world agentic AI vulnerabilities through five progressive challenges in this free, open source game that over 10,000 developers have already used to sharpen their security skills. The post Hack the AI agent: Build agentic AI security skills with the GitHub Secure Code Game appeared first on The GitHub Blog.
This guest post comes from IDC’s Dr. William Lee, Senior Research Director, Service Provider and Core Infrastructure Research. MongoDB commissioned IDC to explore the connection between legacy infrastructure, data challenges, and AI across Asia Pacific, and today we’re happy to share that work. For more, see the full MongoDB-sponsored IDC InfoBrief, Modernizing Legacy: Winning in the Age of AI, Do
PostgreSQL is a powerful and hugely popular database engine, and it really comes alive across Microsoft developer platforms. You can build with PostgreSQL across Azure offerings, develop productively in Visual Studio Code with strong extensions and tooling, and connect your data to agentic development workflows and AI services. There’s amazing opportunity to bring those pieces […] The post Take yo
Learn how to monitor Auth0 usage metrics, track active users, and manage M2M token limits to avoid unexpected billing upgrades and plan spikes.
The new Code Security Risk Assessment gives you a one-click view of vulnerabilities across your organization, at no cost. The post How exposed is your code? Find out in minutes—for free appeared first on The GitHub Blog.
Pulumi Insights account scanning now supports every AWS partition. If your workloads run in GovCloud, China, the European Sovereign Cloud, or one of the ISO intelligence-community clouds, you can get the same resource discovery, cross-account search, and AI-assisted insights that commercial accounts already have. Supported partitions AWS Standard (Commercial) AWS GovCloud (US) AWS ISO (US) AWS ISO
See five reasons why SIGNAL 2026 in San Francisco is the place to be for developers, marketings, and more on May 6 & 7. Build Wonder with Twilio.
Explore the next generation of Linked Audiences in Twilio Segment. From Entity-Based Targeting to Aggregated Conditions, learn how to bridge the gap between your data warehouse and marketing tools with less manual work and deeper observability.
Read Spenser and Gab's conversation about what it means to build a digital product in 2026, and Gab was the perfect fit for our new CPO.
Written by Chris Slowe and Lisa O'Keefe Happy 5th birthday, r/RedditEng! https://preview.redd.it/u7rl3yefxzug1.png?width=306&format=png&auto=webp&s=bd9d11990632b65df5b3a512a93695563e5bee34 Huge thanks to the Reddit engineering team for building, scaling, and sharing so much great work here over the years. This subreddit has become a rare corner of the internet where people can go deep on real syst
Excerpt In complex, long-running agentic systems, maintaining alignment and coherent reasoning between agents requires careful design. In this second article of our series, we explore these challenges and the mechanisms we built to keep teams of agents working productively over long time spans. We present a range of complementary techniques that balance the conflicting requirements…
At DigitalOcean, documentation has always been a priority. Developers come to our docs to get unstuck, and the faster they find what they need, the better. Traditional docs pages work, but they require users to know which page to visit, scan for the relevant section, and map generic instructions to their specific setup. That process takes minutes (or longer) when it could take seconds. So we built
In my last Week in Review post, I mentioned how much time I’ve been spending on AI-Driven Development Lifecycle (AI-DLC) workshops with customers this year. A common theme in those sessions is the need for better cost visibility. Teams are moving fast with AI, but as they go from experimenting to full production, finance and […]
Three community frameworks have emerged that fix the specific ways AI coding agents break down on real projects. Superpowers enforces test-driven development. GSD prevents context rot. GSTACK adds role-based governance. All three started with Claude Code but now work across Cursor, Codex, Windsurf, Gemini CLI, and more. Pulumi uses general-purpose programming languages to define infrastructure. Ty
Test
Learn how AI agents for marketing can help you prioritize impact so you can do important work, instead of just more work.
At Meta, WebRTC powers real-time audio and video across various platforms. But forking a large open-source project like WebRTC within our monorepo presents unique challenges – over time, an internal fork can drift behind upstream, cutting itself off from community upgrades. We’re sharing how we escaped this “forking trap” – from building a dual-stack architecture [...] Read More... The post Escapi
As AI increases developer speed and productivity it also increases the need for safeguards. On this episode of the Meta Tech Podcast, Pascal Hartig sits down with Ishwari and Joe from Meta’s Configurations team to discuss how Meta makes config rollouts safe at scale. Listen in to learn about canarying and progressive rollouts, the health checks [...] Read More... The post Trust But Canary: Configu
Last year we added support for Bun as a package manager for Pulumi TypeScript projects. Today we’re taking the next step: Bun is now a fully supported runtime for Pulumi programs. Set runtime: bun in your Pulumi.yaml and Bun will execute your entire Pulumi program, with no Node.js required. Since Bun’s 1.0 release, this has been one of our most requested features. Why Bun? Bun is a JavaScript runt
Amazon S3 Files makes S3 buckets accessible as high-performance file systems on AWS compute resources, eliminating the tradeoff between object storage benefits and interactive file capabilities while enabling seamless data sharing with ~1ms latencies.
Introduction Prompt caching is the process of reusing already computed KV states across inference requests in order to save money and reduce latency. Within a single replica, modern inference engines like vLLM, SGLang, and TensorRT-LLM handle it automatically. Incoming prompts are matched against cached prefixes and recomputed only where necessary, without requiring user configurations The problem
You can often predict a load spike before it arrives. Maybe it happens at the same time every day, or there’s always a spike at midnight on a Friday when you run a certain batch job. Or maybe it’s not cyclical, but load is rising steadily, and it’s a reasonable guess that it will keep rising for a while. MongoDB Atlas’s reactive auto-scaler handles these spikes, but scaling to the right size takes
Andy Warfield writes about the hard-won lessons dealing with data friction that lead to S3 Files
We analyzed checkout activity across more than 20K businesses, surveyed shoppers and ecommerce leaders, and gathered insights from businesses on the Stripe network to understand what’s changing in online conversion.
AI agents are transforming the way we build — and even how we think of ourselves as software developers. Both... The post Let’s Talk Agentic Development: Spotify x Anthropic Live appeared first on Spotify Engineering.
Written by Nazareno Lorenzo As an engineer you learn a lot from building, but I believe you learn exponentially more from breaking things. We have a saying in the country where I grew up: Those who burnt themselves with milk, see a cow and cry. If you can expand this and learn not just from your own mistakes, but from the mistakes of your entire company, you multiply your learning opportunities an
Last week, I visited AWS Hong Kong User Group with my team. Hong Kong has a small but strong community, and their energy and passion are high. They recently started a new AI user group, and we hope more people will join. I was able to strengthen my bond with the community through great food […]
AI coding assistants are powerful but only as good as their understanding of your codebase. When we pointed AI agents at one of Meta’s large-scale data processing pipelines – spanning four repositories, three languages, and over 4,100 files – we quickly found that they weren’t making useful edits quickly enough. We fixed this by building [...] Read More... The post How Meta Used AI to Map Tribal K
Learn cloud monitoring best practices to reduce blind spots, improve reliability, and resolve issues faster in complex environments with New Relic.
Learn how to choose network monitoring tools that deliver context, clarity, and faster incident response. See how New Relic connects signals.
Explore proven network monitoring best practices to reduce noise, improve visibility, and speed MTTR in distributed systems with New Relic.
Microsoft Entra ID (formerly Azure Active Directory) is Azure’s identity and access management service. Any time your application needs to authenticate with Entra ID, you create an app registration and give it a client secret that proves its identity. But those secrets expire, and if you don’t rotate them in time, your app loses access. If you or your team manages Azure app registrations, you know
Learn how Temporal used Amplitude to unify disjointed data, challenge assumptions, optimize self-signup, and turn user insights into product-led growth.
Learn about key assumptions made when using sample size calculators and how to account for them for more trustworthy tests.
Organizational safeguards are now generally available in Amazon Bedrock Guardrails, enabling centralized enforcement and management of safety controls across multiple AWS accounts within an AWS Organization.
The cloud AI platform ecosystem today looks more powerful than ever, with access to powerful GPUs like NVIDIA H100 and H200, massive libraries of pre-trained models, and full pipelines for fine-tuning and inference. I recently tried deploying a simple inference endpoint for a model. Ideally, it should have taken a few minutes: provision compute load the model send a request Instead, it took clos
You can now run policy packs against your existing stack state without running your Pulumi program or making provider calls. The new pulumi policy analyze command evaluates your current infrastructure against local policy packs directly, turning policy validation into a fast, repeatable check. Why this command matters Policy authoring and policy updates usually involve an iteration loop: Make a po
Learn how DeFacto partnered with Papersource and implemented Amplitude to centralize product analytics, scale experimentation 4x, and drive measurable revenue growth through data-driven digital experiences.
AI is now central to modern software development. Teams across industries are turning to AI to solve product and workflow problems in software. But building production systems is still complex. The hardest part of deploying AI isn’t the model, it’s everything around it. That complexity becomes a glue-code problem when storage, compute, orchestration, networking, authentication, and inference live
This is the second post in the Ranking Engineer Agent blog series exploring the autonomous AI capabilities accelerating Meta’s Ads Ranking innovation. The previous post introduced Ranking Engineer Agent’s ML exploration capability, which autonomously designs, executes, and analyzes ranking model experiments. This post covers how to optimize the low-level infrastructure that makes those models run
By turning compaction into a layered, adaptive pipeline and strengthening our monitoring and controls, we made Magic Pocket more resilient to workload changes.
At DigitalOcean, we have been vocal about our strategic shift: we are building the world’s premier Agentic Inference Cloud. Our mission is to provide the foundation where AI-native enterprises build and run production inference at scale. Today, I am thrilled to announce a significant step in that journey: we have acquired Katanemo Labs, Inc., a leader in agentic AI infrastructure. By integrating K
Observability is no longer an infrastructure tax, it is the strategic command center for AI success. Use New Relic AI Monitoring to bridge the gap between AI potential and business ROI.
Application performance monitoring (APM) allows users to identify and track app performance using real-time data. Learn more about APM solutions with New Relic.
Observability allows you to analyze the internal states of a system, giving you much needed insights. Learn about observability tools, best practices and more.
Retailers know search and discovery have already shifted. What comes next is less settled. From embedded checkout to emerging third-party surfaces, here’s how ecommerce and AI leaders are integrating agentic commerce.
Amazon ECS Managed Daemons gives platform engineers independent control over monitoring, logging, and tracing agents without application team coordination, ensuring consistent daemon deployment and comprehensive host-level observability at scale.
New Relic has been awarded a perfect score of 100 for its policies and practices supporting workplace equality for LGBTQ+ employees
Amsterdam in late March still has that sharp North Sea wind, but inside the RAI Convention Centre, 13,350 people generated enough energy to heat the building twice over. KubeCon + CloudNativeCon EU 2026 was the biggest European edition yet, and the shift from previous years was impossible to miss. AI dominated the conference. I spent most of the conference at the Pulumi booth, and that turned out
Infrastructure work ranges from simple updates to complex multi-stack operations. For straightforward tasks, jumping straight to execution is often fine. But complex tasks benefit from deliberate upfront thinking: understanding what exists, identifying dependencies, and agreeing on an approach before anything changes. Today we’re launching Plan Mode, a dedicated experience for collaborating with N
AWS announces the Sustainability console, a new standalone service that consolidates carbon emissions reporting and resources, giving sustainability teams independent access to Scope 1, 2, and 3 emissions data without requiring billing permissions.
The Problem: Legacy Tooling and Its Limitations Currently, Slack utilizes a hybrid approach to network measurement, incorporating both internal (such as traffic between AWS Availability Zones) and external (monitoring traffic from the public internet into Slack’s infrastructure) solutions. These tools comprise a combination of commercial SaaS offerings and custom-built network testing solutions de
Meta continues to lead the industry in utilizing groundbreaking AI Recommendation Systems (RecSys) to deliver better experiences for people, and better results for advertisers. To reach the next frontier of performance, we are scaling Meta’s Ads Recommender runtime models to LLM-scale & complexity to further a deeper understanding of people’s interests and intent. This increase [...] Read More...
Software engineering is evolving into agentic engineering. According to the Stack Overflow Developer Survey 2025, 84% of respondents use or plan to use AI tools in their development, up from 76% the previous year. At this rate, the tooling needs to keep pace. Last year, we introduced the MongoDB MCP Server to give agents the connectivity they need to interact with MongoDB, helping them generate co
If you’ve been following this blog for a bit, you’ve almost certainly heard us mention Snoosweek before. Last year, we shared a judge’s perspective on the festivities. Today, we’re pulling back the curtain to share what went down at our most recent Snoosweek and give you a look at it all comes together. I'm new here - what is a Snoosweek!? First off, welcome! Snoosweek is Reddit’s internal hackath
Last week, what excited me most was the launch of the 2026 AWS AI & ML Scholars program by Swami Sivasubramanian, VP of AWS Agentic AI, to provide free AI education to up to 100,000 learners worldwide. The program has two phases: a Challenge phase where you’ll learn foundational generative AI skills, followed by a […]
Meta is continuing its long-term roadmap to help the construction industry leverage AI to produce high-quality and more sustainable concrete mixes, as well as those exclusively produced in the United States. Concurrent with the 2026 American Concrete Institute (ACI) Spring Convention, Meta is releasing a new AI model for designing concrete mixes – Bayesian Optimization [...] Read More... The post