-
Video
-
Table of contents
-
Video


- Description
- Transcript
- Discussion
About the talk
Herain Oberoi, Director of Product Marketing at AWS sits down with John Furrier as a part of the AWS Summit 2020 Digital Event Experience.
#AWSSummit #theCUBE
https://siliconangle.com/2020/05/13/aws-ultrawarm-nozzles-microservices-data-hose-with-
distributed-caching-awssummit/
AWS UltraWarm nozzles microservices-data hose with distributed caching
Cloud-based applications built with microservices are firehosing log data at DevOps teams. This is where open-source search-and-analytics engine Elasticsearch comes in for many teams. But Elasticsearch by itself doesn’t solve the problem of where to put all that data. Can a managed-service spin on Elasticsearch do the legwork for them?
The sheer amount of data is potentially great for analyzing app performance but also expensive to store and hard to get a handle on at times, according to Herain Oberoi (pictured), general manager of databases, analytics, and blockchain marketing at Amazon Web Services Inc. “[Users] either start to store that in archives or they don’t store it at all. If you store in archives, now you’ve got DevOps engineers and security experts that have to spend days to restore that data from the archive in order to search and analyze that data [with Elasticsearch],” Oberoi said.
Oberoi spoke with John Furrier, host of theCUBE, SiliconANGLE Media’s livestreaming studio, during the AWS Summit Online event. They discussed how AWS is addressing Elasticsearch’s storage weakness with smart caching. (* Disclosure below.)
Smart cache takes it in, Elasticsearch churns out new use cases
AWS has announced the general availability of UltraWarm — basically, storage-optimized Elasticsearch as a managed service. The main advantage for customers is that it introduces a distributed storage cache for frequently accessed data. It moves cold, or less frequently accessed, data to low-cost Amazon S3 storage.
The service allows DevOps teams to focus on high-level data search and analysis rather than time-consuming problems around storage, configuring and scaling. It costs about 80% less and executes queries about 50% faster than other managed ES services, according to Oberoi.
The types of queries and analyses possible for Elasticsearch users evolves once the engine’s been optimized not just for search, but also distributed storage caching.
“You can now go from storing a few days or maybe weeks worth of operational data to months of operational data at really low cost. With UltraWarm, now you can now use Elasticsearch for a broader set of use cases,” Oberoi concluded.
Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of the AWS Summit Online event. (* Disclosure: TheCUBE is a paid media partner for the AWS Summit Online event. Neither Amazon Web Services, the sponsor for theCUBE’s event coverage, nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)
About speakers
I have a Computer Science Degree and MBA. Startup Internet infrastructure and software guy since 1997. Most known for doing a podcast and podcast network venture 15 years too early. Turned blogger, podcaster, and tech media developer and reporter in 2004. Before that was early in the web and internet architecture (I believe that I was the first to come up with the idea of keyword navigation to replace urls in browser and search). Before doing startups I was in enterprise tech since 1985. Right now I'm running a self-funded startup going on 10yrs now doing media and software in the cloud. At the heart, I'm an entrepreneur. I love conceiving and building new products and technology at the intersection of Media and Technology; Where Computer Science meets Social Science.
View the profileFrom the cheap studios in Palo Alto in. Boston, can I see without leaders all around the world? This is a cute conversation. Are welcome back to our cue virtual coverage of a gift Summit online. What's a virtual event? I'm John for a host of the cube. We're here in our studios in Palo Alto with our quarantine crew, the past, two, and a half months continuing to keep the cube role in keeping the lights. On top of the everyone is out there. Also cover the top of vents eight of us. Some of the cube can't be there. The events not happening, we're happening, virtually got a great guest. Your
people on the train override director of product, marketing database, analytics blockchain. He understands data understands that the structure, great to see you again. Thanks for coming on Virtual queue. Thanks for having us in and this is a cool way to do it. Yeah, you know, now, there's no excuse when I hit you up on LinkedIn. We're going to do a video, a lot more to do. Anyway, in all seriousness, is a tough time at scale. Problems are here, we're seeing more and more things going on at the summit here, Kendra General availability, General availability of the ultra warm for Amazon elastic.
Search on an augmented AI. A lot of the guys are coming from reinvent, so a lot of you have the Cadence of a TV that starts happening. Now you're involved in the elastic, search the ultra warm, this kind of gets to the role of data, warm data called Data, hot data. This is a big part of the machine learning. Can you give it setup for why? This is getting so popular? What's the big deal here? Some some context on elasticsearch itself and why it's gotten so far. More recently. So, you know, we talked about data and
having to grow exponentially Alarm me to time and it's because so many people are now building apps in the towel, using Microsoft's architectures, and I need monologue. The Monitor and assess any of the operation for months of the assistance of our customers, a movie, the last six or so this because it allows customers to collect and analyze and visualize all of this unstructured and semi-structured logged into this machine generated log. They do look at how the applications are doing and sew elastic. Search service is the
fully managed Cloud version of the plastic sword that allows customers to run elastic storage in a way, which means I'm not spending time doing consideration and set up a time finding out Scalability for my clusters, I can focus more on actually analyzing the data itself. So that's going to just a little bit of what plastic surgery has. And so what what's been going on as as customers have been using elastic, sort of the amount of data that they want to be able to analyze is increasing. And so one of the, one of the challenges is that glastic search. It's the file
format itself is really optimized for sword. So it makes it really quick and interactive, but it's not optimized for storage and so it's somewhat and efficient for storage. And so what cuss words end up doing is if you want to store months of operational data, it's actually hundreds of terabytes. And so what happens is it becomes expensive and customers either start the store that in archives or they don't do it at all. And I supposed to have to spend days to restore that data from the archive.
Search and analyze that data. I'm so what what Ultra warm does is, it's a new high-performance low-cost warm storage or elastic. So let's do this. And, you know, this allows customers to store up display petabytes of data about a tenth of the cost of existing options. And so it gives customers the ability to nail store months of interactive analysis than they could before. That's a great description. Thanks for sharing that, I think one of the things that I haven't seen his friend, kind of Old Guard mentality was, hey, hear some
storage going to pay for it? Let's do some tearing pay for that and then that's cool. But then as you getting one more day till you said, there's a lot of files and unstructured data. You need to use that knowledge the store, but use it in the applications of data is actually part of the user experience, right? So I think that's where I see. And know what you're saying is that the old model or even the cloud model was getting costly because it was storing the data cuz I needed to low latency. War is warm implying lower latency faster access to data. As well as. So I get the pricing thing.
So it is lower costs will give that a second but is it a speed issue or an access to data? So, you know, elastic. So it's like I said, you know what it's optimized or sort. So you can get that fast interactive, you know, Korean and visualization of the data, but it's not optimized for Ultra warm. You now have in a warm storage, Co. That's it of optimized for both. You can actually still get that interactive Corey and visualization capabilities that you would expect from The Last Resort. But
you can do it at a lower cost in a much larger amount of data about in terms of order-of-magnitude year, give us a taste for the worm cost structure versus alternative. Yeah, so it's it's roughly about 80% lower than in a lawn care stories from other in a man is elasticsearch services and and you'll get about 50% faster. You shouldn't. So that's enough for customers to be able to get that in her activity. They want from that from that to the plastic surgeon that they're looking for. That's pretty significant numbers. Are
bitches come from the Amazon architecture? Nitro, what's the secret sauce? In all this? Distributed cash, you know and it's a bit of cash for more frequently, accessed data. So what it doesn't? That uses these advanced placement of techniques to determine specifically which blocks of data are going to be accessed, less frequently. And it moves those outside of the cash into S3. That's Low, Cost Storage. And then for the most frequently, accessed blocks and I will keep that in the car. You can get that in our activities
actively doing really, really smart cashing on, really large volumes of data directly inside of elasticsearch. So is itself. And the value for me, is the customers one-time got act better integration for data intelligence into the app is a machine-learning. I mean smoke. Then you would at a much lower cost. And so, if I'm a, I'm a form that's doing analysis of my security logs, and I'm only able to do it cost effectively, by looking at my security logs for the past week, I can now cost-effectively do that same analysis by looking at the security logs for
the past month, and that can actually give me the ability to identify new trends in new patterns that I wouldn't have seen some more usable actual data for the same price. It was before just in a scale. Some more scale for data, making it need to go with you at the park away. From this cost structure scale. Anything else that we should know about around Ultra warm for elasticsearch? Yeah that means the biggest thing is you know again it's it's the amount of is Halle analysis changes. You can
now go from storing just kind of a few days, maybe weeks, look up, operational data to the months of operational data at really low cost. And so, without the warm, now you can now use elasticsearch service for of broader set of use cases as well. Talk about the attack in Europe. On when I put put you in a spot every second around this new reality, right? We're in a hat scale crisis. You guys at Amazon under a lot of pressure to deliver, I talked to the folks on the EC to group. Matt Garmin came on as well. David Brown
capacity but he was compute. I got imagine is going to be a great opportunity to kind of have more data legs as for the Kendra news. General availability today I so data will be killer here feature for the future this house. To be more. Ubiquitous in terms of capability, what's your vision on this post? Pandemic and how to companies reset and reinvent take advantage of that so that they're their outcomes around the upslope post-pandemic when it's still going to be a quasi. Work-at-home more teams are going to be distributed its
virtualization model of media and life. I mean we're going to be virtualized. Yeah I think you don't like like everything we do when you think about roadmap. Is it all starts and stems from working backwards, from what our customers are looking for, and, you know, given the environment now more than ever moving to the cloud is helping customers. In a lower-cost be more agile. You know, it's Caleb scale down more, festive Lee and so it's actually accelerating the need for customers to start to use a lot of
ways to analyze the applications and how they're running, and have scaled applications are going to use a lot of my data and analytics Services as well. And so, could you need to find ways to give customers a better performance that are so listen up. I'm at Lourdes Hospital, Paducah need to be what we focus on in, and having those conversations. And what's your advice adapt? Developers out there and developers who were really going to be in the front lines? The workload. You look differently, they're going to
have more video, more data, needed more microservices, as you pointed out. So how should developers leverage and build? Great products. What's the best practice? I think for developers, just like, for us building, great products starts with working backwards, from the customer is really listening to what are the customer pain points that you're solving, how we going to solve it in a way that's unique and different and better than how it's been solved today and then being able to run that in an operationally efficient way. That's going to provide a high
quality of service and do all of those things continue to hold true. And you know, our job is to give developers the tools that they need that to help them to do that. But what else is new with you, how you, how you doing out there? Are you at Cabin Fever yet? I mean, you got the D, got all the tools with Amazon. Everyone's kind of seems like they're in there in. Okay. Mood, how you doing? Yeah, nobody willing to let you know. I'm here with my family. I have the two kids who are doing some version of remote schooling and do juggling time with the kids and and balancing that with
commitments at work, but focused on continuing to help customers as they go through this this challenging time and and so I think getting the team's aligned on you know what can we do to help them and getting all teams involved in, finding new ways to get the customers what they need is to be ongoing focus and and you know, we recently released a data Lake. A lot of information around the whole covid-19 data sets that are publicly available and what time to see customers use that in particular around the public of state to applied to do analysis on the
data, as well as goodness, you guys doing a lot of texts are good there. Congratulations, thanks for coming on in, Sharon, the insides stay safe, and I've got kids at 4, You know how hard it is. So stay safe and we'll see you soon and I will be remote for now. Q virtual here with a team is Summit 2020, online virtual. I'm John. Thanks for watching.
Buy this talk
Ticket
Interested in topic “IT & Technology”?
You might be interested in videos from this event
Similar talks
Buy this video
Conference Cast
With ConferenceCast.tv, you get access to our library of the world's best conference talks.
