Posts Tagged ‘Data Lake’

Customer Value Spring-boarding from the Data Lake

Chuck Koch

Chuck Koch

Senior Manager, Data Management at Dell IT
Chuck Koch

Latest posts by Chuck Koch (see all)

The data lake has not only allowed IT to open up Big Data to a broader community of internal business users, it is now helping us channel unprecedented amounts of information to DELL EMC customers as well.

Using data lake technology, for example, IT and our DELL EMC business groups forged a groundbreaking partnership to allow customers to leverage Big Data to monitor and proactively manage their IT environments. We created a tool called MyService360, an on-line solution that gives DELL EMC Support customers and partners easier and faster access to near real-time service information. It features a personalized dashboard that provides customers with a 360-degree view of their environment and customer service experience.

Launched last May, MyService360 only scratches the service of the potential value that is expected to spring-board from leveraging Big Data in the data lake.  Having all the data in a centralized location provides easy access and gives developers and data scientists the opportunity to gain data insights that would be extremely difficult to achieve without the data lake. Those insights can then be used to create metrics that we can share to empower our customers.

(more…)

Keeping Garbage Out of the Lake: Data Governance in the Big Data World

Shahidul Mannan

Shahidul Mannan

Sr. Director, Big Data and Analytics, Dell IT

As organizations unleash the power of the data lake by providing business broader access to more and more data, they are facing a growing IT dilemma—How to keep improperly governed or poor quality data from polluting the data lake.

While IT’s traditional approach to managing data governance and quality have been quite effective over the years, the magnitude of data in today’s data lake is much larger than traditional data warehouse levels. Traditional tools and tactics are being overwhelmed by Big Data in the lake.

There are, however, strategies that organizations can use to reshape data governance and quality standards in the Big Data world. While our tactics and tools are still evolving, I will share some of the efforts we are developing at EMC IT to keep our data lake clean.

(more…)

The Power of Self-Service Big Data

Shahidul Mannan

Shahidul Mannan

Sr. Director, Big Data and Analytics, Dell IT

From using analytics to predict how our storage arrays will perform in the field, to engineering product configurations to best meet customers’ future needs, EMC is just beginning to tap into the gold mine of intelligence waiting to be extracted from our new data lake.

In fact, we are currently working on dozens of business use cases that are projected to drive millions in revenue opportunities. And we are just scratching the surface. There’s a lot more data available, more to be harvested, and more analytics to be built out as data scientists and business users hit their stride in exploring a new era of data-driven innovation at EMC.

As I noted in my earlier blog ( The Analytics Journey Leading to the Business Data Lake), EMC IT embarked on creating a data lake to transition from traditional business intelligence to advance analytics more than two years ago. A key focus of this effort was to address the fact that data scientists and business users seeking to leverage our growing amount of data were stifled by the need for such projects to go through IT, which was a costly and slow process that discouraged innovation.

We now have the foundation and tools in place to use data and analytics to create sustainable, long-term competitive differentiation. To get here, we worked closely with EMC affiliate Pivotal Software, Inc. to mature together and leverage the multi-tenancy capabilities of their Big Data Suite.

(more…)

The Data Science of Predicting Disk Drive Failures

Shiri Gaber

Shiri Gaber

Data Scientist, Dell IT
Shiri Gaber

Latest posts by Shiri Gaber (see all)

With the expanding volume of information in the digital universe and the increasing number of disk drives required to store that information, disk drive reliability prediction is imperative for EMC and EMC customers.

Information Expansion

Figure 1- An illustration of the information expansion in the last years and expected growth

Disk drive reliability analysis, which is a general term for the monitoring and “learning” process of disk drive prior-to-failure patterns, is a highly explored domain both in academia and in the industry. The Holy Grail for any data storage company is to be able to accurately predict drive failures based on measurable performance metrics.

Naturally, improving the logistics of drive replacements is worth big money for the business. In addition, predicting that a drive will fail long enough in advance can facilitate product maintenance, operation and reliability, dramatically improving Total Customer Experience (TCE). In the last few months, EMC’s Data Science as a Service (DSaaS) team has been developing a solution capable of predicting the imminent failures of specific drives installed at customer sites.

(more…)

Simplifying Customers’ Lives with EMC MyService360

Ramesh Razdan

Ramesh Razdan

Vice President, Big Data and Analytics, Dell IT
Ramesh Razdan

Latest posts by Ramesh Razdan (see all)

From taking charge of healthcare choices to customizing product purchases, today’s consumers are increasingly using self-service, social, and mobile digital capabilities. EMC’s new MyService360 now brings that same personalized, proactive service to our Online Support customers.

Powered by EMC data lake solution, MyService360 (launched at EMC World 2016 on May 2) gives EMC Support customers easier and faster access to real-time information at their fingertips. Using its easy-to-read visual and powerful analytics, customers can view analysis of code levels, health, and risk scoring on their installed EMC products, service activity views by site, incident management, and more.

(more…)

Architecting a Data Lake: Matching Technology with Your Harvesting Needs

Darryl Smith

Darryl Smith

Chief Database Architect, Dell IT
Darryl Smith an EMC Distinguished Engineer and the Chief Database Architect in EMC's IT organization. He is responsible for all databases at EMC, including one of the largest Oracle eBusiness Suites and Database Grid deployments in the world. He has been working with Oracle Databases since 1988 starting with version 4 and Oracle clustered databases since version 7. Over the past 7 years, he has helped EMC capture and document the best practices learned from managing a global deployment of Oracle Applications, Middleware and Database Grids and actively engage with EMC and Oracle customers to share EMC's experience and perform knowledge transfer.

It takes many different best-of-breed technologies to effectively harvest “game-changing” analytics value from the data lake. Getting the right architecture to navigate your data lake requires a deep understanding of both the needs of Big Data and the available technologies in order to match analytics use cases with the appropriate platforms to get results.

Do you need to analyze large amounts of data fast or process many queries simultaneously? Is the data you are using organized in columns and rows, customer records perhaps? Or are you searching document files?

Let’s look at the basics of data lake architecture, some of the technologies and tools you should consider, and how EMC IT is approaching this crucial process.

Data Lake: Core Architectures

(more…)

Marketing Science Lab is a Data Lake Pioneer

Mark Duncan

Mark Duncan

Sr. Manager, Business Intelligence — Data Lake

In the expanding world of Big Data, there is more and more information out there that can help your organization target the right customers with the most effective messages for the right products and services at the right time. EMC IT is using data lake technology to help our Marketing and Sales teams gain unprecedented insights into our customer behaviors, needs and sentiments to drive effective marketing.

At the center of this effort is our Marketing Science Lab, which provides advanced analytics support for Marketing using a shared Marketing and Sales workspace in the data lake. The Lab collaborates with Sales on shared data and models to deliver 360 views of customer behaviors by analyzing a vast array of data from internal and increasingly, external sources.

(more…)

Why a Data Lake? Keeping Up with the Digital Universe

Brahma Tangella

Brahma Tangella

Sr. Manager, Service Strategy, Dell IT

With the digital universe expected to swell to 44 zettabytes of data by 2020, today’s enterprises need a central data repository that can process increasing volumes of all types of data faster to let business users make better, real-time decisions. In short they need a stronger backbone; they need the data lake!

Not only do traditional databases constrain real-time and shared data analytics due to their siloed nature, they also lack the technology to accommodate the skyrocketing level and types of data being created at an increasing rate. After all, according to IDC research, the growing number of smart devices that analyze everything from home heating systems to consumer information will mean that within four years there will be some 7 billion connected people using an estimated 30 billion devices.

(more…)

Optimizing the Modern Data Center: The View from New Big Data Heights

David Scheffler

David Scheffler

Director, Data Center Services

From adapting energy use to maximizing data consolidation, Big Data (BD) analytics has taken the guesswork out of optimizing the modern data center.

More than ever, the modern data center is a living, changing environment, with new technologies coming in, old technologies being cycled out, and evolving energy efficiency strategies to keep it all humming. We have to make sure we have the space and power to install the latest technology, while we still have the old equipment in place.

Up until recently, orchestrating this shifting ecosystem was only partially data-driven and the rest was based on gauging changing needs from past experience. At EMC IT—like most IT organizations—we had long tracked metrics on our data center facilities, including space, power, cooling, humidity, temperature, etc. And we collected storage data—server utilization, virtual machines, growth trends. But we lacked the tools to process this vast amount of data and we were never able to aggregate this information into one data base.

(more…)

Stocking the Data Lake with Smart Data: IT-Business Partnership is Key

Mark Duncan

Mark Duncan

Sr. Manager, Business Intelligence — Data Lake

The data lake is proving to be a crucial tool as EMC IT strives to partner more closely with the business clients it serves to help them get the most out of enterprise Big Data. For example, EMC IT is offering a smart data base that lets business users across the company leverage a uniform customer profile for more efficient and effective sales analytics.

Created in collaboration with EMC Global Services, the CAP (Customer Account Profile) is based on information collected and aggregated from multiple sources to provide a holistic customer view—a single version of the truth, if you will, about our customers.

CAP is managed by IT and is one of the enterprise data sets made available via the data lake to business clients seeking to analyze customer trends, opportunities and insights.

(more…)