Archive for the ‘Big Data’ Category

Enterprise Information Retrieval: Build vs. Buy Approach

The ease with which we have long been able to retrieve information from the World Wide Web (WWW) using increasingly efficient, high-quality search engines underscores the less-than-impressive performance of search engines serving the enterprise environment. Off-the-shelf tools that let organizations retrieve their enterprise information just do not give us the same experience as Google or Bing. But what if you could build your own enterprise information retrieval system by leveraging open source tools and platforms?

In this blog, we will explore the feasibility of doing just that.

(more…)

Optimizing the Modern Data Center: The View from New Big Data Heights

David Scheffler

Director, Data Center Services

From adapting energy use to maximizing data consolidation, Big Data (BD) analytics has taken the guesswork out of optimizing the modern data center.

More than ever, the modern data center is a living, changing environment, with new technologies coming in, old technologies being cycled out, and evolving energy efficiency strategies to keep it all humming. We have to make sure we have the space and power to install the latest technology, while we still have the old equipment in place.

Until recently, orchestrating this shifting ecosystem was only partially data-driven; the rest was based on gauging changing needs from past experience. At EMC IT, as at most IT organizations, we had long tracked metrics on our data center facilities, including space, power, cooling, humidity and temperature. We also collected storage data: server utilization, virtual machines, growth trends. But we lacked the tools to process this vast amount of data, and we were never able to aggregate the information into one database.

(more…)

Converged vs. Hyper-Converged Infrastructure: Getting the Right Mix

Srini Maguluri

Consultant Architect — Office of Architecture and Innovation

Converged infrastructure (CI)—pre-engineered and deeply integrated blocks of compute, storage and network that deliver mission-critical performance offered as a turnkey solution—has been a game changer in helping IT keep pace with rapidly evolving business demands. And now a more agile technological cousin, called hyper-converged infrastructure (HCI), offers similar plug-and-play efficiencies using building blocks in smaller, more flexible chunks.

So when should you use converged infrastructure and when is hyper-converged technology a better option? The answer depends on what IT workloads you are running, how much resiliency is required, and the need for guaranteed performance versus agility and scalability.

(more…)

Stocking the Data Lake with Smart Data: IT-Business Partnership is Key

Mark Duncan

Sr. Manager, Business Intelligence — Data Lake

The data lake is proving to be a crucial tool as EMC IT strives to partner more closely with the business clients it serves to help them get the most out of enterprise Big Data. For example, EMC IT is offering a smart database that lets business users across the company leverage a uniform customer profile for more efficient and effective sales analytics.

Created in collaboration with EMC Global Services, the CAP (Customer Account Profile) is based on information collected and aggregated from multiple sources to provide a holistic customer view—a single version of the truth, if you will, about our customers.

CAP is managed by IT and is one of the enterprise data sets made available via the data lake to business clients seeking to analyze customer trends, opportunities and insights.

(more…)

Unlock the Textual Content in Your Data Lake

Dr. Raphael Cohen

Principal Data Scientist, EMC IT

Wouldn’t it be great if you could analyze all customer interactions and learn which parts of our services or sales are better than others? Or analyze all of our service request textual descriptions and infer the call volume drivers? Understand the main topics of a chat session? Use the same data to understand how customers are actually using our products? Or go beyond customer interactions and help us identify the common bugs in our code by analyzing the text engineers type into a bug tracking system such as Jira or Bugzilla?

Liberating your data is not enough if a big chunk of it remains locked in human-generated texts.

EMC’s Data Science as a Service team has created a highly advanced text analytics technology that can help your organization unlock the value in human-generated texts.

(more…)

The Business Data Lake from a Data Scientist Perspective

Oshry Ben-Harush

Data Science Manager, EMC IT

The Business Data Lake (BDL) is positioned as the one-stop shop for all of the organization’s (big) data storage and analytics requirements. It is intended to address the three V’s of Big Data analytics – Volume, Variety and Velocity – by providing a vast amount of storage and by ingesting streaming data, mini-batches and batches, whether structured, semi-structured or unstructured. It fundamentally shifts the paradigm in business data storage and analytics by consolidating the multiple silos of data that can be found in organizations today.

(more…)

The Analytics Journey Leading to the Business Data Lake

Shahidul Mannan

Sr. Director, Big Data and Analytics, Dell IT

More than ever, businesses see their futures tied to their ability to harness the explosive growth in data. You may even be familiar with the Business Data Lake concept—a central repository of vast information which can be used across an enterprise to drive all business intelligence, advanced analytics and even, eventually, intelligent applications.

We, at EMC IT, are in the process of creating a Business Data Lake, and I will be sharing insights about our efforts in this blog. To start, let’s trace the vision that’s leading EMC IT and other businesses to the shores of this new data landmark.

(more…)

Predictive Analytics for IT Operations: Continuing the Journey

Bhanu Dhanaraj

Sr. Manager, Enterprise Analytics, EMC IT

Viktor Mayer-Schonberger and Kenneth Cukier, authors of Big Data: A Revolution That Will Transform How We Live, Work and Think, wrote, “If big data teaches us anything, it is that just acting better, making improvements – without deeper understanding – is often good enough.”

EMC IT not only recognizes the hidden value of Big Data but also strives to generate better outcomes from it, so that we can act better and faster to improve our customers’ experience.

In his November 2013 article, Dan Inbar from EMC’s IT organization eloquently presented what IT has been doing to improve the operations of our Exchange email environment. PAITO (Predictive Analytics for IT Operations) is our Big Data analytics solution for outage prediction that allows our IT operations team to collect, analyze, store, and leverage key indicators to predict and prevent interruption in mission-critical operations. The journey that started more than a year ago as a pilot has evolved into a full-fledged IT data lake and analytics platform for various IT managed areas, including applications, servers, devices, licenses, network, storage, security and workloads. (more…)

Assessing Data Loss Costs: Value-Driven Protection of the Bottom Line

In an age when most companies invest to become data-driven, the value of data is increasingly a key criterion for making IT decisions, and the protection of that data becomes paramount to those decisions.

When making backup-related decisions, price justification involves the potential capital loss to the organization when a data loss or unavailability occurs. Understanding the value of data and access to that data is key when prioritizing backup technology or even for deciding which infrastructure to protect during a cyber-attack. However, estimating this price is not trivial.

I recently worked on a research project with a team of academic partners at Ben-Gurion University for prioritizing data replication to minimize the monetary loss in the case of a disaster. The method we derived can limit the costs of data loss, and could provide a high return on investment (ROI) of up to one million dollars per incident.

(more…)

Your Data Protection Strategy: An Evolving Business/IT Conversation

Paul Gogan

Manager, Cloud Platform Protection and Storage, EMC IT

Creating a data protection strategy for your organization is a little bit like selecting the right insurance policy for your home. It isn’t the flashiest of endeavors and nobody likes paying those insurance premiums, but when a hurricane rips the roof off your house, you’re glad that you took the time to do it right.

Structuring your data protection strategy is not exclusively an IT decision. It’s primarily a business decision involving a range of stakeholders, not just IT. IT’s role is to provide the products, solutions and processes to execute that strategy based on the value of the data and the objectives of the business.

Data protection is not a one-size-fits-all process, as we in EMC IT have come to learn. The following are best practices and lessons learned that EMC IT uses to create and maintain our data protection strategy.
(more…)
