Mabro1
4 Beryllium

Ask the Expert: EMC Isilon Scale-out Data Lake

Welcome to the EMC Ask the Expert discussion on Isilon following the EMC Redefine Possible announcement

YOU MAY ALSO BE INTERESTED ON THESE ATE EVENTS...

Ask the Expert: Are you ready to manage deep archiving workloads with Isilon’s HD400 node and OneFS ...

Ask The Expert – Isilon’s New Releases: IsilonSD Edge, OneFS.NEXT and CloudPools

https://community.emc.com/thread/216708

EMC Isilon provides an enterprise grade scale-out data lake to help protect, manage and secure all unstructured data. We are reinforcing our data lake with new announcements that include – 2 new platforms, new solutions, new access methods and SmartFlash Flash as Cache. Join us to learn more about our strategy and what’s new at Isilon.

Your hosts:

profile-image-display.jspa?imageID=10823&size=350

Nicholas Kirsch is the Chief Technology Officer and Vice President of the Isilon Storage Division at EMC. His primary focus is extending EMC Isilon's lead in Scale-Out NAS products, technologies and market solutions. Nick is currently responsible for both product and technology strategy for Isilon's integrated storage appliance and the OneFS distributed file system. He also drives advanced development and strategic acquisitions.

Nick joined Isilon Engineering in 2002 as a Software Engineer for OneFS before serving as the Director of Software Engineering through 2007. He built and led Isilon's Product Management organization as Senior Director of Product Management through 2012. Nick holds Bachelor of Science degrees in Computer Science and Mathematics from the University of Puget Sound and a Master's degree in Computer Science from the University of Washington.

This discussion will take place July 8 - 25.

Share this event on Twitter:

>> Join Ask the Expert, EMC Isilon scale-out Data Lake http://bit.ly/1mGPlqD #EMCATE <<

23 Replies
Jim_Cahill
1 Copper

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

The term data lake is new to me. Can you briefly describe it in the context of the storage industry, in general, and specifically in the context of Isilon scale-out NAS. Thanks!

Highlighted
MRWA
2 Iron

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

Hello Jim, I just noticed a Data Lake white paper that was posted yesterday:

Hope that helps

-Michael

0 Kudos
nick_kirsch
1 Nickel

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

Jim,

(I copied this from the RSVP thread - hopefully you saw it there earlier!)

My apologies on the delay - we just finished the Seoul, Korea edition of the MegaLaunch and had a wonderful time redefining possible. As I see it in the most general context, a data lake is a shared storage infrastructure which enables a multitude of different applications and workloads to interact seamlessly. This naturally applies primarily to unstructured data (since storage systems can have knowledge of this information) and demands scalable technologies (since scale of performance and capacity are both critical requirements.)

In the context of Isilon’s Scale-Out data lake, we indeed provide a large scalable storage infrastructure for unstructured data - and more specifically, we provide seamless and shared access to applications which communicate via NFS, SMB (Windows), FTP/HTTP, HDFS (Hadoop), and (coming soon) OpenStack Swift. This is an extremely powerful combination of access methods, as it enables applications designed and written for a variety of purposes to co-exist peacefully - or more interestingly, interact (without data movement or additional copies.) Imagine (as an example) logging network access via traditional UNIX applications over NFS, using Hadoop MapReduce to find potential intrusion points, and generating graphical reports that can be viewed via a Windows workstation.

Added benefits of Isilon’s approach to the data lake is that the enterprise capabilities around storage, security, performance, and information management can all be applied uniformly to any or all of the application data. A few fun examples include the ability to provide secure multi-tenancy for Hadoop applications, de-dupe between Openstack Swift objects and NFS files, and see advanced performance details on simultaneous access (across all these protocols) to the same files!

I could go on, but I hope this explanation provides a strong foundation and sparks more curiosity.

All the best,

Nick

Jim_Cahill
1 Copper

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

Thanks, Michael!

0 Kudos
Jim_Cahill
1 Copper

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

Thanks, Nick. Very helpful.

0 Kudos
lnicholes
1 Nickel

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

Hi Nick!

It sounds like with Isilon advancements in Data Lake infrastructure, HDFS, and the upcoming Openstack Swift that unstructured data is a real growth area. Does this introduce any security complexities? What are some of the features Isilon has to help in this area?

0 Kudos
mhiers
1 Nickel

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

What new features in OneFS 7.1.1 are customers most excited about and which will help them swim, not sink, in the Data Lake?

shellies
1 Copper

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

Hi Nick

What kind of performance improvements can we expect from the SmartFlash feature? Also, in what situations would GNA be a better choice for performance over SmartFlash and vice versa?

jeenak
1 Copper

Re: Ask the Expert: EMC Isilon Scale-out Data Lake

How can customers best leverage the OneFS API to automate system configuration tasks?

0 Kudos