Social, mobile, cloud, and Internet of Things (IoT) data is proliferating. This proliferation is driving enterprises to deploy hyperscale, distributed, data-centric applications such as customer analytics, e-commerce, security, surveillance, and business intelligence. To handle the data requirements of these high-volume, high–ingestion-rate, real-time applications, enterprises are rapidly adopting massively scalable and nonrelational databases such as MongoDB.

 

MongoDB is a popular general-purpose, open-source, scale-out NoSQL database. It powers modern applications, including big data analytics applications, that require low latency for reads and writes, high availability, and advanced data management. Key use cases for MongoDB include real-time analytics; single-view product catalogs; content management; and mobile, IoT, and block chain applications.

 

This blog post summarizes the NetApp® solutions for deploying MongoDB on NetApp for cost- and space-efficient protection of MongoDB Ops Manager backups. It also describes deploying a virtualized MongoDB NoSQL database on the NetApp Data Fabric and VMware vSphere.

 

NetApp Integration with MongoDB Ops Manager

MongoDB Ops Manager provides a rich set of features for data protection. These features include the ability to create consistent point-in-time backups on local storage and cloud-based storage, making sure that no data loss occurs. Because most customers store large volumes of data in MongoDB, their backup storage can grow very large. Also, there is always a trade-off between storage size and restore time objectives. As a result, local backup retention is limited, and the cost of storing those backups in the cloud can be high.

 

The solution is deploying MongoDB Enterprise Advanced on NetApp storage and using NetApp AltaVault™ cloud-integrated storage for cost- and space-efficient protection of Ops Manager backups. NetApp AltaVault provides space-efficient local storage for retention of backups, while using lower cost cloud-based storage for longer term retention.

 

The architecture of this solution is simple and requires no changes to the Ops Manager software. A virtual machine is created using existing NetApp storage to host the AltaVault virtual appliance software, enabling AltaVault to use the configured NetApp storage as a local backup copy cache. A Network File System (NFS) share is configured on the AltaVault virtual appliance and mounted on the Ops Manager server. Ops Manager is later configured to use the mounted NFS volume as its file system snapshot store. The final step is to configure AltaVault to use Amazon S3 cloud storage.

 

As Ops Manager creates snapshots, the backup files are stored in the AltaVault storage, which acts as a local storage cache. As the files are being written, inline deduplication and compression efficiencies are applied, reducing the storage footprint of the data. The data is also encrypted while being written. AltaVault begins replicating the data to Amazon S3 storage, which it manages. (See Figure 1.)

 

Figure 1) NetApp integration with MongoDB Ops Manager

Integrating the NetApp AltaVault virtual appliance with MongoDB Ops Manager for protection of Ops Manager snapshots provides:

  • Reduced footprint of Ops Manager backups, allowing longer retention of local backup copies
  • Reduced cost for cloud-based storage with NetApp storage efficiency
  • Encryption for data security while at rest and during transmission between on-premises data center and cloud

MongoDB on the NetApp Data Fabric

NetApp has validated an end-to-end solution designed for efficiently deploying a virtualized scale-out MongoDB database on the NetApp Data Fabric and VMware vSphere. In this solution, the scale-out NetApp All Flash FAS (AFF) array hosts MongoDB virtual machines and databases. Backup and disaster recovery services are provided by a NetApp ONTAP® Cloud software-defined storage solution in Amazon AWS and NetApp Private Storage. (See Figure 2.)

 

Figure 2) MongoDB on the NetApp Data Fabric

In summary, MongoDB on NetApp provides faster time to insight, seamless cloud integration for data protection and scaling, and better availability and security than provided by commodity storage

mm

Mike McNamara

Mike McNamara is a senior manager of product and solution marketing at NetApp with over 25 years of storage and data management marketing experience. Before joining NetApp over 10 years ago, Mike worked at Adaptec, EMC and Digital Equipment Corporation. Mike was a key leader driving the launch of the industry’s first unified scale-out storage system (NetApp), iSCSI and SAS storage system (Adaptec), and Fibre Channel storage system (EMC CLARiiON ). In addition to his past role as marketing chairperson for the Fibre Channel Industry Association, he is a member of the Ethernet Technology Summit Conference Advisory Board, a member of the Ethernet Alliance, a regular contributor to industry journals, and a frequent speaker at events.