On Forbes.com CTO John De Goes: Hadoop Isn’t A Requirement For Big Data Projects
SlamData CTO John De Goes on Forbes: Have Your Cake And Eat It: Big Data Without Hadoop.
From the article:
Today’s NoSQL databases share many characteristics with Hadoop, but in some cases, they are easier to manage and develop for. So if you’re about to embark on a big data project, it makes sense to investigate at lease the leading NoSQL contenders. Because for data hubs, IoT and real-time analytics uses cases, one of these may be a wiser choice than a Hadoop-built solution.
But there are a few use cases where NoSQL databases are stronger. Among them are:
Data Hubs: If you need to consolidate information from a variety of sources of data (particularly online data), NoSQL databases are a perfect fit. Common examples of this include creating a single view of customer or patient data, or linking together data from lots of different departments inside a larger organization. The insurance company MetLife uses a NoSQL database in this fashion, and I have worked with several others companies doing the same.
IoT: The Internet of Things is rife with devices and sensors that create large amounts of data whose structure can change and whose primary purpose is rollup for monitoring, alerting and large-scale analytics. NoSQL databases excel at this and can handle incredible volumes of data with ease. Engineering firm Bosch is one company using NoSQL database technology for IoT.
Real-Time Analytics: Most NoSQL databases have the ability to perform real-time aggregation on streams of data (for example, clickstream or log data), which can make them an excellent choice for high-volume, low-value data that needs to be pre-aggregated before analysis. Content delivery networks Buffer and MaxCDN both leverage NoSQL tech for real-time analytics.
John is part of the Forbes Technology Council, an invitation-only organization comprised of elite CIOs, CTOs and technology executives.
Latest posts by Chris Dima (see all)
- SlamData Secures $6.7MM Series A to Support Modern Data in the Enterprise - February 22, 2017
- SlamData 4.1 Includes New Charts, Updated Connectors for Couchbase, MarkLogic and Spark/Hadoop - January 26, 2017
- SlamData Makes the List: 2017 DBTA Trend-Setting Products - December 14, 2016
Native Analytics On MongoDb, Couchbase, MarkLogic and Hadoop.
No mapping. No ETL.
Recent News & Blogs
Boulder, Colo., February 23, 2017 – SlamData Inc., the leading open source analytics company for modern unstructured data today announced that it has raised a $6.7M Series A funding round, led by Shasta Ventures. The investment will drive further development of the firm’s breakthrough analytics solution: a single application for natively exploring, visualizing and embedding analytics against unstructured data sources including NoSQL, Hadoop, and cloud API’s.read more
SlamData just released its first update of 2017, SlamData 4.1.1. It delivers a number of new UI enhancements, performance improvements, new charts, as well as commercial releases for the Couchbase, MarkLogic and Spark/Hadoop connectors.read more
Welcome to the SlamData getting started video. Let’s jump right in. By default, SlamData runs on port 20223. You can change the port it runs on by modifying the quasar-config.json file. By default, this file is located in the following directories on Windows, Mac and Linux.read more
To help shed light on where customers can go to address their data-driven challenges, Database Trends and Applications magazine assembles an annual list of solutions…read more
The following is an interview I conducted with Jeff Carr, CEO and Founder of SlamData regarding the trends in enterprise business intelligence.read more
Enterprise Business Intelligence solutions are failing, and the reasons are very obvious. Leading analyst firms including Gartner and G2 have published rankings which show virtually no leadership and scant challengers in the market.read more
Who Is Using SlamData?
Whitepaper: The Characteristics of NoSQL Analytics Systems
by John De Goes, CTO and Co-Founder of SlamData
Semistructured data, called NoSQL data in this paper, is growing at an unprecedented rate. This growth is fueled, in part, by the proliferation of web and mobile applications, APIs, event-oriented data, sensor data, machine learning, and the Internet of Things, all of which are disproportionately powered by NoSQL technologies and data models.
This paper carves out a single concern, by focusing on the system-level capabilities required to derive maximum analytic value from a generalized model of NoSQL data. This approach leads to eight well-defined, objective characteristics, which collectively form a precise capabilities-based definition of a NoSQL analytics system.
These capabilities are inextricably motivated by use cases, but other considerations are explicitly ignored. They are ignored not because they are unimportant (quite the contrary), but because they are orthogonal to the raw capabilities a system must possess to be capable of deriving analytic value from NoSQL data.
Table of Contents
- The Nature of NoSQL Data
- NoSQL Databases
- Big Data
- A Generic Data Model for NoSQL
- Approaches to NoSQL Analytics
- Coding & ETL
- Real-Time Analytics
- Relational Model Virtualization
- First-Class NoSQL Analytics
- Characteristics of NoSQL Analytics Systems
- Generic Data Model
- Isomorphic Data Model
- Unified Schema/Data
- Polymorphic Queries
- Dynamic Type Discovery & Conversion
- Structural Patterns