Big data shouldn’t be an area for only academics, data scientists, and other specialists. In fact, it can’t be. If we want big data to benefit industry at large, it needs to be accessible by mainstream information workers. Big data technology must fit into the workflows, habits, skill sets, and requirements of business users across enterprises. Datameer is a big data analytics application doing exactly that. Combining the user interface metaphors of a file browser and a spreadsheet, Datameer runs natively on open source big data technologies like Hadoop and Spark, while hiding their complexity and facilitating their use in enterprise IT environments and business user scenarios. In other words, Datameer creates an abstraction layer over open source big data technologies that integrates them into the stable of platforms and toolchains in use in enterprise business environments. Business users tap the power of big data analytics through a familiar spreadsheet workbook … [Read more...] about Harness Hadoop and Spark for user-friendly BI
Standalone cluster mode
Internet of things (IoT) data promises to unlock unique and unprecedented business insights, but only if enterprises can successfully manage the data flowing into their organizations from IoT sources. One problem enterprises will encounter as they try to elicit value from their IoT initiatives is data drift: changes to the structure, content, and meaning of data that result from frequent and unpredictable changes to source devices and data processing infrastructure. Whether processed in stream or batch form, data typically moves from source to final storage locations through a variety of tools. Changes anywhere along this chain -- be they schema changes to source systems, shifts in the meaning of coded field values, or an upgrade or addition to the software components involved in data production -- can result in incomplete, inaccurate, or inconsistent data in downstream systems. The effects of this data drift can be especially pernicious because they often go undetected for long … [Read more...] about Tame unruly big data flows with StreamSets
From its humble beginnings in the AMPLab at U.C. Berkeley in 2009, Apache Spark has become one of the key big data distributed processing frameworks in the world. Spark can be deployed in a variety of ways, provides native bindings for the Java, Scala, Python, and R programming languages, and supports SQL, streaming data, machine learning, and graph processing. You’ll find it used by banks, telecommunications companies, games companies, governments, and all of the major tech giants such as Apple, Facebook, IBM, and Microsoft.Out of the box, Spark can run in a standalone cluster mode that simply requires the Apache Spark framework and a JVM on each machine in your cluster. However, it’s more likely you’ll want to take advantage of a resource or cluster management system to take care of allocating workers on demand for you. In the enterprise, this will normally mean running on Hadoop YARN (this is how the Cloudera and Hortonworks distributions run Spark jobs), but … [Read more...] about What is Apache Spark? The big data analytics platform explained
Is Google a good choice for communication? With the exception of email, Google is not considered a leader in person-to-person communication apps and services. Skype, owned by Microsoft, tops the video calling space. Facebook dominates social media. Slack leads business chat-room messaging. WhatsApp rules mobile messaging. And besides the big platforms from the big companies, hundreds of startups have created appealing and innovative communications apps and services.Google lags for a variety of reasons. Among these are confusion and uncertainty -- confusion about which app to use, uncertainty about whether Google will terminate any given product.Google offers 11 communications apps and services. Alphabetically, these are: Allo, Chat, Gmail, Google+, Groups, Hangouts, Inbox, Messenger, Duo, Project Fi and Voice. If you look at the various communication actions you might want to take -- voice calls, video calls, email, text messaging and social posting -- Google has at least two offerings … [Read more...] about Google’s myriad messaging apps: Which are best for you?
The Samsung Gear S3 is finally here and it's an improvement over its predecessor in many areas, although, as we found in our review of the watch, it could have been a lot better, given the one year gap between the two. Still, the Gear S3 has some new tricks up its sleeve, such as a built-in GPS for independent tracking and navigation, which developers are only now beginning to take advantage of. Samsung's latest smart wearable will get better over time, at least in terms of third-party app support, but if you got your Gear S3 just recently and are wondering were to begin, here is a list of our current favorite apps for the smartwatch! Best apps for the Samsung Gear S3 Best apps for the Samsung Gear S3 1. Speedometer for Gear Samsung yesterday launched an official speedometer app for the Gear S3 that uses the built-in GPS to track your speed in real time. The app includes convenient graphs of your top speeds, and records the distances and durations of your travels. The app is, of … [Read more...] about Best apps for the Samsung Gear S3