Exploring data using natural language ("plain English") query expressions isn't a new concept, but it has become more relevant and more feasible lately. People are used to search engines and like the metaphor as a data querying experience. Products like ThoughtSpot and AnswerRocket specialize in this teaming of search and data discovery. And the Q&A feature of Microsoft Power BI enables this, both for ad hoc queries in dashboards and even for use as an authoring tool when designing reports.
Many natural language analytics products, however, require data to be moved into their own repositories or index structures. But today, Arcadia Data is announcing a new Search feature, in the latest release of its Arcadia Enterprise product, that adapts the natural language query paradigm to work directly on top of data lakes.
The low down
In a phone briefing with Sushil Thomas, Arcadia Data's Founder & CEO, and Steve Wooledge, the company's VP of Marketing, I learned that the Arcadia Data … [Read more...] about Arcadia Data brings natural language query to the data lake
Business data lake
There has been a great deal of debate recently about how motor insurers can best clamp down on false whiplash claims. The Ministry of Justice (MoJ) has recently pledged to change the way whiplash claims are handled by setting fixed compensation amounts for claims and banning the practice of settling cases without medical proof. The Civil Liability Bill, which is still going through Parliament, is raising a number of questions about how best to tackle and discourage insurance fraud. Citing whiplash injury has long been a common way for fraudsters to extract money from car insurance providers, because whiplash can be hard to prove: it involves no broken bones and isn’t always accompanied by vehicle damage.
Is whiplash insurance fraud being properly detected?
Whilst the UK government is exploring ways in which to tackle the situation, questions are being posed around whether fraud of this kind is being adequately assessed. It’s in the highest … [Read more...] about Could data lakes hold the key to tackling insurance fraud?
Video: Having big data is not enough: Tips to turn it into a business advantage
Kaplan Test Prep is well known for helping students prepare for college-entrance exams, such as the SAT and ACT; post-grad admissions tests, such as the GRE and GMAT; and licensure exams for medical, legal, nursing, financial, and other professional careers. Unfortunately, the company wasn't making the grade when it came to using all available information for data-driven decision-making.
Founded in 1938, Kaplan has decades of historical data, scores of legacy systems and diverse applications. From 2013 to 2015, it made a methodical move to a virtual private network and cloud-based application stack on Amazon Web Services (AWS), an effort that helped Kaplan modernize infrastructure and consolidate from 12 data centers down to four. But from an analytical perspective, Kaplan continued to rely on siloed tools and reporting capabilities. It lacked a centralized store where it could consolidate and analyze data … [Read more...] about Kaplan Test Prep graduates to a cloud-based data lake
The data lake concept has been around for a while now. While definitions vary, most agree that conceptually, a data lake is a shared data environment that can handle a variety of datasets, including semi-structured and unstructured data. Companies use data lakes in a variety of ways, from IT-managed environments for data to analytically-centric platforms for data science.
Data lakes have several attributes that make them good candidates for a public cloud deployment on AWS or Azure. The data is often large in scale and, many times, is generated outside corporate firewalls, as with sensor logs or streaming data. Ease of access to a data lake can be an important requirement, another potential benefit of the cloud model. And data lakes often have uncertain growth rates: a quick success could mean immediate expansion, and scaling is more easily accomplished in public clouds.
That’s why more organizations are interested in cloud-based data lakes, aka data lakes as a service. … [Read more...] about Seven times when it’s best to use a cloud-based data lake
If there’s one key phenomenon that business leaders across all industries have latched onto in recent years, it’s the value of data. The business intelligence and analytics market continues to grow at a massive rate, with Gartner forecasting the market will reach $18.3 billion in 2017, as organisations invest in the solutions that they hope will enable them to harvest the potential of that data and disrupt their industries.
But while companies continue to hoard data and invest in analytics tools that they hope will help them determine and drive additional value, the General Data Protection Regulation (GDPR) is forcing best practices in the capture, management and use of personal data.
The European Union’s GDPR stipulates stringent rules around how data must be handled. The rules affect the entire data lifecycle: organisations must have an end-to-end understanding of their personal data, right through from its collection and processing, to storage and – finally – its … [Read more...] about Building data lakes for GDPR compliance