Zeppelin is a modern web-based tool for the data scientists to collaborate over large-scale data exploration and visualization projects. It is a notebook style interpreter that enable collaborative analysis sessions sharing between users. Zeppelin is independent of the execution framework… Continue Reading
Toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
Library which provides tools to construct large-scale graphs on top of Apache Hadoop.
Finagle is an asynchronous network stack for the JVM that you can use to build asynchronous Remote Procedure Call (RPC) clients and servers in Java, Scala, or any JVM-hosted language.
Palo Suite combines all core applications — OLAP Server, Palo Web, Palo ETL Server and Palo for Excel — into one comprehensive and customizable Business Intelligence platform. The platform is completely based on Open Source products representing a high-end Business… Continue Reading
SpagoBI is an Open Source Business Intelligence suite, belonging to the free/open source SpagoWorld initiative, founded and supported by Engineering Group. It offers a large range of analytical functions, a highly functional semantic layer often absent in other open source… Continue Reading
Akka is a toolkit and runtime for building highly concurrent, distributed, and fault tolerant applications on the JVM. Akka is written in Scala, with language bindings provided for both Scala and Java. In an Akka application, the basic construct is… Continue Reading
Talend simplifies big data integration with graphical tools and wizards that generate native code so you can start working with Apache Hadoop, Apache Spark, Spark Streaming and NoSQL databases today. Talend Big Data Integration platform delivers high-scale, in-memory fast data… Continue Reading
Apache Giraph is an iterative graph processing system built for high scalability. For example, it is currently used at Facebook to analyze the social graph formed by users and their connections. Giraph originated as the open-source counterpart to Pregel, the… Continue Reading
Cascading Lingual is a powerful extension to Cascading that simplifies application development and integration by providing an ANSI SQL interface for Apache Hadoop. Now you can connect existing business intelligence (BI) tools, optimize computing costs, and accelerate application development with… Continue Reading