We are pleased to announce the first public release of the open source BigDataEurope Platform. The platform is designed to help communities tackle societal challenges by making it faster to get started with the big data technology zoo.
The BDE platform released today is an easy-to-deploy, easy-to-use and adaptable platform for running big data frameworks and tools, either on a cluster or standalone. We selected these frameworks and tools based on the requirements gathered from the seven societal challenges. As a result, the platform can execute a variety of big data flow tasks, ranging from storage (Hive, Cassandra) and message passing (Kafka, Flume), through multi-purpose data processing and analysis (Apache Hadoop, Apache Spark, Apache Flink), to publishing (Geotriples).
Users can pull ready-to-use images from the BDE repository, or combine the components available in the repository into a dataflow pipeline that realizes a full data value chain. The platform will be refined and expanded in collaboration with its users and with the societal challenge (use case) partners. This public release marks the beginning of the operational phase, in which the use cases will be implemented and tested on the platform.
In the future, the BDE platform will introduce the concept of smart data to support a range of processing tools for the semantic web and knowledge graphs. This includes a semantic data lake, a semantic analytics stack engine, and a semantic platform for logging and integration that supports cluster resilience.
Overall, the platform lowers the barrier to entry for new big data users and for scientists from different domains, who can experiment with a variety of big data tools in a plug-and-play fashion.
See the press release for more details.