MULTIVAC PLATFORM

the platform of platforms!

LARGE-SCALE SCIENTIFIC DATA AT YOUR FINGERTIPS

It's All About Data!

Multivac is a data-centric platform developed by the Complex Systems Institute of Paris Ile-de-France (ISC-PIF). It loves data! It hosts more than 11 billion documents and counting, stored in large-scale databases and search engine clusters.

Multivac Platform covers a wide range of topics to serve most social science use cases. Several ISC-PIF initiatives, such as Climate, Risk and Politic, have been built on and are powered by Multivac Platform.

Multivac Platform updates its datasets in real time and simultaneously gives access to them in real time. Real-time streaming is essential to most of our scientific projects, and we hope it makes the same difference for other scientists.

- Global Pulse Education: +2B
- Global Pulse Employment: +2B
- Music dataset: +1B
- Geo-tagged dataset: +1B
- Public stream dataset: +1B
- IT logs dataset: +800M
- Risk dataset: +200M
- Scientific dataset: 71M
- Climate dataset: +70M
- Political dataset: +50M
- News dataset: 35M
- Wikipedia dataset: +12M

World beyond data

Multivac Platform is a platform of platforms! It offers a scientific toolbox for diving into large-scale data for discovery and exploration: dashboards, a powerful API engine and a Hadoop cluster.

MULTIVAC DASHBOARDS

Create beautiful visualisations with Multivac Dashboards. They let you discover and explore your topics and subjects of interest within 11 billion documents!

In version 2.0 you will be able to save your dashboards and export the visualisations and results.

Please use the little demo on your right to see what you can expect in the near future! ;)

Note: Here we use the Multivac suggestion API to build an autocomplete based on the English Wiktionary dataset. The demo also visualises the very same query within the Web of Science dataset, which holds 52 million metadata records.
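
As a rough illustration of the autocomplete described in the note, the sketch below builds one suggestion URL for each prefix a user types. The endpoint and the `q`, `lang` and `count` parameters are taken from the API examples on this page; the helper name `build_suggest_url` is hypothetical, and no request is actually sent.

```python
from urllib.parse import urlencode

# Endpoint as it appears in the API examples on this page.
SUGGEST_ENDPOINT = "https://api.iscpif.fr/v2/pub/wikitionary/suggest"


def build_suggest_url(prefix, lang="en", count=10):
    """Return the suggestion URL for the text typed so far (hypothetical helper)."""
    query = urlencode({"q": prefix, "lang": lang, "count": count})
    return f"{SUGGEST_ENDPOINT}?{query}"


if __name__ == "__main__":
    # One URL per keystroke, as an autocomplete widget might request them.
    for typed in ("cli", "clim", "climate"):
        print(build_suggest_url(typed))
```

A real widget would usually wait for a short pause in typing before sending each request, so that the API is not called on every keystroke.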


MULTIVAC API ENGINE

Multivac Platform offers a complete set of REST APIs for communicating with its data repositories. This makes it easy to retrieve not only the raw data, but also aggregated and processed results.

Multivac uses Swagger to design, build and document its RESTful APIs. Swagger is a powerful open source framework backed by a large ecosystem of tools. Multivac also follows the OpenAPI Initiative (OAI) to standardise how its REST APIs are described.
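
To give a feel for what such a description looks like, here is a minimal sketch of a Swagger 2.0 document for the suggestion endpoint, written as a Python dictionary. It is not Multivac's actual specification: the host, the path and the `q`, `lang` and `count` parameters come from the examples on this page, while the title, version, parameter types, required flags and response text are assumptions.

```python
import json

# Hypothetical, hand-written Swagger 2.0 description; NOT the real Multivac spec.
suggest_spec = {
    "swagger": "2.0",
    "info": {"title": "Wiktionary suggestion API (illustrative)", "version": "2.0"},
    "host": "api.iscpif.fr",
    "basePath": "/v2",
    "schemes": ["https"],
    "paths": {
        "/pub/wikitionary/suggest": {
            "get": {
                "summary": "Suggest Wiktionary entries for a query",
                "produces": ["application/json"],  # assumed response format
                "parameters": [
                    {"name": "q", "in": "query", "type": "string", "required": True},
                    {"name": "lang", "in": "query", "type": "string", "required": False},
                    {"name": "count", "in": "query", "type": "integer", "required": False},
                ],
                "responses": {"200": {"description": "Matching suggestions"}},
            }
        }
    },
}


def query_parameters(spec, path, method="get"):
    """List the query parameter names declared for one operation."""
    operation = spec["paths"][path][method]
    return [p["name"] for p in operation["parameters"] if p["in"] == "query"]


if __name__ == "__main__":
    print(json.dumps(suggest_spec, indent=2))
    print(query_parameters(suggest_spec, "/pub/wikitionary/suggest"))
```

Tools in the Swagger ecosystem read a description of this kind to generate interactive documentation and client code.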

Take a look at the examples below to see how you can integrate Multivac APIs into your code.

This is just an example of how to integrate the Multivac Wiktionary API into your code!


// JavaScript (jQuery): request up to 10 English suggestions for "climate"
var settings = {
  "async": true,
  "crossDomain": true,
  "url": "https://api.iscpif.fr/v2/pub/wikitionary/suggest?q=climate&lang=en&count=10",
  "method": "GET",
  "headers": {
    "cache-control": "no-cache"
  }
};
// Log the suggestion response once the request completes
$.ajax(settings).done(function (response) {
  console.log(response);
});

<?php
// PHP (HttpRequest class from the pecl_http 1.x extension): same suggestion request
$request = new HttpRequest();
$request->setUrl('https://api.iscpif.fr/v2/pub/wikitionary/suggest');
$request->setMethod(HTTP_METH_GET);

// Query parameters: search term, language and maximum number of suggestions
$request->setQueryData(array(
  'q' => 'climate',
  'lang' => 'en',
  'count' => '10'
));
$request->setHeaders(array(
  'cache-control' => 'no-cache'
));
try {
  $response = $request->send();

  echo $response->getBody();
} catch (HttpException $ex) {
  echo $ex;
}

# Python (requests library): same suggestion request
import requests

url = "https://api.iscpif.fr/v2/pub/wikitionary/suggest"
# Query parameters: search term, language and maximum number of suggestions
querystring = {"q": "climate", "lang": "en", "count": "10"}
headers = {"cache-control": "no-cache"}

response = requests.get(url, headers=headers, params=querystring)
print(response.text)

# Shell (GNU Wget): same suggestion request, printed to standard output
wget --quiet \
  --method GET \
  --header 'cache-control: no-cache' \
  --output-document - \
  'https://api.iscpif.fr/v2/pub/wikitionary/suggest?q=climate&lang=en&count=10'

MULTIVAC HADOOP

MULTIVAC Hadoop Cluster

Multivac offers a large-scale Hadoop cluster with over 700 CPU cores, a terabyte of memory and 100 terabytes of HDFS storage. Multivac users can request an ISC-PIF LDAP account to connect to an intermediary server that acts as a gateway.

Users can submit jobs to the Multivac Hadoop Cluster using Apache Spark (Scala, Python and R) and Apache Hive (big SQL databases).
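
As a sketch of what submitting a Spark job might look like from the gateway, the snippet below assembles a `spark-submit` command line for a YARN-managed cluster. That the Multivac cluster schedules jobs through YARN, the script name `word_count.py` and the resource figures are all assumptions made for illustration; nothing is executed.

```python
import shlex


def spark_submit_command(script, num_executors=4, executor_memory="4g", script_args=()):
    """Build (but do not run) a spark-submit invocation for a YARN cluster."""
    return [
        "spark-submit",
        "--master", "yarn",            # assumption: jobs are scheduled by YARN
        "--deploy-mode", "cluster",    # run the driver on the cluster, not the gateway
        "--num-executors", str(num_executors),
        "--executor-memory", executor_memory,
        script,
        *script_args,
    ]


if __name__ == "__main__":
    # word_count.py is a hypothetical PySpark script.
    command = spark_submit_command("word_count.py", num_executors=8)
    print(shlex.join(command))
```

Building the command as a list keeps each argument separate, so it can be passed to `subprocess.run` without shell quoting issues.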

MULTIVAC Hadoop Notebooks

Multivac also offers Hadoop Notebooks through Hue. Users can log in with the same account and analyse data on the Multivac Hadoop Cluster.

MULTIVAC Hadoop Open Data

Multivac is committed to opening its datasets on the Multivac Hadoop Cluster in two formats: raw JSON and big SQL tables (Apache Hive). Users can easily run their jobs against Multivac Public Data!
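
To illustrate the kind of job one might run against the raw JSON data, here is a small stand-alone sketch that counts records per language. It assumes one JSON object per line and a `lang` field; both the layout and the field names are hypothetical, and a real job would run on Spark or Hive rather than in plain Python.

```python
import json
from collections import Counter


def count_by_field(lines, field):
    """Count JSON records by the value of one field, skipping blank lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        counts[record.get(field, "unknown")] += 1
    return counts


if __name__ == "__main__":
    # Made-up sample records standing in for a raw JSON dataset.
    sample = [
        '{"id": 1, "lang": "en", "text": "climate change"}',
        '{"id": 2, "lang": "fr", "text": "changement climatique"}',
        '{"id": 3, "lang": "en", "text": "risk"}',
    ]
    print(count_by_field(sample, "lang"))
```

On a Hive table, the same aggregation would be a `GROUP BY` query over the corresponding column (table and column names again being hypothetical).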

Multivac is a state-of-the-art platform, created with a vision of how Big Data can assist scientists.

PROJECTS POWERED BY MULTIVAC PLATFORM

Here we showcase some of our scientific projects that were built on and are powered by Multivac Platform. Through the Multivac API Engine, these projects have access to both raw and aggregated data.

How to access

Who can access

- Access is limited to ISC-PIF residents and partners. Multivac Platform is in beta!

What you can and cannot do


CAN

- Access Multivac Dashboards
- Access the Multivac API engine (a secret token is required for private REST APIs)
- Access the Multivac Hadoop cluster (only available to partners and residents)
- Access Multivac Data for Hadoop (only available to partners and residents)

CANNOT

- Access any raw data! You always get filtered, limited and aggregated results
- Directly access any database, search engine or other technology. Every request MUST go through the Multivac API engine
- Please do not ask for database dumps or entire datasets! :-)
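
For the private REST APIs mentioned in the CAN list, a client has to attach its secret token to each request. How Multivac expects the token to be sent is not documented here, so the sketch below assumes a standard `Authorization: Bearer` header; the header scheme, the helper name and the example URL are all hypothetical. The request is only constructed, not sent.

```python
import urllib.request


def build_private_request(url, token):
    """Prepare a GET request carrying a secret token (assumed Bearer scheme)."""
    if not token:
        raise ValueError("a secret token is required for private APIs")
    request = urllib.request.Request(url, method="GET")
    request.add_header("Authorization", f"Bearer {token}")  # assumed header
    request.add_header("Cache-Control", "no-cache")
    return request


if __name__ == "__main__":
    # Placeholder URL and token; replace with the values issued to you.
    req = build_private_request("https://api.example.org/private/resource", "MY_SECRET_TOKEN")
    print(req.get_method(), req.full_url)
    print(req.get_header("Authorization"))
```

Keep the token out of source control, for example by reading it from an environment variable.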



Access to Multivac Platform

Request access to Multivac Platform

The Crew

Maziyar Panahi (CNRS, ISC-PIF)

Multivac project manager,
head of Big Data infrastructure

David Chavalarias (CNRS/EHESS, ISC-PIF/CAMS)

Scientific lead, data mining & visualisations

Host Institutions

Sponsors



Community Users


Technology Stack