Building Data Visualization for SolrCloud 7.0 using Apache Zeppelin

Data Visualization allow you to natively interact with all your data via custom dashboards. Apache Zeppelin is a new and incubating multi-purposed web-based notebook which brings data ingestion, data exploration, visualization, sharing and collaboration features to Hadoop, Solr and Spark.

Apache Zeppelin turns your data into informative dashboards and reports that are easy to read, easy to share, and fully customizable.

Multi-purpose Notebook 

The Notebook is the place for all your needs

  1.  Data Ingestion
  2.  Data Discovery
  3.  Data Analytics
  4.  Data
Continue Reading

Checking Arabic Relevance with Apache Solr

Content websites need a system to ensure the relevancy between the page category and the content that show on it. Therefore, to ensure high data quality, data warehouses must validate and cleanse incoming data from users. OpenSooq.com is one of Arabic content website that desperately needs such system, it is a leading classifieds ads website in the Middle East and North Africa. The website which is available in the Arabic language serves more than billion page views per month,

Continue Reading

DAS: Distributed Analytics System for Arabic Search Engines

Today’s successful organizations are data driven. At Opensooq we have tens of engineers, analysts, and data scientists who crunch petabytes of data everyday to provide a great experience for our users. We execute at massive scale using data to connect our millions of users in global classified.

Opensooq has published a paper in IEEE, We introduce the fault-tolerant Distributed Analytics System (DAS) for analyzing big data collected from search engines in Arabic. This system consists of three main

Continue Reading

Site Footer

@ OpenSooq 2019