Persisting Trained Machine Learning Models Using MsgPack

Introduction

At OpenSooq, we take AI seriously. Tensroflow and Scikit-Learn are two examples of popular packages used for machine learning, that are used in OpenSooq for different aspects of our e-commerce platform and its supporting services and moderation tools. In Supervised Machine Learning there are two phases training (fitting the model) and prediction. Training is done offline, then models get evaluated and then persisted to be deployed on production. Tensroflow models can be persisted as frozen protocol buffer files, on

Continue Reading

Mobile Push Notification Targeting Generation Using Genetic Algorithm in Classifieds

Acquiring users takes time and money, we are working hard for them. However, push notifications are the follow-up to user acquisition, to make sure that the marketing dollars stretch as far as possible. The idea comes from specifying a system that will enable the OpenSooq to profile users, segment them and target them with different parts of our platform based on a lightweight implementation.

The system is centered around the idea of generating a DNA fingerprint

Continue Reading

Building Data Visualization for SolrCloud 7.0 using Apache Zeppelin

Data Visualization allow you to natively interact with all your data via custom dashboards. Apache Zeppelin is a new and incubating multi-purposed web-based notebook which brings data ingestion, data exploration, visualization, sharing and collaboration features to Hadoop, Solr and Spark.

Apache Zeppelin turns your data into informative dashboards and reports that are easy to read, easy to share, and fully customizable.

Multi-purpose Notebook 

The Notebook is the place for all your needs

  1.  Data Ingestion
  2.  Data Discovery
  3.  Data Analytics
  4.  Data
Continue Reading

Arabic Content in Opensooq with Apache Solr

Arabic language poses several challenges faced by Natural Language Processing (NLP), largely due to the fact that Arabic language has a very rich and sophisticated morphological system. Opensooq is covering cover some of the challenges and how to solve them with Solr and will also present the challenges that were handled in Opensooq’s use case.

Opensooq shared its experience of tackling Arabic language with Solr. Arabic poses many challenges for retrieval. When dealing with social media, such as microblogs, and

Continue Reading

Opensooq Chat : The Success Story You Didn’t Know

The OpenSooq team is very careful to provide our users with special features that help them have a better experience. However, we believe in innovation. Changing times means changing how we work. We encourage innovation at all levels of our work to help us deliver more features for our users.

 

OpenSooq launched a new feature, Chat. It’s a robust tool that gives users the power to communicate real time with buyers to sell or buy faster. The days leading

Continue Reading

Solr – InMemory

Apache Solr is one of the most popular search engine nowadays, as known it’s a Java application. In this article, we will share with you the result of hosting solr data in the memory.

To achieve this experiment, we have created a tmpfs and copied Solr application ( with all data ) into the tmpfs folder. At, the same time had another instance (with same hardware and software specs) that running Solr on disk.

* Note : this is Solr 4.8 

The

Continue Reading

Running SolrCloud 5.X on Docker with 3 steps

Docker is all the rage at the moment! It was recently selected as Gartner Cool Vendor in DevOps. As you may already know, Docker is a platform to build and deploy applications as self-contained units. Those units, called containers, can be executed consistently on a developer laptop or production server. Since containers include all their dependencies, they are truly portable. And, compared to normal virtual machine images, Docker containers are much more lightweight because they don’t need as much

Continue Reading

Checking Arabic Relevance with Apache Solr

Content websites need a system to ensure the relevancy between the page category and the content that show on it. Therefore, to ensure high data quality, data warehouses must validate and cleanse incoming data from users. OpenSooq.com is one of Arabic content website that desperately needs such system, it is a leading classifieds ads website in the Middle East and North Africa. The website which is available in the Arabic language serves more than billion page views per month,

Continue Reading

DAS: Distributed Analytics System for Arabic Search Engines

Today’s successful organizations are data driven. At Opensooq we have tens of engineers, analysts, and data scientists who crunch petabytes of data everyday to provide a great experience for our users. We execute at massive scale using data to connect our millions of users in global classified.

Opensooq has published a paper in IEEE, We introduce the fault-tolerant Distributed Analytics System (DAS) for analyzing big data collected from search engines in Arabic. This system consists of three main

Continue Reading

Site Footer

@ OpenSooq 2019