On May 2021, LinkedIn releases a time-series forecasting library, Greykite to simplify prediction process for its data scientists.
The Greykite library is an open source Python library developed to support LinkedIn’s forecasting needs. LinkedIn developed GreyKite to support its team make effective decisions based on the time-series forecasting models. The primary forecasting algorithm used in this library is Silverkite, which automates the forecasting.
The Silverkite model has many pre-tuned templates (i.e. parameter configs) to fit for different forecast frequencies, horizons, and data patterns. Besides Silverkite, it also includes an interface for the Prophet model developed by Facebook. …

Prophet is a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data.
Prophet is open source software released by Facebook’s Core Data Science team. It is available for download on CRAN and PyPI.
Heart disease is the leading cause of death worldwide, accounting for one third of deaths in 2019. Heart disease cases nearly doubled over the period, from 271 million in 1990 to 523 million in 2019, and the number of heart disease deaths rose from 12.1 million to 18.6 million.
Coronary Heart Disease (CHD) involves the reduction of blood flow to heart muscle due to build-up of plaque in the arteries of the heart.
One of the most challenging task is to identify the causes of this disease and prevent it to the extent possible. Medical diagnostic reasoning is becoming popular…
Logistic Regression is a Machine Learning classification algorithm that is used to predict the probability of a categorical dependent variable. It’s an extension of the linear regression model for classification problems. Unlike linear regression which outputs continuous number values, logistic regression transforms its output using the logistic sigmoid function to return a probability value which can then be mapped to two or more discrete classes.
In this section we’re trying to learn more about :

Given data on time spent studying and exam scores. …
Identification of customers based on their choices is an important strategy in any organization. This identification may help in approaching customers with specific offers. An organization with a large number of customers may experience difficulty in identifying and placing into a record each customer. A huge amount of data processing and automated techniques are involved in extracting insights from the large information collected on customers. Clustering method can help to identifying the customers based on their key characteristics.
Clustering is a set of techniques used to partition data into groups, or clusters. Clusters are loosely defined as groups of data…
In this article, we’re trying to make route using QGis, R, Open Trip Planner (OTP) and Open Source Routing Machine (OSRM). The cases that we’re trying to do are:
Open Trip Planner (OTP) is a family of the open source software projects that provide passenger information and transportation network analysis services. The core server-side Java component finds itineraries combining transit, pedestrian, bicycle, and car segments through networks built…
The CausalImpact R package implements an approach to estimating the causal effect of a designed intervention on a time series. Basically, it builds a Bayesian structural time series model based on multiple comparable control groups (or markets) and uses the model to project (or forecast) a series of the baseline values for the time period after the event. The model used to estimate how the response metric might have evolved after the intervention, if the intervention had not occured.
Traditionally the causal impact of something can be obtained through the use of randomized experiments, but in many cases doing a…
Causal inference is branch of statistics that’s concerned about the effects with the consequences of our actions and that’s really important because identifying one causal law in our data can be more powerful than dozens of correlational patterns that we might find.
The main standard method for estimating causal effects is a randomized experiment. But it’s in lots of situations we can’t run a randomized experiment because it’s too difficult or unethical or because assembly wasn’t done.

This is the example of the time series ‘clicks’, it’s has erratic patterns trends (day of week, month of year, etc). We want…
A stock (also known as company’s ‘equity’) is a financial instrument that represents ownership in a company or corporation and represents a proportionate claim on its assets (what it owns) and earnings (what it generates in profits) — Investopedia

The stock market is a market that enables the seamless exchange of buying and selling of company stocks. Every stock exchange has its own stock index value. The index is the average value that is calculated by combining several stocks. This helps in representing the entire stock market and predicting the market’s movement over time. The stock market can have a…

Abnormal User Behavior dapat didefinisikan sebagai tingkah laku ‘abnormal’ yang dilakukan oleh pengguna suatu platform.
Pada tulisan ini, akan dibahas tentang salah satu abnormal user behavior pada platform Shopee, yang selanjutnya akan didefinisikan sebagai Order Brushing.
Order Brushing merupakan sebuah teknik/ cara yang digunakan oleh penjual untuk membuat suatu pesanan palsu guna meningkatkan penjualan atau meningkatkan rating suatu item yang ada di tokonya tersebut.
Order Brushing dapat diilustrasikan sebagai berikut :
Dimisalkan terdapat sebuah item ‘Best Seller’ yang dijual oleh sebuah toko. Akan tetapi terdapat keanehan pada detail data item tersebut, dimana ditemukan bahwa sebagian besar order/ pesanan untuk item tersebut…
