Free Data — and where to find it

Free data is everywhere! Firms publish anonymized data on kaggle.com, governments participate in open data initiatives, and banks open their payment transaction data to start-ups so that they can offer new services to customers. Big firms like Facebook promote the data age…

Read more
Code

Analyse new Datasets automatically with R-Markdown

When you get your hands on a new dataset, describing the variables automatically can be extremely useful. Their distribution, quality (missing values) and correlation with other variables determine their suitability for different statistical methods. As mentioned in the R-intro there are…

Read more
Product Development in a Digital Economy

I recently read the book “How Google Works” by Eric Schmidt and Jonathan Rosenberg. In addition to brilliant company and industry insights, they advertise the ease of product development in a digital economy. In short, they see the availability of data…

Read more
10 Data Science Use-Cases

10 Data Science Use-Cases in Firms

Data science is literally everywhere. Many firms look for skilled personnel to cover the subject in their company, even if they understand very little about it. I see a lot of job offers, and talk to a lot…

Read more
Google Search

Webscraping Google Search Results

In this short post, I will show you how to access the number of Google search results using webscraping. The code is written in R. Be aware of rate limits so that you do not get blocked, and read Google’s Terms of…

Read more
Decision Tree for Wine Data

Decision Trees

Decision trees have been around since the 1970s. They are applied to many problems in modern machine learning and have different implementations. Why should you use trees? Well, decision trees are celebrated for their interpretability. You can explain the decision, why…

Read more
Fitted-data.com

Review Tableau

Communicating your results to management is a key skill for a data scientist. There are many tools on the market which can make your life much easier. One of these tools is Tableau. I will review it regarding its…

Read more
Data Science in Companies

How to become a Data Scientist

Data scientists are in high demand. Data are generated everywhere: by your smartphone or TV, car sensors, machines, and every camera. And the amount of data will rise tremendously (Industry 4.0, Internet of Things, etc.). Some companies already invest money and…

Read more
Will Artificial Intelligence replace us?

Before the Industrial Revolution began in the 18th century, more than half of all Europeans earned their living working in the fields. James Watt’s steam engine changed this dramatically in the 18th and 19th centuries. A cheaper and more…

Read more

Sentiment Analysis using Facebook Data

Unstructured data, here textual data, remain a big thing for the major players. Apple, Amazon and Microsoft use deep neural nets trained on huge amounts of data obtained through their services and the web itself. Another example comes from politics.…

Read more
Opinions about Data Science on fitted-data.com

Econometrics vs. Data Science

I studied statistical models as an econometrician, meaning I was trained to care a lot about assumptions and potential selection biases in the data. The final aim was to find causation. In applied data science, people put the emphasis on predicting class…

Read more
by Hadley Wickham and others at RStudio.

R Installation

R is an extremely popular programming language for statisticians, social scientists and data scientists. It ranks 5th among the most popular languages in general, is open source, and is full of helpful, mostly user-written packages. When you want…

Read more