DAX Fully Covered

We are happy to announce that the companies from Germany’s biggest index DAX are now all available on SimFin (for example Volkswagen).
It took a bit longer than expected to get everything exactly the way we wanted but we think it was worth it, as the addition of other companies from Germany or around the world should be much faster now. This is due to the fact that our machine learning models were not used to work with data from markets other than the US, so we checked all the data points extensively and made a lot of manual corrections at…


New US Data Crawler

We recently completed a big update to our US data crawler, completely replacing our old technology that relied on crawling the semi-structured XBRL data provided by the SEC. As detailed in some of our previous blog posts, the XBRL data has unfortunately quite a few problems: missing data points, companies reporting “weirdly” in XBRL, making it very hard to rebuild the original statements and also the XBRL data only starts at around 2009.

Our initial idea to solve these problems was to not look at the XBRL data at all, and instead crawl the original reports which the SEC is…


Web API v2

We just released the second iteration of our web API, you can find the new documentation here.

If you don’t have an API key yet, head over to SimFin and get one.

The new web API is much faster and offers a lot more functionality, for example it’s now possible for all queries to specify the ticker instead of the SimFin ID. It’s also possible now to retrieve multiple tickers and time periods at once (although this is reserved for SimFin+ users), what makes the data retrieval process very quick.

We also improved the format of the data returned from…


We just released our new bulk download and along with it our new Python API. There are various tutorials already available on Github that range from the basics of using the new Python API to statistical analysis and machine learning. All tutorials are available as Jupyter Notebooks on Google Colab so you can try everything in your browser without having to download anything.

In this blog post I’m going to show you quickly how we structured the new bulk download and what you can do with our new Python API.

Bulk Download File Structure

[If you mainly care about what amazing things you can…


Snapshot of how our PDF extractor perceives a table

We are happy to finally introduce SimFin Fuse 2.0, which you can find here. It has been online for the last two months already, but we had to improve some more things and didn’t feel it was time yet for an official presentation, so this will now be it’s official introduction.

SimFin 2.0 is the last big step towards our goal of increasing our data quality and expanding our data beyond the US market, as it combines our PDF crawling with the PDF extraction and manages the upload of the structured data to SimFin (it can also still process XBRL…


We are happy to announce that the first version of our PDF Extractor is now online. You can read more about the idea behind the PDF extraction in our previous posts (here and here), so this post will focus instead on the current state of the extractor, how it works and what the next steps are.

The PDF Extractor

You can find the PDF extractor if you head to the SimFin PDF Library and click on “open” in the “Extracted data” column for annual or quarterly reports. The extractor currently focuses solely on tables with numeric data, as these contain almost all the…


Dear SimFin users and everyone in the financial community,

We just released a new update on SimFin that includes our new PDF library, check it out here: https://simfin.com/pdf/library

What is the PDF library and how do we build it?

The PDF library aims to collect financial PDFs of all listed companies around the world, with a focus on annual/quarterly reports but also earnings releases, presentations and earnings call transcripts. The library can be accessed by anyone and it is the first time that all financial reporting is made available openly in a single place.

The PDFs in our library are crawled using our open source PDF crawler, if you want to…


Update: This tutorial is for our low-level web-API. For our new Python API, click here.

There were a few requests from SimFin users to provide a more detailed introduction on how to use the API, so this tutorial will hopefully make things more clear for people that don’t have extensive API experience yet.

We’ll take Python to exemplify the process, all the code along with one example for R can be found on Github: https://github.com/SimFin/api-tutorial

Get a SimFin API key

Register on SimFin and head to the API page to get your API key: https://simfin.com/data/access/api

You will need this key later in order to make…


In this post I want to update you on our PDF crawling and extraction efforts, the first step towards hosting data not only from US based companies but from any listed company around the world.

How we gather data currently

In case you don’t know yet, we currently get all the fundmental data on SimFin from the SEC database, that offers fundamental data for US companies in a machine readable format called XBRL. While it’s great that the SEC is offering this, the XBRL format still has a lot of problems, more than ten years after it’s been launched. …


Dear SimFin users, and those that might become them,

in this post I want to tell you a bit more about our new API as well as our decision to start offering new premium accounts called “SimFin+ accounts”. I will also tell you a bit more about the future plans of SimFin and how this endeavour is currently being run.

First of all, our foremost aim is still to make fundamental financial data freely available to everyone, and this is not going to change. …

SimFin

https://simfin.com — making financial data openly accessible

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store