š° #32 Please donāt become the next AltaVista, TikToks Algorithm, Snowflakes Difference; ThDPTh #32 š°
Why your company probably will end up on a graveyard of data companies, how TikTokās algorithm works and why snowflake seems to be different than the other databases.
If youāre reading this via email, then congrats, youāre one of the first 100 subscribers of this newsletter. Small but excellent seems to be a good mantra there.
Data will power every piece of our existence in the near future. I collect āData Pointsā to help understand & shape this future.
If you want to support this, read it.
(1) TikToks Algorithm
Why is TikTokās algorithm so good? This is a lovely video explaining some ideas behind the algorithm. WSJ simply created hundreds of fake accounts, watched videos, and then tried to reverse engineer the algorithm. Their key insight: the algorithmās most important input is how long you ālinger over/ rewatch a specific videoā. So, why is it in this newsletter? Because I had a few thoughts watching this video:
The algorithm is really successful.
Itās apparently pretty simple.
It could be much more successful (if it wasnāt optimized for short-term wins, but long-term wins, which I think a lot of the criticism is about. Call me naive, but I think every company should want healthy, functioning & well-informed customers in the long run).
Itās apparently easy to reverse engineer.
Putting 1ā3 aside, think about (4). This algorithm seems easy enough to reverse engineer, even though bytedance seems to think this is an important piece of intellectual property. I think the reason is that machine learning algorithms are seldom engineered with the goal of being hard to reverse engineer. So if you do truly think an algorithm is an important piece of intellectual property, maybe you should think about the chance of reverse engineering as well.
A Wall Street Journal investigation found that TikTok ā¦
www.wsj.com Ā ā¢Ā Share
(2) Why Snowflake is different
This article takes a stab at explaining why Snowflake is fundamentally different then whatās happening at Azure or AWS. Since no one of us has a good view into the sources of these solutions, itās only a guess, but the story makes sense.
The basic idea is that Redshift is Postgres + massive parallel processing, but still Postgres. Whereas Snowflake truly was built to decouple storage from compute. No matter where these products stand, what is true is that Snowflake focuses on decoupling storage from compute. Thatās a key difference from what e.g. Redshift does. On AWS itās a feature, on Snowflake, itās a USP.
The #1 Reason Snowflake is Different | by Doug Foo | Geek Culture | Medium
It intrigues me how a small startup like Snowflake can almost come out of stealth modeā¦.
medium.com Ā ā¢Ā Share
š®š®š® Data Company CornerĀ š®š®š®
Stuff that might be interesting for anyone at the front line of the data world, inside a data company, inspired by much positive feedback from my article on commercial open source software data companies.
(3) Graveyard of SearchĀ Engines
A point Iām trying to make for some time now is that in the data space, you very likely have to embrace open-source. In the open-source & data space, youāre headed into a winner takes all market.
But that means you are not building āanother ETL toolā or āanother CDP solutionā. Youāre either building the only oneāāāthe winner, or a company headed for bankruptcy sometime in the next 10ā15 years.
I really like the search engine market analogy because it displays this dynamic very well. The search engine market is much older than either the data or the open-source market, and developed a very clear structure:
One winner: Google
Two follow-ups because they enforce monopolies (Yandex & Baidu)
One well-differentiated player, duckduckgo, with an uncertain future (Google did try to take down Baidu, but got stopped, I am not sure whatās really stopping Google from taking down duckduckgo once it becomes big enough).
Thatās it, thatās 99% of the complete search market. Here you can find everyone else:
7 Search Engines Google Obliterated
Remember when we had search engines ā multiple? Take a trip down memory lane.
www.searchenginepeople.com Ā ā¢Ā Share
And as a side note: The CMS market looks kind of similar, dominated by one huge open-source player.
š In Other News &Ā Thanks
Thanks for reading this far! Iād also love it if you shared this newsletter with people whom you think might be interested in it.
P.S.: I share things that matter, not the most recent ones. I share books, research papers, and tools. I try to provide a simple way of understanding all these things. I tend to be opinionated. You can always hit the unsubscribe button!
Data; Business Intelligence; Machine Learning, Artificial Intelligence; Everything about what powers our future.
In order to unsubscribe, click here.
If you were forwarded this newsletter and you like it, you can subscribe here.
Powered by Revue