Adir Mashiach · Apache Spark: 5 Performance Optimization Tips · Interesting and specific lessons learned from experience · 5 min read · Jul 31, 2019
Adir Mashiach · Partition Management in Hadoop · Our solution to the Hadoop small files problem · 9 min read · May 27, 2020
Adir Mashiach · Defend Your Infrastructure — Handling 3,000 Hungry Users · Why is it so important to track your users’ queries, and how we do it? · 5 min read · Feb 14, 2019
Adir Mashiach · Firebolt — The new kid on the (data warehousing) block · A short description of Firebolt, for not-so-technical people · 3 min read · May 23, 2021
Adir Mashiach · SPOT: Is Spotify a good stock to buy? · A human-readable stock analysis, from a rational perspective · 6 min read · Dec 26, 2020
Adir Mashiach · Impala Discussion With The Product Manager (Greg Rahn) · Q&A session on specific issues that bothered us · 5 min read · Aug 31, 2018
Adir Mashiach · 5 Main Missing Features in Impala (Opinion) · A letter to the developers and product manager of Impala · 5 min read · Aug 15, 2018
Adir Mashiach · Hotspotting In Hadoop — Impala Case Study · Why Small Frequently-Queried Tables Shouldn’t Be Stored In HDFS? · 4 min read · Apr 13, 2018
Adir Mashiach · Partition Index - Selective Queries On Really Big Tables · How to make your selective queries run 100x faster? · 4 min read · Apr 1, 2018
Adir Mashiach · Apache Impala: My Insights and Best Practices · How did we make our Impala run faster? · 7 min read · Mar 20, 2018