

We use Superset (a visualization tool) to analyze a dataset by querying AWS Athena, with the data stored in AWS S3. The query time most often exceeded the threshold set by Superset, so the charts on Superset were not getting updated. Apart from the frustration of the charts not updating (which our Data Analysts desperately wanted), the cost was also prohibitive, as Amazon Athena's pricing is based on the bytes scanned.
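To get a feel for how bytes scanned turns into cost, here is a rough back-of-the-envelope sketch. The rate of 5 USD per terabyte and the binary definition of a terabyte are assumptions for illustration, not figures from our bill; check the current Athena pricing for your region. The function name `estimate_cost_usd` and the sample sizes are hypothetical as well.

```python
# Rough cost estimate for a scan-priced query engine such as Athena.
# ASSUMPTIONS: 5 USD per TB scanned, and 1 TB = 1024**4 bytes.
BYTES_PER_TB = 1024 ** 4


def estimate_cost_usd(bytes_scanned: int, price_per_tb: float = 5.0) -> float:
    """Return the approximate cost of a query that scans `bytes_scanned` bytes."""
    return bytes_scanned / BYTES_PER_TB * price_per_tb


# Hypothetical dashboard: 20 charts, each scanning 50 GiB per refresh.
per_refresh = 20 * estimate_cost_usd(50 * 1024 ** 3)
print(f"Estimated cost per dashboard refresh: ${per_refresh:.2f}")
```

Since the cost is linear in the bytes scanned, anything that halves the scanned data (fewer columns, partition filters, compressed columnar files) roughly halves the bill under these assumptions.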

So the question is how to optimize the queries. Here are some ways to optimize a query:

We can also see detailed information about the execution plan of a query using the EXPLAIN and EXPLAIN ANALYZE statements. Having a basic idea of the order in which the clauses of a query are executed can be helpful.
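As a minimal sketch of what that looks like, the helper below wraps a query in EXPLAIN or EXPLAIN ANALYZE so the resulting statement can be pasted into the Athena query editor or sent through whichever client you use. The helper name `explain`, the table `analytics.events`, and its columns are hypothetical and only there for illustration.

```python
def explain(query: str, analyze: bool = False) -> str:
    """Prefix a SQL statement with EXPLAIN (plan only) or EXPLAIN ANALYZE."""
    # Drop surrounding whitespace and a trailing semicolon before prefixing.
    statement = query.strip().rstrip(";").strip()
    prefix = "EXPLAIN ANALYZE" if analyze else "EXPLAIN"
    return f"{prefix} {statement}"


# Hypothetical table and columns, for illustration only.
QUERY = """
SELECT event_type, count(*) AS events
FROM analytics.events
WHERE event_date = DATE '2023-01-01'
GROUP BY event_type
"""

plan_sql = explain(QUERY)                   # shows the plan without running the query
profile_sql = explain(QUERY, analyze=True)  # runs the query and reports per-stage statistics

print(plan_sql)
print(profile_sql)
```

Keep in mind that EXPLAIN ANALYZE actually executes the query, so as far as we understand it scans data like any other run, while plain EXPLAIN only produces the plan. When reading a plan, it helps to remember the usual logical order of evaluation: FROM and JOIN, then WHERE, GROUP BY, HAVING, SELECT, ORDER BY, and finally LIMIT.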
