Tutorial Presto 8.8 May 2026

Before attempting to install version 8.8, ensure your workstation meets the baseline requirements. Because this is legacy software, compatibility is key.


Create iceberg.properties:

connector.name=iceberg
hive.metastore=file
iceberg.file-format=parquet
iceberg.catalog.type=hadoop
iceberg.warehouse=/data/iceberg/warehouse

| Feature | Description | How to use |
|---------|-------------|-------------|
| Geospatial functions | Support for Bing tile and H3 tile systems | SELECT h3_geohash(ST_Point(-74.006, 40.7128)) |
| Array / Map improvements | array_intersect(), map_zip_with() | SELECT array_intersect(ARRAY[1,2], ARRAY[2,3]) |
| Improved memory tracking | Better query memory accounting to avoid OOM | Monitor query.max-memory-per-node in config |
| AWS Glue catalog | Direct integration with Glue (not just Hive metastore) | Set hive.metastore=glue in hive connector properties |


Presto 8.8 is a client-side application used primarily to establish secure VPN tunnels to private networks. It is often utilized by government agencies, educational institutions, and large corporations to allow employees to work remotely. It creates a secure "tunnel" that encrypts data traffic between your computer and the organization's internal network.


Create etc/node.properties:

node.environment=production
node.id=presto-master-01
node.data-dir=/var/presto/data
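
In a multi-node setup, node.environment is expected to match across the cluster while node.id should differ on every node. A minimal sketch of rendering the file per host is shown here; the function name render_node_properties and the random-UUID fallback are illustrative assumptions, not part of the Presto distribution.

```python
import uuid


def render_node_properties(environment, data_dir, node_id=None):
    """Return the text of a node.properties file.

    If node_id is omitted, a random UUID is used (an assumption made for
    this sketch). Generate it once and keep the resulting file, so the id
    stays the same across restarts.
    """
    if node_id is None:
        node_id = str(uuid.uuid4())
    lines = [
        f"node.environment={environment}",
        f"node.id={node_id}",
        f"node.data-dir={data_dir}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Reproduces the example values used in this tutorial.
    print(render_node_properties("production", "/var/presto/data", "presto-master-01"), end="")
```

Passing an explicit node_id reproduces the file above; omitting it yields a fresh identifier for each call, which is only appropriate when the file is written once per host.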

Problem: CBO not picking broadcast join
Solution: Ensure stats exist and are recent:

SHOW STATS FOR hive.default.customers;
-- Check that nulls_fraction and distinct_values_count are populated (not NULL)

Problem: Join order still suboptimal
Solution: Force the join strategy manually if needed (as a fallback):

SET SESSION join_distribution_type = 'BROADCAST';
SELECT ...

Problem: Statistics not loaded
Solution: Recollect table statistics:

ANALYZE hive.default.sales;
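
The SHOW STATS check above can be automated once the result rows are fetched by a client. The following is a minimal sketch that assumes each row arrives as a dict keyed by the SHOW STATS output columns (column_name, distinct_values_count, nulls_fraction, row_count) and that the table-level summary row has a NULL column_name. The helper names and the sample rows are hypothetical.

```python
def columns_missing_stats(rows):
    """Return the names of columns whose CBO-relevant statistics are NULL.

    The summary row (column_name is None) carries only row_count, so it
    is skipped here and checked separately by has_row_count().
    """
    missing = []
    for row in rows:
        name = row.get("column_name")
        if name is None:
            continue
        if row.get("distinct_values_count") is None or row.get("nulls_fraction") is None:
            missing.append(name)
    return missing


def has_row_count(rows):
    """Return True if the summary row reports a non-NULL row_count."""
    return any(
        row.get("column_name") is None and row.get("row_count") is not None
        for row in rows
    )


if __name__ == "__main__":
    # Hypothetical rows, shaped like SHOW STATS output for illustration only.
    sample = [
        {"column_name": "id", "distinct_values_count": 1000.0, "nulls_fraction": 0.0, "row_count": None},
        {"column_name": "customer_name", "distinct_values_count": None, "nulls_fraction": None, "row_count": None},
        {"column_name": None, "distinct_values_count": None, "nulls_fraction": None, "row_count": 1000.0},
    ]
    print(columns_missing_stats(sample))
    print(has_row_count(sample))
```

A non-empty result from columns_missing_stats, or a False from has_row_count, suggests the optimizer has too little information to estimate join sizes for that table.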

Start the server as a background daemon:

bin/launcher start

Check status:

bin/launcher status

Expected output: Running as [PID].
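
For scripted health checks, the status line can be parsed to extract the PID. This sketch assumes the output has already been captured as a string (for example with subprocess) and follows the "Running as [PID]" form shown above; the function name and the "Not running" sample text are assumptions.

```python
import re


def parse_launcher_status(output):
    """Return the PID as an int if the output reports a running server, else None."""
    match = re.search(r"Running as (\d+)", output)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    print(parse_launcher_status("Running as 12345\n"))
    print(parse_launcher_status("Not running\n"))
```

Returning None rather than raising keeps the helper easy to use in a polling loop that waits for the server to come up.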

Use EXPLAIN to see CBO decisions:

EXPLAIN (TYPE DISTRIBUTED, FORMAT JSON)
SELECT c.customer_name, sum(s.amount)
FROM hive.default.sales s
JOIN hive.default.customers c ON s.customer_id = c.id
WHERE s.order_date = '2024-01-15'
GROUP BY c.customer_name;

Look for: