Episode
Practical approach to building real-time analytics for cybersecurity applications | Citus Con 2023
with Slava Moudry
Video of a conference talk about how to design a Postgres database schema and ingestion workflow that employs the Citus database extension to power Analytics for cybersecurity data. This talk was delivered at Citus Con: An Event for Postgres 2023 by Slava Moudry. Similar approaches can be used in other time-series applications, not just cybersecurity data.
In this talk, you will learn about:
- Advantages of Citus and Postgres over other streaming technologies
- How to choose the best data structures for table columns
- How to choose a distribution key to ensure colocation and even distribution in a Citus cluster
- Best practices with native Postgres partitioning
- Schema design to support flexible API request queries
- Optimizing data ingestion and data aggregation using Manual Query Propagation
- Keeping things running smoothly. Watching for I/O spikes. Optimizing autovacuum settings
Slava Moudry has been using Postgres to build analytics and data warehouses since 2005. Slava has worked as a Postgres DBA and a Database engineer—and has used Citus and Postgres together to build real-time analytics workflows in at least 2 different companies.
Recommended resources
- Watch more Citus Con 2023 talks
- Everything you need to know about Citus Con: An Event for Postgres 2023
- Star the GitHub repo for the Citus database extension, to show your support of Citus open source
Connect
- Citus Con An Event for Postgres | Twitter: @CitusCon
Video of a conference talk about how to design a Postgres database schema and ingestion workflow that employs the Citus database extension to power Analytics for cybersecurity data. This talk was delivered at Citus Con: An Event for Postgres 2023 by Slava Moudry. Similar approaches can be used in other time-series applications, not just cybersecurity data.
In this talk, you will learn about:
- Advantages of Citus and Postgres over other streaming technologies
- How to choose the best data structures for table columns
- How to choose a distribution key to ensure colocation and even distribution in a Citus cluster
- Best practices with native Postgres partitioning
- Schema design to support flexible API request queries
- Optimizing data ingestion and data aggregation using Manual Query Propagation
- Keeping things running smoothly. Watching for I/O spikes. Optimizing autovacuum settings
Slava Moudry has been using Postgres to build analytics and data warehouses since 2005. Slava has worked as a Postgres DBA and a Database engineer—and has used Citus and Postgres together to build real-time analytics workflows in at least 2 different companies.
Recommended resources
- Watch more Citus Con 2023 talks
- Everything you need to know about Citus Con: An Event for Postgres 2023
- Star the GitHub repo for the Citus database extension, to show your support of Citus open source
Connect
- Citus Con An Event for Postgres | Twitter: @CitusCon
Have feedback? Submit an issue here.