An Apache Spark-based analytics platform optimized for Azure.
Hi @Janice Chi,
To give you more specific advice, I have a few follow-up questions. Your answers will help narrow down the recommendations:
- What data volume or load are you expecting for both batch and streaming? This affects how the components should be designed and optimized.
- Are there specific transformation operations that you find complex or error-prone in your current implementation?
- Do you have existing performance metrics from your current setup that might indicate areas for improvement?
- How critical is real-time processing for your application? Would minor delays in the streaming pipeline be acceptable?