We have 80+ Airflow DAGs and they're a mess. Some DAGs take 6 hours when they should take 30 minutes. We have dependencies that are poorly documented, tasks that silently fail, and retries that flood our database.
We need an Airflow specialist to:
1. Audit our existing DAGs and identify performance issues
2. Refactor the 10 most critical DAGs
3. Establish patterns and guidelines for our data engineering team
4. Set up proper alerting and SLA monitoring