Data Engineer

What you’ll do:

Success in this role is determined by meeting these key objectives: 

[30%] Build data pipelines and tools that reduce the time and risk associated with monthly financial reporting and other finance processes. 

Reconciling data across accounts and systems is currently a time-consuming, mostly manual process. You will combine data from multiple sources in Databricks and automate reconciliation tasks.

[20%] Build datasets and dashboards that make financial data more transparent, actionable, and accurate.

Only top-level financial metrics are currently available in dashboards. You will make detailed financial data easily accessible for decisions and analysis, and automate data quality tests to validate key financial metrics.

[20%] Increase data available for analysis and ML models by ingesting new data sources.

Our data lakehouse does not currently include data from all internal data sources, such as our HR system. You will build pipelines to make this data available and useful. 

[30%] Increase speed of development and reduce infrastructure failures and cost.

On-call rotation duties: monitor pipeline jobs, review and test new pipeline PRs, track usage and cost.
Add new data quality checks and alerting.
Proactively identify and implement improvements to data infrastructure, as well as the tools used by data scientists and analysts to develop, deploy, and share their work.
Leave our systems and processes better than you found them.

What you’ll bring:

Exceptional alignment with GiveDirectly Values and active demonstration of our core competencies: emotional intelligence, problem solving, project management, follow-through, and fostering inclusivity. We welcome and strongly encourage applications from candidates who have personal or professional experience in the low-income and/or historically marginalized communities that we serve.
Language Requirement: English
Language Preferences: No additional language preferences
Critical thinking and an analytical approach, necessary to develop technical solutions that scale and are resilient to changes over time
Entrepreneurial mindset and stakeholder management skills required to identify, design, and execute technical solutions that solve important, ambiguous organizational problems
Python, SQL, and Spark expertise, along with the core competencies required to ship high-quality data pipelines and data tools fast
Extensive experience with Databricks preferred; experience with Tableau is a plus
Intellectual humility, curiosity, and a commitment to being part of an exceptional team

Apply via:

job-boards.greenhouse.io