Responsibilities
- Create and maintain optimal data pipeline architecture
- Extraction, transformation, and loading of data from a wide variety of data sources using Python and SQL
- Providing users access to datasets using REST and Python APIs
- Categorizing, cataloging, cleansing, and normalizing datasets
- Communicating with business users and technology stakeholders.
Requirements
- Python and data analysis libraries (Pandas, NumPy, SciPy)
- Relational SQL database development
- Unix/Linux command-line experience
- Object-oriented languages: Java, C++, etc.
- AWS cloud services: EC2, RDS, Athena, Lambda, etc
- RDBMS: SQL Server and PostgreSQL
- Identity and Access Management: Kerberos, OAuth 2.0, LDAP
- Other: Apache HTTP Server, Kafka, Snowflake
- Broad understanding of fixed income, derivatives, futures, FX, or other financial-services instruments