Sr. Data Engineer + Data Scientist / Machine Learning Engineer
Cybertec Ins
2021-12-03 08:53:56
Washington, District of Columbia, United States
Job type: all
Job industry: Science & Technology
Job description
Hello, I hope you are all doing well. Please let me know if you are looking for a change and are interested in this role.
Position: Sr. Data Engineer/Machine Learning Engineer
Location: 100% remote
Duration: Contract
Interview: Phone and Skype
Description: Please note that this role sits on an AI/ML team, but it is primarily focused on building data pipelines using tools like Spark, Python, Qubole, Airflow, Hadoop, Hive, and ETL. It is not an ML Engineer role doing ML algorithms and modeling. The engineer will be optimizing data pipelines to improve the search experience. This is definitely a Data Engineer working with the data science teams, so any ML exposure and an understanding of how data pipelines impact ML are helpful. I think a high-level understanding of ML or Computer Vision is all that is needed.
Description: Getty is embarking on its next wave of innovation in visual storytelling and in how to put the perfect image or video in our customers' hands, be it for a society-changing headline or a brand's next big campaign, truly moving the world with images. We are looking for a Data Engineer to create and optimize data pipelines for a new AI/ML team focused on meaningfully impacting Getty's search experience, be it for personalization, diversifying search results, or enabling customer exploration and discovery. We are looking for a creative and curious data engineer specializing in building and maintaining data pipelines for a data science team. You'll have the opportunity to lay the foundation for data pipelines specific to machine learning. You'll have access to a growing, rich dataset of the most trusted, esteemed, and diverse visual content in the world, with over 250 million award-winning images and videos encompassing the latest global news coverage, from red carpet events to football stadiums to conflict zones; exclusive conceptual creative images; and the world's largest commercial archive.
The metadata on our content is human-judged and curated by our creative researchers with unmatched expertise. With a global presence, our search interaction data comes from over 50 million unique visitors a quarter from almost every country in the world.
What you'll be doing:
Define, architect, develop, and deploy infrastructure for large-scale ETL pipelines with data processing frameworks that lay the foundation for robust, production-level data science models in our products.
Collaborate with other technology teams, including data platform, search engineers, and machine learning engineers, to handle a wide variety of sources of structured and unstructured data, and integrate solutions into our engineering stack.
Interface with data scientists, machine learning engineers, software engineers, and product managers to understand data needs.
Bring a 201-level understanding of Machine Learning or Computer Vision.
Keep up to date on emerging best practices in data engineering, continuously evaluating and providing guidance on the use of new technologies that lay the foundation for data engineering best practices.
We'd love to hear from you if:
You have a Bachelor's degree or higher in a quantitative/technical field (e.g., Computer Science, Statistics, Engineering, Natural Sciences, Information Management, Mathematics, etc.). If you are self-taught and believe you are a good fit for this role, or have significant work experience in data engineering or database engineering, we would love to hear from you as well.
You have experience building streaming and batch data pipelines and are comfortable working within a large-scale distributed environment with tools such as Spark, Python, Qubole, Airflow, Hadoop, and Hive.
You have hands-on experience in custom ETL design, implementation, and maintenance.
You have previous experience working with data scientists, machine learning engineers, and data analysts.
You have hands-on experience writing complex, highly optimized queries across large datasets.
You are comfortable in dynamic, ambiguous environments.
You have a strong understanding of current data engineering tools and industry standards.
You have excellent communication skills. You are a good listener, open to many diverse voices and perspectives. You are transparent, trustworthy, and honest.
You can independently execute on a project, from ideation to delivery to stakeholders, and can proactively interact with other engineers at Getty Images to access necessary resources or data.