Veeva10mo ago

Data Scientist

China - DalianFull-Timemid

Data ScienceData ScientistDataData & AI

5 views0 saves0 applied

Apply Now

Quick Summary

Overview

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $3B in revenue in our last fiscal year with extensive growth potential ahead.

Key Responsibilities

Technical Tools

excelmysqlpostgresqlpower-bipythonpytorchsnowflaketableautensorflowcustomer-successdata-analysisdatabase-designdeep-learningsaas

At the heart of Veeva are our values: Do the Right Thing, Customer Success, Employee Success, and Speed. We're not just any public company – we made history in 2021 by becoming a public benefit corporation (PBC), legally bound to balancing the interests of customers, employees, society, and investors.

Join us in transforming the life sciences industry, committed to making a positive impact on its customers, employees, and communities.

The Role

Data scientists play a pivotal role as data-driven decision engines within Veeva OpenData.‌ We expect their work to effectively translate business teams' ambiguous requests into quantifiable data problems, leverage data analysis to provide a solid foundation for decision-making, and further improve the overall data product via new algorithms. We also expect the candidate to combine strong algorithm development capabilities with business acumen, enabling the translation of cutting-edge AI technologies into actionable business value.

The Data Scientist position is part of the Veeva OpenData Product team.‌ This role provides data-backed support to both internal and external customers while coordinating with cross-functional teams to ensure the delivery of related features and to keep product excellence.

Lead the design and iterative upgrades of data matching algorithms, including HCP matching, HCO matching, and other business scenarios involving matching

Be responsible for the design and monitoring of data validation results' storage functions in the main database

Manage internal and external data sources currently used in OpenData, including data source collection, data structure transformation, and update mechanism design

Utilize NLP, vectorization, and large language model technologies to design algorithms that address business team challenges, optimizing performance and efficiency in business scenarios

Collaborate with business teams to analyze production-related issues through data analysis and provide reasonable solutions

Consolidate tool requirements arising from data management processes, draft requirement documents, and coordinate development team resources to ensure timely implementation

At least bachelor’s degree in math, statistics, computer science or equivalent relevant working experience

5+ years of experience in data modeling or algorithm development, with complete project implementation cases

Proficient in Python/Excel, familiar with frameworks such as TensorFlow/PyTorch

Master the principles and tuning methods of algorithms such as decision trees, SVMs, and neural networks

Expertise in relational databases or data warehouse products, like MySQL, PostgreSQL, Redshift, Snowflake.

Proficient in prompt word engineering, and able to skillfully use tools such as Dify for prototype verification

Strong data sensitivity and logical analysis ability

Excellent cross-team communication and results transformation capabilities

Fluent in written English and good in oral English communications

Ability to use tools such as powerBI or tableau to present data analysis results

Pharma industry knowledge;

Master data management experience