Data Engineer

Preference for Location: London, UK
Number of vacancies: 1

About Pangaea Data Limited

Pangaea Data (Pangaea) is a South San Francisco and London-based business founded by Dr Vibhor Gupta and Prof Yike Guo (Director, Data Science Institute at Imperial College London; Provost, Hong Kong University of Science and Technology). They have worked in medicine and computing for over 20 years and have raised over $300 million through their academic research, including a recent $110 million grant for development work on large language models in medicine. Pangaea’s product platform enables financially sustainable and scalable collaborations between the healthcare and pharmaceutical industries to help find more undiagnosed and miscoded patients across 7,000 hard-to-diagnose conditions. Pangaea’s advisors include industry veterans from healthcare and the life sciences, including Lord David Prior (former chairman, NHS England) and Mr. Andy Palmer (former CIO, Novartis). Pangaea was recognized as ‘Digital Solution of the Year’ 2023 by the UK government and awarded top tier co-sell partnership status by Microsoft, a status held by only 25 of the 30,000 companies Microsoft works with globally.

The Role

As a Data Engineer, you will need to understand both product data APIs and client data APIs in order to develop robust data integration, perform data analytics, and ensure Pangaea’s product is compatible with each client’s data environment. Knowledge of databases, data science and AI will be critical.

Key technical responsibilities will include:

  • Design, develop and test robust, flexible, efficient and secure product data APIs.
  • Design, develop, test and deploy robust, flexible, secure and computationally efficient data integration pipelines (including conversion, pre-processing, loading, post-processing, etc.) based on client data APIs (data specifications, databases) so that the product is compatible. Such pipelines may require customisation per client requirements.
  • Perform remote/local data analytics based on client data samples, product logs and overall system performance to obtain insights about data and product improvements.
  • Clearly communicate product data APIs to the clients, and clearly understand and communicate client requirements and client data specifications to the internal technical team.

Requirements

Personal traits:

  • A strong intuition for what makes products a joy to use.
  • Empathy for how different users will need different things out of a product at different stages, and how to effectively serve these different needs in one product.
  • Strong communication and mediation skills.
  • Strong people skills and the ability to engage all levels of the organization (especially the front line).
  • Ability to work collaboratively in a team environment.
  • Ability to communicate complex ideas effectively, both verbally and in writing, in English.
  • A strong software engineering background, in order to understand how the user-facing product ties into backend and architectural decisions.

Technical skills:

  • A university qualification (Bachelor's, Master's or Doctorate), with at least two years of university study completed, in Computer Science, Informatics, Engineering or a related field.
  • Experience (classroom/work) in databases, different data specifications/models/APIs, data analytics and data science.
  • Experience with general-purpose programming languages such as Python, C++ or Java.
  • Experience working in Linux.

Nice to Have

  • Experience with deep learning, machine learning and NLP frameworks such as PyTorch (or TensorFlow), Hugging Face Transformers and scikit-learn.
  • Relevant work experience, including internships, full time industry experience or as a researcher in a lab.
  • Experience with cloud platforms such as AWS, Azure, Google Cloud Platform.

Perks and Benefits

  • Flexible working hours.
  • Salary depending on experience.
  • Benefits include private medical insurance, life insurance and travel cards.
  • You would join a small, dedicated and fast-growing team.
  • You will have the opportunity to learn about building a startup business from experienced professionals and serial entrepreneurs.
  • We are currently supported by serial entrepreneurs and angel investors. You will have the opportunity to experience an investment life cycle for a startup and meet leading venture capitalists.
  • We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Contact Details

Please email your CV to careers@pangaeadata.ai outlining your relevant experience.

General Information

Pangaea Data’s headquarters is in London (UK) with teams in San Francisco (US) and Hong Kong. For more information please visit www.pangaeadata.ai.

Pangaea Data is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, colour, sex, sexual orientation, gender identity or expression, religion, national origin or ancestry, age, disability, marital status, pregnancy, protected veteran status, protected genetic information, political affiliation, or any other characteristics protected by local laws, regulations, or ordinances.