WE SPECIALISE IN FINDING FANTASTIC OPPORTUNITIES
FOR DIGITAL AND DATA SPECIALISTS WITH THE MOST INNOVATIVE BUSINESS ACROSS EUROPE AND THE USA.
US$300000.00 - US$500000.00 per annum + Equity + Benefits
16 days ago
(REMOTE - US ONLY) Sr. NLP Research Scientist (ML/DL/GPT-3)
In 2020 Darwin Recruitment placed x3 of the best NL Researchers in the USA into an early-stage
start-up. Since then this company has increased their customer based from 300,000 to over 4+
million people, have tripled their revenue and are about to secure $20M in series A funding, of
which a large percentage of this investment will go into R&D for GPT-3.
The Founder reached out to us to let us know that they were looking for another Sr. NLP
Research Scientist who has worked on cutting edge language modelling, has translation
expertise, deep learning and someone that can train GPT-3 from scratch without screwing it up
(anyone who knows will know that this is costly process and requires precision to save both time
This is a growth role that gives you huge influence and the long-term opportunity to grow a pod
amounts some of the best Researchers in the industry. In the short term however, you need to
be comfortable rolling up your sleeves and getting your hands dirty!
· Doctorate/Masters degree required in Computer Science, or Equivalent Experience.
· 7+ years of experience leading software development projects with a distinguished track record on technically demanding projects.
· Extensive experience building and training GPT-3 models. Knowing how the nuts and bolts of them work.
· 3+ years of strong experience in NLP domain.
· Solid understand of AI/ML domain and hand on experience in building new models and deploying to production.
· Has worked with deep sequence to sequence models in the context of translation or similar tasks.
· Strong statistical analysis skills and demonstrated experience in deriving insights from unstructured data.
· Is able to think creatively about novel datasets and architectures.
· Good command over English language.
Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.
Do you have experience applying NLP to the pharmaceutical space? Are you proficient in Python with experience in understanding existing complex code? Have you developed machine learning models at scale from inception to business impact? My client, an NYC based drug discovery firm located in NYC is looking for an NLP Scientist with experience in Natural Language Understanding applied to information extraction to join their team an as Associate Director - NLP Scientist. You're main responsibility in this role will be to build and optimize the unstructured data ingestion pipeline that underlies their pharmaceutical analysis engine. Here's a deeper insight into what you'll be doing day to day… Information extraction and linkage: - Implement/improve state-of-the-art Named Entity Recognition for multiple entities leveraging multi-task learning - Establish relationship extraction between entities with classification of relationship types - Perform topic modeling and documents classification - Implement/improve state-of-the-art Named Entity Normalization to link entities to the right entry in our Knowledge Graph - Leverage graph algorithms for insight extraction and to enhance NLP - Support the deployment of our NLP models into our production ETL Insight extraction: - Leverage the information extracted from documents to design algorithms to alert analysts of important catalyst events - Apply a combination of NLP models to perform analysis for the various stakeholders (drug discovery, competitive intelligence, etc…) To be considered you must have: - Master's degree plus 5+ years' work experience (PhD plus 2+ years' experience preferred) - Experience working in the pharmaceutical industry or bio-engineering is essential - Experience working in Linux environment and using GitHub - Experience reproducing published results and improving on them (using either TensorFlow 2, Keras or PyTorch) - Experience working with NLP frameworks (e.g., NLTK, spaCy) and pre-trained models (e.g., BERT) is a plus - Mathematical and statistical understanding of mainstream NLP and machine learning techniques (e.g., TF-IDF, CRFs, similarity measure) - Successful experience in collaborating on code development - Experience in graph database (RDF triple store), SPARQL query language and knowledge graph algorithms is a plus - Familiarity with cloud-hosted distributed computing (AWS EC2) - Familiarity with knowledge graph inference and interfacing with Neural Network is a plus Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.
TECHNICAL REQUIREMENTS: R (Rshiny, ggplot2, tidyverse, ggplot2, plotly) Basic familiarity with concepts in machine learning PostgreSQL Engineering best practices: git, agile/kanban methodologies, code documentation Experience designing relational databases/ERDs (nice to have) Experience running ML algorithms in AWS or similar platforms (nice to have) Experience working or interning on the data science team at another high performing company (nice to have) CONTACT: Our company is head-quartered in Houston, TX but has employees in Austin, NYC and in Philadelphia. In light of current conditions, we are open to hiring remotely at first provided you can work on Eastern Standard Time, and can then relocate to one of our Texas locations. If interested please reach out ASAP - moving quickly. We are an equal opportunity employer. Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.
Data Scientist/Consultant - Machine Learning, Deep Learning, NLP, Python, AWS, German, Munich This is a great opportunity to work for a leading data-driven consultancy in Munich. This company develops big data and AI solutions for international companies. A talented team of data scientists, data engineers and application developers implement innovation projects using advanced analytics and machine learning. The business is looking for talented Data Scientists and Consultants to join the business and work on some exciting projects with customers. In this role, you would identify and evaluate use cases and advise customers on the use of advanced data analytics and machine learning methods. You would develop modern AI environments to prepare data for machine learning, train and select machine learning models. Evaluating the latest methods and technologies is also an important part of this role; allowing you to influence and drive solutions for customers. The business is looking for someone who has a degree in computer science, data science, machine learning or related fields. You should have strong experience of working as a data scientist with a focus on advanced analytics, machine learning and deep learning. Advanced programming experience in Python and SQL is also required. Experience in implementing machine learning models with a range of frameworks (tensorflow, pytorch etc..) is important, as well as knowledge in continuous integration and deployment tools too. As a customer facing role, fluency in German is absolutely essential; as well as being willing to travel to customer sites in the Munich area (when restrictions are lifted to allow this of course). If this role is of interest to you and you'd be keen to find out more, please apply as soon as possible and we'll schedule a call soon. Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.
Our client is a hyper growth start-up that's focussed on paraphrase generation (amongst many other things writing related).. This company has 5m MAU and a team size of 40. The b2c writing and research platform leverages state-of-the-art NLP to deliver human-in-the-loop products which expedite and improve the quality of writing and research. Their Founder reached to let us know that they need to find a Director of NLP Research who has worked on cutting edge NLG language models (GPT, seq2seq, T5, etc.), has translation expertise, deep learning and someone that can both train GPT-3 from scratch and know what's going on under the hood! For context they have already built an end to end co-writer that produces better more accurate results that GPT3… Ideal Candidate: Doctorate/Masters degree in Computer Science or equivalent experience. 10+ years of experience leading software development projects with a distinguished track record on technically demanding projects. 5+ years of experience in NLP Solid understanding of the AI/ML domain and hands-on experience building new models and deploying them to production. Has worked with deep sequence to sequence models in the context of translation or similar tasks. Has worked with SOTA language models such as T5, GPT-3, and BERT Has a deep understanding of natural language generation Strong statistical analysis skills and demonstrated experience in deriving insights from unstructured data. Is able to think creatively about novel datasets and architectures. Good command over the English language. Job Responsibilities: Lead a group of research scientists to solve previously un-tackled problems and advance the state-of-the-art in NLP Lead the execution of a high number of experiments in parallel, analyze results, and iterate. Lead the development and delivery of scalable transformer based models into production. Understand the product and collaborate with product teams to enhance the product roadmap. Bring thought leadership in architecture design and efficient research engineering processes. Proactively foresee issues and resolve them Here's what the first 30/60/90 days will look like on paper but for complete transparency, they're looking to hire someone that can leverage their experience/expertise to help define this strategy for optimal success: The first 30 days> Get to know the team, the product, etc. Understand team members strengths/weaknesses Understand current state of the research roadmap and begin to make recommendations Audit prior research and begin to make recommendations Pick low-hanging fruit based on domain expertise start designing experiment to train in-house XXL LM 60 Days> Run end-to-end experiments Take 1:1 meetings with direct reports, begin guiding their research direction/responsibilities Proactively make adjustments to current research roadmap and guide company leadership on future possibilities (including new product features based on SOTA NLP). Make recommendations on novel architectures Begin the process of training in-house XXL LM 90 days> Fully own the research team, taking responsibilities for deadlines and overall output Help direct reports set their OKRs Proactively mentor direct reports, meaningfully contribute to their growth Start setting future research roadmap Move the needle on core KPIs associated with model proficiencies on our various tasks (paraphrasing, summarization, GEC, language generations, etc.) Get in touch for more information: firstname.lastname@example.org / +1 617 480 9327 Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.