Intermediate Data Scientist
Provide a description of the typical day to day in this role.
• Attending the daily sprint stand-up with the program/project team.
• Working on NLP models – training/retraining/evaluating/auditing
• Setting up labelling tasks, meeting with the business to review label criteria, and meeting with the offshore labellers
• Completing documentation on the models
• Working with data analysts to validate the model results and the input/output logic
• Working towards integrating LLMs to solve business problems
• Presenting work to the business stakeholders who will leverage these models
• Creating and maintaining NLP models using Hugging Face Transformers
• Working on Google Cloud, following cloud best practices
• Knowledge of prompt engineering and LLMs (large language models), including PEFT and RAG, is an asset
• Generating new use case ideas to solve business problems/improve results
Top 3 skill sets and qualifications:
a. Building NLP models (BERT, ELECTRA)
b. Working with large datasets (SQL, Python, any modelling libraries)
c. Anything with LLMs (large language models like ChatGPT) or generative AI