In this PhD research, we aim to bridge the gap between the two approaches
by developing a theory of informed RL, where information—also called advice—
is generated and used in a way that limits performance losses. In our
view, an informed-RL algorithm can obtain advice in various ways: it could be derived
from trajectories run on a simulator, from knowledge shared by a trained
agent, from expert input, from learning to solve a related task, or from any other
available source of knowledge.
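To make the notion of "using advice" concrete, the following is a minimal sketch, not part of the proposal itself: a tabular Q-learning agent on a toy chain MDP that, with some probability, follows an external advice function instead of its own policy. The environment, the `advice` function (here, a hand-coded optimal hint), and the probability `p_advice` are all illustrative assumptions.

```python
import random

random.seed(0)

# Toy chain MDP: states 0..4, actions 0 (left) / 1 (right); reward 1 on reaching state 4.
N_STATES, GOAL = 5, 4

def step(s, a):
    s2 = min(s + 1, GOAL) if a == 1 else max(s - 1, 0)
    return s2, (1.0 if s2 == GOAL else 0.0), s2 == GOAL

def advice(s):
    # Hypothetical external advice source: always suggests moving right
    # (optimal in this toy chain). In general, advice may be limited or inexact.
    return 1

def q_learning(episodes=200, alpha=0.5, gamma=0.9, eps=0.2, p_advice=0.5):
    Q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            if random.random() < p_advice:
                a = advice(s)            # follow the external advice
            elif random.random() < eps:
                a = random.randrange(2)  # explore on the agent's own
            else:
                a = 0 if Q[s][0] >= Q[s][1] else 1  # greedy w.r.t. Q
            s2, r, done = step(s, a)
            # Standard Q-learning update; advice only biases action selection.
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
            s = s2
    return Q

Q = q_learning()
policy = [0 if q[0] >= q[1] else 1 for q in Q]
```

Note that in this sketch the advice only biases exploration; the update rule is unchanged, so inexact or adversarial advice would slow learning but not corrupt the fixed point, which is exactly the kind of performance-loss question the first task addresses.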
The first task is to investigate the impact of limited, inexact, and adversarial
advice. The second task focuses on generating the advice itself and on
understanding the interplay between its generation and its use. Finally,
the third task applies the resulting theory to real industrial data.