Articles Collectés

873

Total

864

Non traités

9

Traités

Non traitée 07/02/2018 09:00
Discovering types for entity disambiguation

RSS: OpenAI News

We’ve built a system for automatically figuring out which object is meant by a word by having a neural network decide if the word belongs to each of about 100 automatically-discovered “types” (n...

Non traitée 31/01/2018 09:00
Requests for Research 2.0

RSS: OpenAI News

We’re releasing a new batch of seven unsolved problems which have come up in the course of our research at OpenAI.

Non traitée 18/01/2018 09:00
Non traitée 06/12/2017 09:00
Block-sparse GPU kernels

RSS: OpenAI News

We’re releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run o...

Non traitée 04/12/2017 09:00
Non traitée 02/11/2017 08:00
Non traitée 26/10/2017 09:00
Learning a hierarchy

RSS: OpenAI News

We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. O...

Non traitée 19/10/2017 09:00
Generalizing from simulation

RSS: OpenAI News

Our latest robotics techniques allow robot controllers, trained entirely in simulation and deployed on physical robots, to react to unplanned changes in the environment as they solve simple tasks. Tha...

Non traitée 18/10/2017 09:00
Non traitée 18/10/2017 09:00
Non traitée 17/10/2017 09:00
Non traitée 11/10/2017 09:00
Competitive self-play

RSS: OpenAI News

We’ve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without explicitly designing an environment wi...

Non traitée 11/10/2017 09:00
Meta-learning for wrestling

RSS: OpenAI News

We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent, and also show that the meta-learning agent can adapt to ph...

Non traitée 29/09/2017 09:00
Non traitée 14/09/2017 09:00
Learning to model other minds

RSS: OpenAI News

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner’s ...

Non traitée 13/09/2017 09:00
Non traitée 18/08/2017 09:00
OpenAI Baselines: ACKTR & A2C

RSS: OpenAI News

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal perf...

Non traitée 16/08/2017 09:00
More on Dota 2

RSS: OpenAI News

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system w...

Non traitée 11/08/2017 09:00
Dota 2

RSS: OpenAI News

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitati...

Non traitée 03/08/2017 09:00
Gathering human feedback

RSS: OpenAI News

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towa...