-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
84ae361
commit 3b4b5a1
Showing
2 changed files
with
57 additions
and
0 deletions.
There are no files selected for viewing
Binary file added
BIN
+119 KB
posts/2023-12-25-ai-robots-are-mastering-tasks-faster-than-humans-/IMG_8503.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
57 changes: 57 additions & 0 deletions
57
posts/2023-12-25-ai-robots-are-mastering-tasks-faster-than-humans-/index.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,57 @@ | ||
--- | ||
title: AI Robots are mastering tasks faster than humans | ||
description: | | ||
From walking to labyrinth, the future looks bright for these machines. The key is the objective functions, time, and training data. | ||
date: 2023-12-25 | ||
image: photo.jpeg | ||
categories: [ai, tech, futurism] | ||
draft: false | ||
--- | ||
|
||
::: callout-tip | ||
## With a proper objective function, machines can do really hard, isolated tasks | ||
|
||
- they can walk | ||
- they can solve complex physical challenges | ||
- they can play the toughest strategic games on earth and win (AlphaGo 2017, AlphaStar 2019) | ||
- but can they determine the objective function by themselves? | ||
|
||
::: | ||
|
||
# Reinforcement Learning | ||
|
||
Great article written [at Techradar](https://www.techradar.com/computing/artificial-intelligence/my-jaw-hit-the-floor-when-i-watched-an-ai-master-one-of-the-worlds-toughest-physical-games-in-just-six-hours) going over the advances in reinforcement learning that allow a robot to finish labyrinth faster than humans. The video does the feat even more justice: | ||
|
||
{{< video https://youtu.be/zQMKfuWZRdA?si=lWEaMy_BhKQGDDQd >}} | ||
|
||
This stands out to me in a few ways, principally that they crippled the machine to not take shortcuts. It still won. | ||
|
||
Reminds me first of AlphaGo: | ||
|
||
{{< video https://youtu.be/WXuK6gekU1Y?si=UGZea1kUWNV_4iTr >}} | ||
|
||
Which reminds me of [AlphaStar](https://deepmind.google/discover/blog/alphastar-mastering-the-real-time-strategy-game-starcraft-ii/). Here's a video with commentary on AlphaStar's performance: | ||
|
||
{{< video https://youtu.be/6EQAsrfUIyo?si=t3x_Nn8Mr2SDJ9uH >}} | ||
|
||
The thing I connect this video (or some other I watched) with labyrinth is that AlphaStar devised brand new strategies. Players against AlphaStar remarked how they felt like it was indeed being creative. Likewise, Cyberrunner (labyrinth) found shortcuts. | ||
|
||
All of this points to reinforcement learning as a viable solution to getting stuff done both in the digital and physical worlds. Take, for example, how RL is used to teach a [robot how to walk](https://interestingengineering.com/innovation/humanoid-taught-itself-to-walk). | ||
|
||
![An AI robot learning to play StarCraft for the first time...](photo.jpeg){.preview-image} | ||
|
||
# What's the objective? | ||
|
||
The goal for Super AI will be to figure out what the objective is. If we can marry digital and physical worlds, teaching the systems how to solve objective functions, the possibilities become...super/human. | ||
|
||
The unique thing about a human, it seems, is that we have our objective function hardwired into us: survive. First this is "food/shelter". Next it's social survival. | ||
|
||
How do humans figure out objective functions so fast? A human looking at labyrinth will know what to do within a second. Does it have interest/patience to sit there and learn it? Does GPT 4Vision have the ability to figure out these objective functions? | ||
|
||
We learn by objective functions. Minimize wrong answers on a test. Minimize missed baskets on the basketball court. Minimize rejections in a sales funnel. Maximize great conversation and connection at a dinner party. | ||
|
||
Humans learn very quickly. For now, robots are being isolated to specific tasks and their results are impressive. But perhaps to become our overlords all they need is the one objective function to rule them all. Er, uhm, and by "them" I of course don't mean "us"... | ||
|
||
|
||
|
||
|