Melanie Mitchell


29 Quotes

"To solve this problem, they believe, we must find ways to align AI systems with human preferences, goals and values."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"any years ago, I learned to program on an old Symbolics Lisp Machine. The operating system had a built-in command spelled “DWIM,” short for “Do What I Mean.”"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"If I typed a command and got an error, I could type “DWIM,” and the machine would try to figure out what I meant to do. A surprising fraction of the time, it actually worked."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Another enterprising programmer wanted his Roomba vacuum cleaner to stop bumping into furniture, so he connected the Roomba to a neural network that rewarded speed but punished the Roomba when the front bumper collided with something. The machine accommodated these objectives by always driving backward."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"In fact, they believe that the machines’ inability to discern what we really want them to do is an existential risk"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"“An entity is considered to be intelligent, roughly speaking, if it chooses actions that are expected to achieve its objectives, given what it has perceived.”"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Bostrom based his view of AI’s risks on two theses."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"The first is the orthogonality thesis, which states, in Bostrom’s words, “Intelligence and final goals are orthogonal axes along which possible agents can freely vary."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"In other words, more or less any level of intelligence could in principle be combined with more or less any final goal.”"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"The second is the instrumental convergence thesis, which implies that an intelligent agent will act in ways that promote its own survival, self-improvement and acquisition of resources, so long as these make the agent more likely to achieve its final goal."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Then he made one final assumption: Researchers would soon create an AI superintelligence — one that “greatly exceeds the cognitive performance of humans in virtually all domains of interest.”"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"For Bostrom and others in the AI alignment community, this prospect spells doom for humanity unless we succeed in aligning superintelligent AIs with our desires and values."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"If you believe that intelligence is defined by the ability to achieve goals, that any goal could be “inserted” by humans into a superintelligent AI agent, and that such an agent would use its superintelligence to do anything to achieve that goal, then you will arrive at the same conclusion that Russell did:"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"“All that is needed to assure catastrophe is a highly competent machine combined with humans who have an imperfect ability to specify human preferences completely and correctly.”"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"It turns out that there’s little overlap between the communities concerned primarily with such short-term risks and those who worry more about longer-term alignment risks."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Many researchers are actively engaged in alignment-based projects, ranging from attempts at imparting principles of moral philosophy to machines, to training large language models on crowdsourced ethical judgments."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Many writers have noted the many obstacles preventing machines from learning human preferences and values"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"After all, it’s not clear whose values we should have machines try to learn."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Many in the alignment community think the most promising path forward is a machine learning technique known as inverse reinforcement learning (IRL). With IRL, the machine is not given an objective to maximize; such “inserted” goals, alignment proponents believe, can inadvertently lead to paper clip maximizer scenarios."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Instead, the machine’s task is to observe the behavior of humans and infer their preferences, goals and values."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"However, I think this underestimates the challenge. Ethical notions such as kindness and good behavior are much more complex and context-dependent than anything IRL has mastered so far."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Other ethical concepts are just as complex. It should be clear that an essential first step toward teaching machines ethical concepts is to enable machines to grasp humanlike concepts in the first place, which I have argued is still AI’s most important open problem."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Most discussions imagine a superintelligent AI as a machine that, while surpassing humans in all cognitive tasks, still lacks humanlike common sense and remains oddly mechanical in nature."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"And importantly, in keeping with Bostrom’s orthogonality thesis, the machine has achieved superintelligence without having any of its own goals or values, instead waiting for goals to be inserted by humans."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"Yet could intelligence work this way? Nothing in the current science of psychology or neuroscience supports this possibility."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"In humans, at least, intelligence is deeply interconnected with our goals and values, as well as our sense of self and our particular social and cultural environment."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"From what we know, it seems much more likely that a generally intelligent AI system’s goals could not be easily inserted, but would have to develop, like ours, as a result of its own social and cultural upbringing."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"“The right time to worry about a potentially serious problem for humanity depends not just on when the problem will occur but also on how long it will take to prepare and implement a solution.”"
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine
"But without a better understanding of what intelligence is and how separable it is from other aspects of our lives, we cannot even define the problem, much less find a solution."
Melanie Mitchell
What Does It Mean to Align AI With Human Values? | Quanta Magazine

Want to Save Quotes?

Glasp is a social web highlighter that people can highlight and organize quotes and thoughts from the web, and access other like-minded people’s learning.