19

A handbook to demystify algorithmic differentiation

 4 years ago
source link: https://www.tuicool.com/articles/Y7NNjaq
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Differentiation for Hackers

The goal of this handbook is to demystify algorithmic differentiation , the tool that underlies modern machine learning. It begins with a calculus-101 style understanding and gradually extends this to build toy implementations of systems similar to PyTorch and TensorFlow. I have tried to clarify the relationships between every kind of differentiation I can think of – including forward and reverse, symbolic, numeric, tracing and source transformation. Where typical real-word ADs are mired in implementation details, these implementations are designed to be coherent enough that the real, fundamental differences – of which there are surprisingly few – become obvious.

The intro notebook is recommended to start with, but otherwise notebooks do not have a fixed order.

  • Intro – explains the basics, beginning with a simple symbolic differentiation routine.
  • Back & Forth – discusses the difference between forward and reverse mode AD.
  • Forward – discusses forward-mode AD and its relationship to symbolic and numerical differentiation.
  • Tracing – discusses tracing-based implementations of reverse mode, as used by TensorFlow and PyTorch.
  • Reverse – discusses a more powerful reverse mode based on source transformation (not complete).

If you want to run the notebooks locally, they can be built by running the src/notebooks.jl script using Julia. They should appear inside a /notebooks folder. Alternatively, you can run through the scripts in Juno.


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK