domain transfer methods find mappings to transfer points from two different data distributions, both observed empirically, for the purpose of image-to-image translation, style transfer, and domain adaption [e.g., 100, 16, 79, 59]. All these tasks can be framed unifiedly as finding a transport map between two distributions:
Several lines of techniques have been developed depending on how to represent and train the map T.
a neural network is trained to represent the drift force of the processes and a numerical ODE/SDE solver is used to simulate the process during inference. The key idea is that, by leveraging the mathematical structures of ODEs/SDEs, the continuous-time models can be trained efficiently without resorting to minimax or traditional approximate inference techniques.
map T에 π0~π1을 잇는 선을 찾아 style transfer한다. GAN는 모드 붕괴와 같은 문제가 있지만 ODE와 SDE는 그런 것도 없고 수학적인 구조로 leveraging한다.
We introduce rectified flow, a surprisingly simple approach to the transport mapping problem, which unifiedly solves both generative modeling and domain transfer. The rectified flow is an ODE model that transport distribution π0 to π1 by following straight line paths as much as possible. The straight paths are preferred both theoretically because it is the shortest path between two end points, and computationally because it can be exactly simulated without time discretization. Hence, flows with straight paths bridge the gap between one-step and continuous-time models.
rectified flow 정말 중요하다
The procedure of obtaining the rectified flow from the training data has the attractive theoretical property of 1) yielding a coupling with non-increasing transport cost jointly for all convex cost c, and 2) making the paths of flow increasingly straight and hence incurring lower error with numerical solvers. Therefore, with a reflow procedure that iteratively trains new rectified flows with the data simulated from the previously obtained rectified flow, we obtain nearly straight flows that yield good results even with the coarsest time discretization,
Rectified flow Given empirical observations of X0 ∼ π0, X1 ∼ π1, the rectified flow induced from (X0, X1) is an ordinary differentiable model (ODE) on time t ∈ [0, 1],ㅁ
which converts Z0 from π0 to a Z1 following π1. The drift force v : R d → R d is set to drive the flow to follow the direction (X1 − X0) of the linear path pointing from X0 to X1 as much as possible, by solving a simple least squares regression problem:
where Xt is the linear interpolation of X0 and X1. Naviely, Xt follows the ODE of dXt = (X1 − X0)dt, which is non-causal (or anticipating) as the update of Xt requires the information of the final point X1. By fitting the drift v with X1 − X0, the rectified flow causalizes the paths of linear interpolation Xt , yielding an ODE flow that can be simulated without seeing the future.
x_t는 x0과 x1의 선형 보간이다. dxt=(x1-x0)dt의 ODE를 따르고 Xt의 업데이트가 최종 포인트 x1의 정보를 필요해서 비인과적(예상)이다. drift v를 맞추면 xt의 경로를 인과화하여 future를 보지 않고도 생성할 수 있다.
In practice, we parameterize v with a neural network or other nonlinear models and solve (1) with any offthe-shelf stochastic optimizer, such as stochastic gradient descent, with empirical draws of (X0, X1). See Algorithm 1. After we get v, we solve the ODE starting from Z0 ∼ π0 to transfer π0 to π1, backwardly starting from Z1 ∼ π1 to transfer π1 to π0. Specifically, for backward sampling, we simply solve dX˜ t = −v(X˜ t , t)dt initialized from X˜ 0 ∼ π1 and set Xt = X˜ 1−t . The forward and backward sampling are equally favored by the training algorithm, because the objective in (1) is time-symmetric in that it yields the equivalent problem if we exchange X0 and X1 and flip the sign of v.