Modeling data distributions is challenging; DDN adopts an approach that is simple yet fundamentally different from mainstream generative models (diffusion, GAN, VAE, autoregressive models):
1. The model generates multiple outputs simultaneously in a single forward pass, rather than just one output (see the sketch after this list).
2. It uses these multiple outputs to approximate the target distribution of the training data.
3. Together, these outputs represent a discrete distribution. This is why we named it "Discrete Distribution Networks".
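A minimal sketch of this idea in PyTorch, under stated assumptions: `ToyDDNLayer`, `ddn_loss`, the layer widths, and `K = 64` are all illustrative choices, not the paper's implementation, and the hierarchical stacking of layers is omitted. The point is only to show one forward pass emitting K candidates and the loss training the candidate nearest to each ground-truth sample, so the K outputs spread out to cover the data distribution.

```python
import torch
import torch.nn as nn

K = 64  # number of outputs produced in a single forward pass (assumed value)

class ToyDDNLayer(nn.Module):
    """A single toy layer; DDN stacks such layers hierarchically."""
    def __init__(self, dim=784, k=K):
        super().__init__()
        self.k = k
        # One shared backbone whose head emits K candidates at once.
        self.net = nn.Sequential(
            nn.Linear(dim, 256), nn.ReLU(), nn.Linear(256, dim * k)
        )

    def forward(self, x):                                # x: (B, dim)
        return self.net(x).view(x.shape[0], self.k, -1)  # (B, K, dim)

def ddn_loss(candidates, target):
    """Pull only the candidate nearest to each ground-truth sample;
    together the K outputs then approximate a discrete distribution."""
    dist = ((candidates - target.unsqueeze(1)) ** 2).mean(dim=2)  # (B, K)
    idx = dist.argmin(dim=1)                     # winner index per sample
    batch = torch.arange(len(idx), device=idx.device)
    winner = candidates[batch, idx]              # (B, dim)
    return ((winner - target) ** 2).mean()
```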
Every generative model has its unique properties, and DDN is no exception. Here, we highlight three characteristics of DDN:
- Zero-Shot Conditional Generation (ZSCG), sketched below.
- A one-dimensional discrete latent representation organized in a tree structure.
- Fully end-to-end differentiable.
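A hedged sketch of how ZSCG can work, assuming the toy layer above: at every level of the hierarchy we pick, among the K candidates, the one that best matches an external condition (here, the known pixels of a masked image), with no retraining and no gradients. `model.levels` and the greedy selection rule are illustrative assumptions.

```python
import torch

@torch.no_grad()
def zscg_inpaint(model, observed, mask):
    """observed: (D,) partially known image; mask: (D,) 1 = known pixel."""
    x = torch.zeros_like(observed)        # start from a blank canvas
    for level in model.levels:            # coarse-to-fine DDN levels (assumed attr)
        candidates = level(x.unsqueeze(0))[0]            # (K, D), one pass
        # Score candidates only on the known pixels of the condition.
        err = (((candidates - observed) * mask) ** 2).sum(dim=1)
        x = candidates[err.argmin()]      # greedy choice steers generation
    return x                              # completed image
```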
Reviews from ICLR:
> I find the method novel and elegant. The novelty is very strong, and this should not be overlooked. This is a whole new method, very different from any of the existing generative models.

> This is a very good paper that can open a door to new directions in generative modeling.
Much like DiffusionDet, which applies diffusion models to detection, DDN can adopt the same philosophy. I expect DDN to offer several advantages over diffusion-based approaches (the uncertainty point is sketched after this list):

- A single forward pass obtains the result; no iterative denoising is required.
- If multiple samples are needed (e.g., for uncertainty estimation), DDN can directly produce multiple outputs in one forward pass.
- Constraints are easy to impose during generation thanks to DDN's Zero-Shot Conditional Generation capability.
- DDN supports more efficient end-to-end optimization, and is thus more suitable for integration with discriminative models and reinforcement learning.
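To illustrate the uncertainty point: since one DDN forward pass already yields K samples, a point estimate and a dispersion map come from their statistics directly. `ddn` is a stand-in for any model returning `(B, K, D)` candidates, as in the toy layer above; this is a sketch, not the paper's procedure.

```python
import torch

@torch.no_grad()
def predict_with_uncertainty(ddn, x):
    candidates = ddn(x)              # (B, K, D) from a single forward pass
    mean = candidates.mean(dim=1)    # point estimate
    std = candidates.std(dim=1)      # disagreement across the K outputs
    return mean, std
```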