Update README.md
Browse files
README.md
CHANGED
@@ -5,6 +5,8 @@ language:
|
|
5 |
---
|
6 |
## Open-MAGVIT2: Democratizing Autoregressive Visual Generation
|
7 |
|
|
|
|
|
8 |
Until now, VQGAN, the initial tokenizer is still acting an indispensible role in mainstream tasks, especially autoregressive visual generation. Limited by the bottleneck of the size of codebook and the utilization of code, the capability of AR generation with VQGAN is underestimated.
|
9 |
|
10 |
Therefore, [MAGVIT2](https://arxiv.org/abs/2310.05737) proposes a powerful tokenizer for visual generation task, which introduces a novel LookUpFree technique when quantization and extends the size of codebook to $2^{18}$, exhibiting promising performance in both image and video generation tasks. And it plays an important role in the recent state-of-the-art AR video generation model [VideoPoet](https://arxiv.org/abs/2312.14125). However, we have no access to this strong tokenizer so far. ☹️
|
|
|
5 |
---
|
6 |
## Open-MAGVIT2: Democratizing Autoregressive Visual Generation
|
7 |
|
8 |
+
[[Project Page]](https://github.com/TencentARC/Open-MAGVIT2)
|
9 |
+
|
10 |
Until now, VQGAN, the initial tokenizer is still acting an indispensible role in mainstream tasks, especially autoregressive visual generation. Limited by the bottleneck of the size of codebook and the utilization of code, the capability of AR generation with VQGAN is underestimated.
|
11 |
|
12 |
Therefore, [MAGVIT2](https://arxiv.org/abs/2310.05737) proposes a powerful tokenizer for visual generation task, which introduces a novel LookUpFree technique when quantization and extends the size of codebook to $2^{18}$, exhibiting promising performance in both image and video generation tasks. And it plays an important role in the recent state-of-the-art AR video generation model [VideoPoet](https://arxiv.org/abs/2312.14125). However, we have no access to this strong tokenizer so far. ☹️
|