Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about underlay effectiveness of Layoutprompter #45

Open
theKinsley opened this issue Sep 9, 2024 · 1 comment
Open

Question about underlay effectiveness of Layoutprompter #45

theKinsley opened this issue Sep 9, 2024 · 1 comment

Comments

@theKinsley
Copy link

Thank you for such a great work. Recently, I delved into the paper and the code provided for the content-aware layout generation task, and it appears that Layoutprompter handles the underlay element in a manner quite consistent with the treatment of other elements. Furthermore, within the Ranker module, the overlap of the underlay with other elements is actually subject to a penalty. This raises some surprise regarding the high Underlay effectiveness reported in the paper.

Considering that GPT-3 text-davinci-003 is no longer supported, I have chosen to use Llama 3 as the LLM component for running Layoutprompter. The outcomes revealed a notably low underlay effectiveness, particularly the strict effectiveness, which is nearly 0. I am wondering if there might be specialized Content-aware Ranker code that has not been released to the community. If such code is available, I would be very keen to learn about it.

Your insights on this matter would be greatly appreciated.

@kaoshizhou
Copy link
Contributor

Thanks for your interest in our work. Currently, we use unified Ranker module for different layout generation tasks. Actually, we have used the language command to guide LLM generating underlay elements as the background of other elements. And we empirically found that GPT-3 text-davinci-003 could well comply this.

Given your experimental results on LLaMa3, we speculate that this might be due to the capability gap of different LLMs (i.e., GPT-3 could be better at such task). And we highly recommend that you could try other available LLMs to reproduce the results.

Finally, I personally agree that developing tailored Ranker module for content-aware layout generation could be useful and worth a try (e.g., encourage high overlap between underlay and other elements). And it promises to improve the two Und metrics.

Hope this helps you and feel free to ask if you have other questions :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants