NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model
Keito Sasagawa
|
Koki Maeda
|
Issa Sugiura
|
Shuhei Kurita
|
Naoaki Okazaki
|
Daisuke Kawahara
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
WS |
Citations
URL
No Citations Yet
https://huggingface.co/llm-jp/llm-jp-3-vila-14b
https://github.com/llm-jp/llm-jp-VILA
https://huggingface.co/datasets/turing-motors/LLaVA-
https://github.com/JohannesBuchner/imagehash
https://github.com/mlfoundations/dataset2metadata
https://huggingface.co/laion/CLIP-ViT-H-14-frozen-
https://pypi.org/project/lapjv/
https://github.com/HojiChar/HojiChar
https://huggingface.co/line-corporation/clip-japanese-
https://huggingface.co/datasets/ThePioneer/japanese-
https://huggingface.co/google/siglip-so400m-patch14-
https://huggingface.co/llm-jp/llm-jp-3-13b-instruct
https://huggingface.co/datasets/liuhaotian/LLaVA-
https://huggingface.co/datasets/SakanaAI/JA-VG-VQA-
https://huggingface.co/datasets/SakanaAI/JA-VG-VQA-
https://platform.openai.com/docs/advanced-
https://cloud.google.com/vertex-ai/generative-
https://huggingface.co/datasets/turing-motors/LLaVA-
https://github.com/kakaobrain/
Field Of Study