M3AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Zhe Chen | Heyang Liu | Wenyi Yu | Guangzhi Sun | Hongcheng Liu | Ji Wu | Chao Zhang | Yu Wang | Yanfeng Wang
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL
Citations: No Citations Yet
URL:
https://jack-zc8.github.io/
https://github.com/allenai/science-parse
https://azure.microsoft.com/en-us/products/
https://github.com/openai/whisper
https://github.com/SpeechColab/Leaderboard
https://github.com/PaddlePaddle/PaddleOCR
https://mathpix.com/ocr
https://huggingface.co/models?pipeline_tag=
https://catalog.ngc.nvidia.com/orgs/nvidia/
https://huggingface.co/facebook/
https://huggingface.co/suno/bark
https://huggingface.co/microsoft/speecht5_tts
https://huggingface.co/jonatasgrosman/
https://docs.nvidia.com/deeplearning/
https://docs.nvidia.com/deeplearning/nemo/
https://github.com/MontrealCorpusTools/
https://github.com/PaddlePaddle/PaddleOCR/
https://huggingface.co/sentence-transformers/