NCA-GENM 試験問題を無料オンラインアクセス
試験コード: | NCA-GENM |
試験名称: | NVIDIA Generative AI Multimodal |
認定資格: | NVIDIA |
無料問題数: | 403 |
更新日: | 2025-09-04 |
Which of the following are potential benefits of using multi-modal learning compared to single-modal learning? (Select all that apply)
You are working on a multimodal model for video captioning, where the model needs to generate captions describing the actions and events happening in a video. You notice that the model tends to focus only on the most salient objects in the scene and ignores subtle but important actions. Which of the following techniques can help the model attend to these subtle actions and generate more comprehensive captions?
Consider the following code snippet used within a U-Net architecture. What is its purpose?
torch.cat ([up, skip], dim=1)
You are building a multimodal application that analyzes images and generates descriptive captions. The application needs to handle noisy images and maintain caption consistency. Which of the following techniques would be MOST effective in achieving this?