Latest NVIDIA Generative AI Multimodal - NCA-GENM Free Exam Questions
You are tasked with evaluating a multimodal AI model that combines image and text inputs to generate product descriptions. You observe that the model performs well on common product categories (e.g., clothing, electronics) but struggles with niche categories (e.g., antique furniture, scientific instruments). Which of the following strategies would be MOST effective in improving the model's performance on niche categories?
Correct answer: B
You are building a text-to-image generation pipeline using CLIP and a diffusion model. After training, you notice that the generated images often lack the specific details mentioned in the text prompts. Which of the following strategies could you employ to improve the alignment between text and image?
Correct answer: A
A multimodal AI model is trained on a dataset containing biased text and images. This bias leads the model to generate outputs that reinforce negative stereotypes. Which of the following steps are crucial for addressing and mitigating this bias during the model development lifecycle? (Select TWO)
Correct answer: B, C
You are deploying a multimodal generative AI model on a cloud platform. The model takes video and text as input to generate video descriptions. It must be monitored to ensure it meets its performance SLAs. Which of the following metrics are MOST crucial to monitor in a production environment to ensure both computational efficiency and output quality? (Select TWO)
Correct answer: B, C
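As a rough illustration of the efficiency side of SLA monitoring, tail latency is one metric commonly checked against a threshold. The sketch below is an assumption-laden example, not the exam's answer: the sample latencies, the `percentile` helper, and the 300 ms threshold are all hypothetical.

```python
import math


def percentile(samples, q):
    """Nearest-rank percentile: smallest value with at least q% of samples at or below it."""
    if not samples:
        raise ValueError("need at least one sample")
    ordered = sorted(samples)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


# Hypothetical per-request inference latencies in milliseconds.
latencies_ms = [120, 135, 110, 480, 125, 130, 140, 115, 128, 132]

# Hypothetical SLA: 95th-percentile latency must stay under 300 ms.
SLA_P95_MS = 300

p50 = percentile(latencies_ms, 50)
p95 = percentile(latencies_ms, 95)
sla_breached = p95 > SLA_P95_MS

print(f"p50={p50} ms, p95={p95} ms, SLA breached: {sla_breached}")
```

The median here looks healthy while the 95th percentile does not, which is why tail percentiles rather than averages are usually the values compared against an SLA.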
You are fine-tuning a large pre-trained language model for a specific downstream task. During training, you observe that the model performs well on the training data but generalizes poorly to the validation data. Which of the following strategies could help improve the model's generalization performance?
Correct answer: B, C, D, E
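The gap described in this question (good training performance, poor validation performance) is the classic sign of overfitting. One widely used remedy is early stopping on validation loss, sketched below as a generic illustration rather than a statement of which options are correct. The function name, the `patience` value, and the loss values are hypothetical.

```python
def early_stopping(val_losses, patience=2, min_delta=0.0):
    """Return (stop_epoch, best_epoch) for a sequence of validation losses.

    Training stops once the loss has failed to improve by more than
    min_delta for `patience` consecutive epochs.
    """
    best_loss = float("inf")
    best_epoch = -1
    bad_epochs = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            best_loss = loss
            best_epoch = epoch
            bad_epochs = 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:
                return epoch, best_epoch
    # Ran out of epochs without triggering the stop condition.
    return len(val_losses) - 1, best_epoch


# Hypothetical validation losses: improving, then rising as the model overfits.
val_losses = [0.90, 0.70, 0.60, 0.62, 0.65, 0.70]
stop_epoch, best_epoch = early_stopping(val_losses, patience=2)
print(f"stop at epoch {stop_epoch}, restore checkpoint from epoch {best_epoch}")
```

In practice the checkpoint from `best_epoch` would be restored, so the deployed weights are the ones that generalized best rather than the last ones trained.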
Which of the following are key challenges specific to training multimodal models compared to unimodal models? (Select TWO)
Correct answer: C, D
You are building a multimodal application that needs to understand both image and text data. You want to use a pre-trained model but fine-tune it for your specific task. Which of the following strategies is MOST effective for fine-tuning a large pre-trained multimodal model?
Correct answer: A
You're training a multimodal model for generating stories from images and audio. You use a Transformer architecture. During training, you notice that the model struggles to maintain long-range dependencies in the generated stories, leading to incoherent narratives. Which of the following techniques would be MOST effective in addressing this issue within the Transformer architecture?
Correct answer: D